Machine Learning — Getting Data Into Right Shape by Andrejus Baranovskis
Blog: PaaS Community
When you build machine learning model, first start with the data — make sure input data is prepared well and it represents true state of what you want machine learning model to learn. Data preparation task takes time, but don’t hurry — quality data is a key for machine learning success. In this post I will go through essential steps required to bring data into right shape to feed it into machine learning algorithm.
Sample dataset and Python notebook for this post can be downloaded from my GitHub repo.
Each row from dataset represents invoice which was sent to customer. Original dataset extracted from ERP system comes with five columns:
customer — customer ID
invoice_date — date when invoice was created
payment_due_date — expected invoice payment date
payment_date — actual invoice payment date
grand_total — invoice total
Read the complete article here.
For regular information on Oracle PaaS become a member in the SOA & BPM Partner Community for registration please visit www.oracle.com/goto/emea/soa (OPN account required) If you need support with your account please contact the Oracle Partner Business Center.
Blog Twitter LinkedIn Facebook Wiki
Technorati Tags: SOA Community,Oracle SOA,Oracle BPM,OPN,Jürgen Kress
Leave a Comment
You must be logged in to post a comment.