Challenges
- Extremely large data – Over 2.5 million records
- Data inconsistencies
- Traffic variation at different time of the day
Solution
- Univariate analysis to identify significant variables
- Clustered clients based on number of orders from a specific vendor
- Developed various models and recommended the most suitable one
