Whether you shop from meticulously planned grocery lists or let whimsy guide your grazing, our unique food rituals define who we are. Instacart, a grocery ordering and delivery app, aims to make it easy to fill your refrigerator and pantry with your personal favorites and staples when you need them. After selecting products through the Instacart app, personal shoppers review your order and do the in-store shopping and delivery for you.
This project used data on customer orders over time to predict which previously purchased products will be in a user’s next order.
Please see the requirements.txt
file for the required packages and versions.
Please run the files in the following order:
Master Dataset.ipynb
Tranforms the Kaggle dataset into the master dataset that.EDA.ipynb
The exploratory data analysis contains some uni-variate, bi-variate analysis and some advanced visualizations. The train-test split is also done in this notebook.Feature Engineering.ipynb
Adds new features, imputes missing values and encodes the categorical variables.Updated feature selection.ipynb
Fixes the distribution shift and handles imbalances. The information leakage analysis is also done in this notebook, along with the final feature selection.Modeling.ipynb
The classification models are trained and evaluated in this notebook. Their results are interpreted and the best performing model is fine tuned and saved as a pickle file.
To view the streamlit app, please click here.