This repo contains the full materials for my TopTal screening assignment submission.
The README page displays the directory of all the files used and created by this assignment.
toptal_assignment
│ README.md
│ requirements.txt
└───code
│ │ EDA.ipynb
│ │ Modeling.ipynb
│ │ prediction_pipeline.py
│ │ utils.py
└───data
│ │ dataset.json
│ │ processed_data.csv
│ │ result.csv
│ │ sites_encoded.csv
│ │ verify.json
└───model
│ │ Random_Forest_model.pkl
│ │ countvec.joblib
│ │ feature_engineer_pipeline.pkl
│ │ feature_selector.pkl
│ │ sites_vectorizer.pkl
Clone the project
https://github.com/GeorgeMcIntire/toptal_assignment.git
Go to the project directory
cd toptal_assignment
Start up virtual environment
virtualenv -p python3 envname
Activate virtual environment
source envname/bin/activate
Run the following command in terminal to install the required packages
pip3 install -r requirements.txt
Make predictions on the verify.json
data.
python3 prediction_pipeline.py