Reselling cars is a crucial task in the finance sector, often requiring accurate pricing predictions based on various criteria such as brand, model, year, engine, transmission type, fuel, accident history, and title status. This project automates the prediction process using machine learning models.
The project utilizes the used_cars.csv
dataset available on Kaggle, containing relevant information for resale price prediction.
- Data Preprocessing: Handled missing values and smoothed noisy data using Inter Quartile Range Method.
- Feature Encoding: Converted categorical features into numeric ones using target encoding and one-hot encoding.
- Feature Scaling: Scaled features using min-max normalization.
- Model Evaluation: Analyzed the performance of five regression models: Linear Regression, Decision Tree Regressor, Random Forest Regressor, KNeighbor Regressor, and Support Vector Regressor.
- Model Selection: Identified KNeighbor and Random Forest Regressor as top-performing models.
- Hyperparameter Tuning: Planned for optimizing the selected models for improved performance.
- Train score
- Test score
- Mean squared error
- Mean absolute error
- R2 score
- Root mean squared error
- Perform hyperparameter tuning on the KNeighbor and Random Forest Regressor models for optimization.
- Deploy the most optimized model for the task of predicting resale prices for cars.
Special thanks to Kaggle for providing the dataset used in this project.