Skip to content

The aim of this project is to develop a machine learning model to predict the levels of CO in the air using historical datasets containing atmospheric variables. The project makes use of variables selection, decision trees, and cross-validation techniques to ensure robustness and model accuracy.

License

Notifications You must be signed in to change notification settings

Scrayil/Air-Quality-Prediction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Air Quality Prediction

The aim of this project is to develop a machine learning model to predict the levels of CO in the air using historical datasets containing atmospheric variables. The project makes use of variables selection, decision trees, and cross-validation techniques to ensure robustness and model accuracy.
Data taken from: https://www.kaggle.com/datasets/fedesoriano/air-quality-data-set

Project phases

  1. Data preprocessing
    • Handling missing values
    • Time-related data normalization
    • Quantile based outliers capping
  2. Variable selection
    • Using Lasso regression
  3. Model training
    • Decision trees (RandomForest) and cross-validation
  4. Model evaluation
    • RMSE as an accuracy measure
    • Variables importance visualization
  5. Results visualization
    • Predictions plots
    • Trends plots

License

Copyright 2024 Mattia Bennati
Licensed under the GNU GPL V2: https://www.gnu.org/licenses/old-licenses/gpl-2.0.html

About

The aim of this project is to develop a machine learning model to predict the levels of CO in the air using historical datasets containing atmospheric variables. The project makes use of variables selection, decision trees, and cross-validation techniques to ensure robustness and model accuracy.

Topics

Resources

License

Stars

Watchers

Forks

Languages