Skip to content

Latest commit

 

History

History
14 lines (11 loc) · 553 Bytes

README.md

File metadata and controls

14 lines (11 loc) · 553 Bytes

income

Binary Classification of incomes as <50k or >50k using decision trees and random forests in R.

Data can be found here: https://www.kaggle.com/uciml/adult-census-income

Kaggle notebook: https://www.kaggle.com/lavanyask/adult-census-income-classify

The project aims at classifying incomes as <50k or >50k based on census data. It is organised as follows:

  1. Data exploration
  2. Data cleaning and preprocessing
  3. Training a decision tree model and a random forest model
  4. Evaluating the performance of model by ROC and AUC curves