Automated approach from feature engineering to modeling on the Kaggle Home Credit Default Risk competition dataset
-
Updated
Mar 3, 2021 - Jupyter Notebook
Automated approach from feature engineering to modeling on the Kaggle Home Credit Default Risk competition dataset
Portfolio of data science projects completed by me for academic, self learning, and hobby purposes.
berserkr converts retailer data and performs machine learning modeling at scale
Files and Notebooks for Kaggle Titanic
Kaggle competition
Automl with Featuretools generate features and use tpot to select model
Classification and Oversampling Algorithms Comparison, using Deep Feature Synthesis and Feature Selection with RFE
Automated creation of EntitySets from relational data stored in SQL databases
The project provides a complete end-to-end workflow for building a binary classifier in Python to recognize the risk of housing loan default. It includes methods like automated feature engineering for connecting relational databases, comparison of different classifiers on imbalanced data, and hyperparameter tuning using Bayesian optimization.
Predict the poverty of households in Costa Rica using automated feature engineering.
Exemplary, annotated machine learning pipeline for any tabular data problem.
A simplified version of featuretools for Spark
An R interface to the Python module Featuretools
Woodwork is a Python library that provides robust methods for managing and communicating data typing information.
A collection of demos showcasing automated feature engineering and machine learning in diverse use cases
Use advanced feature engineering strategies and select best features from your data set with a single line of code. Created by Ram Seshadri. Collaborators welcome.
Add a description, image, and links to the featuretools topic page so that developers can more easily learn about it.
To associate your repository with the featuretools topic, visit your repo's landing page and select "manage topics."