business-science/ai-data-science-team


Your AI Data Science Team (🪖 An Army Of Agents)

An AI-powered data science team of agents to help you perform common data science tasks 10X faster.

Please ⭐ us on GitHub (it takes 2 seconds and means a lot).

Beta - This Python library is under active development. Breaking changes may occur until the 0.1.0 release.


The AI Data Science Team of Copilots includes Agents that specialize in data cleaning, preparation, feature engineering, modeling (machine learning), and interpretation for business problems such as:

  • Churn Modeling
  • Employee Attrition
  • Lead Scoring
  • Insurance Risk
  • Credit Card Risk
  • And more

Companies That Want A Custom AI Data Science Team (And AI Apps)

Want to have your own customized enterprise-grade AI Data Science Team and domain-specific AI-powered Apps?

Send inquiries here: https://www.business-science.io/contact.html

Generative AI for Data Scientists Workshop

If you're an aspiring data scientist who wants to learn how to build AI Agents and AI Apps for your company that perform Data Science, Business Intelligence, Churn Modeling, Time Series Forecasting, and more, then I'd love to help you.

Register for my next Generative AI for Data Scientists workshop here.

Data Science Agents

This project is a work in progress. New data science agents will be released soon.

Data Science Team

NEW: Multi-Agents

These are the internals of the SQL Data Analyst Agent, which connects to SQL databases to pull data into the data science environment. It creates pipelines to automate data extraction and performs Joins, Aggregations, and other SQL Query operations. It also includes a Data Visualization Agent that creates visualizations to help you understand your data:

Business Intelligence SQL Agent

Coming Soon: Data Science Apps

This is a top secret project I'm working on. It's a multi-agent data science app that performs time series forecasting.

Multi-Agent Data Science App

Agents Available Now

🔥🔥 NEW! Machine Learning Agents

  1. 🔥 H2O Machine Learning Agent: Builds and logs hundreds of high-performance machine learning models. See Example

Data Science Agents

  1. Data Wrangling Agent: Merges, Joins, Preps and Wrangles data into a format that is ready for data analysis. See Example
  2. Data Visualization Agent: Creates visualizations to help you understand your data. Returns JSON-serializable Plotly visualizations. See Example
  3. Data Cleaning Agent: Performs Data Preparation steps including handling missing values, outliers, and data type conversions. See Example
  4. Feature Engineering Agent: Converts the prepared data into ML-ready data. Adds features to increase predictive accuracy of ML models. See Example
  5. SQL Database Agent: Connects to SQL databases to pull data into the data science environment. Creates pipelines to automate data extraction. Performs Joins, Aggregations, and other SQL Query operations. See Example
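
To make the "Joins, Aggregations" wording in item 5 concrete, here is a minimal sketch of that kind of query using only Python's built-in sqlite3 module. It does not use the agent or show its generated code. The tables, column names, and values are hypothetical and exist only for illustration.

```python
import sqlite3

# Hypothetical in-memory tables; all names and values are illustrative only.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, segment TEXT);
    CREATE TABLE orders (order_id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL);
    INSERT INTO customers VALUES (1, 'retail'), (2, 'retail'), (3, 'enterprise');
    INSERT INTO orders VALUES (10, 1, 20.0), (11, 2, 30.0), (12, 3, 100.0), (13, 3, 50.0);
    """
)

# Join orders to customers, then aggregate order count and revenue per segment.
query = """
    SELECT c.segment, COUNT(*) AS n_orders, SUM(o.amount) AS revenue
    FROM orders AS o
    JOIN customers AS c ON c.customer_id = o.customer_id
    GROUP BY c.segment
    ORDER BY c.segment
"""
rows = conn.execute(query).fetchall()
conn.close()

print(rows)
```

Each returned row holds one segment with its order count and total revenue, which is the shape of result you would then hand to a visualization or modeling step.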

Multi-Agents

  1. SQL Data Analyst Agent: Connects to SQL databases to pull data into the data science environment. Creates pipelines to automate data extraction. Performs Joins, Aggregations, and other SQL Query operations. Includes a Data Visualization Agent that creates visualizations to help you understand your data. See Example

Agents Coming Soon

  1. Data Analyst: Analyzes data structure, creates exploratory visualizations, and performs correlation analysis to identify relationships.
  2. MLOps Agent: Tracks experiments, logs models, and deploys models to production.
  3. Interpretability Agent: Performs Interpretable ML to explain why the model returned its predictions, including which features were most important to the model.
  4. Supervisor: Forms task list. Moderates sub-agents. Returns completed assignment.

Disclaimer

This project is for educational purposes only.

  • It is not intended to replace your company's data science team
  • No warranties or guarantees provided
  • Creator assumes no liability for financial loss
  • Consult an experienced Generative AI Data Scientist for building your own custom AI Data Science Team
  • If you want a custom enterprise-grade AI Data Science Team, send inquiries here.

By using this software, you agree to use it solely for learning purposes.

Installation

pip install git+https://github.com/business-science/ai-data-science-team.git --upgrade
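
After installing, you may want to confirm that the package is visible to your Python environment. The sketch below uses the standard library's importlib.metadata. The distribution name "ai-data-science-team" is an assumption inferred from the repository name, and installed_version is a hypothetical helper, not part of this library.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(distribution_name):
    """Return the installed version string, or None if the package is absent."""
    try:
        return version(distribution_name)
    except PackageNotFoundError:
        return None


# Assumed distribution name (inferred from the repository name, not confirmed).
print(installed_version("ai-data-science-team"))
```

If this prints None, the install likely went into a different environment than the one running the script.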

Usage

See all examples here.

Example 1: Feature Engineering with the Feature Engineering Agent

See the full example here.

feature_engineering_agent = FeatureEngineeringAgent(model = llm)

feature_engineering_agent.invoke_agent(
    data_raw = df,
    user_instructions = "Make sure to scale and center numeric features",
    target_variable = "Churn",
    max_retries = 3,
)

Output:

---FEATURE ENGINEERING AGENT----
    * CREATE FEATURE ENGINEER CODE
    * EXECUTING AGENT CODE
    * EXPLAIN AGENT CODE

feature_engineering_agent.get_data_engineered()
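
For readers unsure what "scale and center numeric features" asks for, here is a minimal sketch of standardization in plain Python. It illustrates the transformation only and is not the code the agent generates. The standardize helper and the sample values are hypothetical.

```python
from statistics import mean, pstdev


def standardize(values):
    """Center to mean 0 and scale to unit (population) standard deviation."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        # A constant column has no spread; map it to zeros instead of dividing by 0.
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


monthly_charges = [20.0, 40.0, 60.0]  # hypothetical numeric feature
scaled = standardize(monthly_charges)
print(scaled)
```

After this step the feature has mean 0 and standard deviation 1, so features measured in different units contribute on a comparable scale.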

Example 2: Cleaning Data with the Data Cleaning Agent

See the full example here.

data_cleaning_agent = DataCleaningAgent(model = llm)

response = data_cleaning_agent.invoke_agent(
    data_raw = df,
    user_instructions = "Don't remove outliers when cleaning the data.",
    max_retries = 3,
)

Output:

---DATA CLEANING AGENT----
    * CREATE DATA CLEANER CODE
    * EXECUTING AGENT CODE
    * EXPLAIN AGENT CODE

data_cleaning_agent.get_data_cleaned()
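
As a rough illustration of the cleaning steps named earlier (data type conversion and missing-value handling) with outliers deliberately kept, consider the sketch below. It is not the agent's generated code. The clean_column helper and the sample column are hypothetical.

```python
from statistics import median


def clean_column(raw_values):
    """Convert entries to floats and impute missing ones with the median.

    Outliers are left untouched, mirroring the user instruction in Example 2.
    """
    parsed = []
    for value in raw_values:
        try:
            parsed.append(float(value))
        except (TypeError, ValueError):
            parsed.append(None)  # unparseable or missing entries count as missing
    observed = [v for v in parsed if v is not None]
    if not observed:
        return parsed  # nothing to impute from
    fill = median(observed)
    return [fill if v is None else v for v in parsed]


raw = ["10", "12", None, "", "11", "500"]  # hypothetical column; "500" is an outlier
cleaned = clean_column(raw)
print(cleaned)
```

The median is used for imputation here because, unlike the mean, it is barely shifted by the retained outlier.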

Contributing

  1. Fork the repository
  2. Create a feature branch
  3. Commit your changes
  4. Push to the branch
  5. Create a Pull Request

License

This project is licensed under the MIT License. See the LICENSE file for details.

Want To Become A Full-Stack Generative AI Data Scientist?

Generative AI Data Scientist

I teach Generative AI Data Science to help you build AI-powered data science apps. Register for my next Generative AI for Data Scientists workshop here.