This repository contains a TFX (TensorFlow Extended) pipeline for building and deploying machine learning models using Vertex Pipelines in Google Cloud.
TFX is an end-to-end platform for deploying production machine learning pipelines. Vertex Pipelines is a managed service provided by Google Cloud for building, deploying, and monitoring machine learning pipelines.
This project demonstrates how to set up a TFX pipeline using Vertex Pipelines to train, validate, and deploy machine learning models on Google Cloud.
- Utilizes TFX components such as ExampleGen, Transform, Trainer, and Pusher.
- Integrates with Vertex AI for model training and deployment.
- Implements custom components for data preprocessing and feature engineering.
- Includes a sample dataset and model for demonstration purposes.
- Clone this repository:
git clone https://github.com/pradeep-016/ML_Production_deployment.git
cd tfx-vertex-pipeline
- Install the necessary dependencies:
pip install -r requirements.txt
- Configure your Google Cloud project and authenticate with the gcloud command-line tool:
gcloud init
- Set up your GCP environment variables:
export PROJECT_ID=your_project_id
export REGION=your_region
-
Customize the pipeline configuration in
pipeline/config.py
according to your project requirements. -
Run the pipeline using the following command:
python -m pipeline.run_pipeline
- Monitor the pipeline execution in the Vertex AI Console.
Contributions to this project are welcome! If you find any bugs or have suggestions for improvement, please open an issue or submit a pull request.