Guidelines to deploy AI models in different cloud providers aligned with green AI goals.
The repository is structured as follows:
- app | API, schemas - models | This folder contains our trained or pretrained models - notebooks | This folder contains the jupyter notebooks - reports | Generated PDFs, graphics and figures to be used in reporting - utils | Python functions - manuals | self-contained manuals - requirements.txt: The dependencies of our implementation
Guide:
- API creation. Guide to create an API to deploy ML models.
- Add pretrained model. Guide to add pretrained ML models (from HuggingFace, hdf5 format, pickle format) to do inferences through an API.
- Deploy ML models in a cloud provider (General). Guide to deploy ML models using an API in a cloud provider.
- Deploy in Virtech. Virtech setup, Guide to deploy ML models using an API in an AWS VM.
- AWS. AWS setup, Guide to deploy ML models using an API in an AWS VM.
- GCP. GCP setup, Guide to deploy ML models using an API in an GCP VM.
- Azure. Azure setup, Guide to deploy ML models using an API in an Azure VM.
- FAQ. Documentation with problems arised during deployments.
- Other. Other notes.
* Initial proposed cloud providers
- Amazon Elastic Compute Cloud (Amazon EC2) from Amazon Web Services (AWS) | URL: https://aws.amazon.com/ - Azure Virtual Machines from Microsoft Windows Azure | URL: https://azure.microsoft.com/ - Google Compute Engine from Google Cloud Platform (GCP) | URL: https://cloud.google.com/ - Virtech, UPC cloud provider (By OpenNebula) | URL: https://www.fib.upc.edu/es/la-fib/servicios-tic/cloud-docente-fib | URL: https://opennebula.io/
* Initial proposed models
- BERT model
- T5
- CodeGen
- Pythia-70m
- CNN model
- Codet5p-220m
- CodeGen
- Pythia-70m
- Codet5p-220m
See manuals/01_create_api.md to check how to create an API to deploy ML models.
- TensorFlow
- PyTorch
- ONNX
- h5, complete model
- h5, weights only
- Pickle
- CV
- NLP
- ...
Role: ML Engineer
- Data engineer: Manage DBs
- Data scientist: Train ML models
- BI: Dashboards, analytics, BI
- ML Engineer: SE --deploy--> ML systems
- codecarbon
- ...
- Track energy efficiency.
- Trade-off between green-AI related metrics and accuracy.
- Monitor models' performance
See manuals/references
ToDo:
- Add info cloud providers
- Add FastAPI info