Skip to content

Deploying post-training quantized mobinenetv2 on Googke Kubernetes Engine using flask, gunicorn

Notifications You must be signed in to change notification settings

Midhilesh29/k8s

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

k8s

This repository provides the code to deploy Post Quantized mobilnetv2 model on Google Kubernetes Engine.

Code Architecture

  1. flask
    1. Dockerfile [Dockerfile the dependencies to run gunicorn+flask based application server for predicting the POST request image]
    2. deploy.py [Python code for running guincorn]
    3. FlaskApp.py [Python code for preprocessing the input image and predicting it with post quantized mobilenetv2 model]
    4. MobileNetV2Quantized.pth [Is the torchscript based serialized post quantized mobilenetv2 model]
  2. deployment.yaml [YAML file to create pods using replicaset]
  3. service.yaml [YAML file to create a service (type loadbalancer) for external word to reach this application service]
gcloud container clusters create cluster-1 --zone us-central1-a

gcloud container clusters get-credentials cluster-1 --zone us-central1-a

kubectl get nodes

kubectl apply -f deployment.yaml

kubectl apply -f service.yaml

Note

  1. Selector for service should be the label app given for pods
  2. name in deployment, service are basically to identify it.

About

Deploying post-training quantized mobinenetv2 on Googke Kubernetes Engine using flask, gunicorn

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published