
KServe Inference Graph

Create VEnv

virtualenv venv --python=python3.10
source venv/bin/activate

Install requirements

pip install -r requirements-torch.txt
pip install -r requirements.txt

MAR Generation

torch-model-archiver --model-name cat-classifier --handler ts_handlers/hf-image-classification/hf_image_classification_handler.py --requirements-file ts_handlers/hf-image-classification/requirements.txt --extra-files models/cat-classifier/ --version 1.0
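torch-model-archiver writes cat-classifier.mar to the current directory (or to --export-path if given), while the TorchServe command below looks under model-store/cat-classifier/model-store. A minimal sketch for putting the archive where TorchServe expects it, assuming the default output location:

# Assumes cat-classifier.mar was written to the current directory
mkdir -p model-store/cat-classifier/model-store
mv cat-classifier.mar model-store/cat-classifier/model-store/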

Test Model in TorchServe

torchserve --model-store model-store/cat-classifier/model-store --start --models all --foreground
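Once TorchServe is up, you can hit the local inference API to check the model responds; a sketch assuming the default inference port 8080 and a sample image named cat.jpg (placeholder):

# Health check
curl http://localhost:8080/ping

# Prediction against the registered model
curl http://localhost:8080/predictions/cat-classifier -T cat.jpg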

Docker Installation

curl -fsSL https://get.docker.com -o get-docker.sh
sudo sh get-docker.sh
sudo usermod -aG docker $USER
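The group change only applies to new login sessions; one way to pick it up in the current shell and verify the install:

# Apply the docker group to the current shell, then run a test container
newgrp docker
docker run --rm hello-world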

AWS CLI Setup

curl "https://awscli.amazonaws.com/awscli-exe-linux-x86_64.zip" -o "awscliv2.zip"
unzip awscliv2.zip
sudo ./aws/install
aws s3 cp --recursive model-store s3://tsai-emlo/kserve-ig/
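The S3 upload assumes credentials are already configured; if not, set them up first, then verify the copy (same bucket and prefix as above):

# One-time interactive credential setup
aws configure

# Verify the upload
aws s3 ls --recursive s3://tsai-emlo/kserve-ig/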

Minikube

curl -LO https://storage.googleapis.com/minikube/releases/latest/minikube-linux-amd64
sudo install minikube-linux-amd64 /usr/local/bin/minikube
# Either the QEMU driver
minikube start --driver=qemu --memory 40960 --cpus 16

# or the Docker driver
minikube start --driver=docker --memory 12288 --cpus 4

For 5 models you'll need the following; each model takes 1 vCPU and at least 2 GiB of RAM.

minikube start --driver=docker --memory 28672 --cpus 8 --disk-size 180g
curl -LO "https://dl.k8s.io/release/$(curl -L -s https://dl.k8s.io/release/stable.txt)/bin/linux/amd64/kubectl"
sudo install -o root -g root -m 0755 kubectl /usr/local/bin/kubectl
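A quick sanity check that the cluster is up and kubectl can reach it:

minikube status
kubectl get nodes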

Exposing MiniKube to EC2 Public IP

minikube tunnel --bind-address 0.0.0.0
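With the tunnel running, LoadBalancer services such as Istio's ingress gateway get an external address; to reach it from outside the EC2 instance, the corresponding port also has to be open in the instance's security group. A sketch for looking up the gateway address, assuming KServe's default Istio install in istio-system:

kubectl get svc istio-ingressgateway -n istio-system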

KServe Installation

curl -s "https://raw.githubusercontent.com/kserve/kserve/release-0.11/hack/quick_install.sh" | bash

Notes

JAVA Installation

sudo apt install default-jdk
sudo update-alternatives --config java

Add export JAVA_HOME="/usr/lib/jvm/java-11-openjdk-amd64" to your .bashrc (adjust the path to the JDK selected by update-alternatives).
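The prediction requests below assume SERVICE_HOSTNAME, INGRESS_HOST and INGRESS_PORT are set. A sketch for populating them, assuming an InferenceService named sklearn-iris in the default namespace and the Istio ingress gateway in istio-system; the second curl (against 127.0.0.1) assumes the gateway is reachable locally, e.g. via minikube tunnel or a port-forward:

SERVICE_HOSTNAME=$(kubectl get inferenceservice sklearn-iris -o jsonpath='{.status.url}' | cut -d "/" -f 3)
INGRESS_HOST=$(kubectl get svc istio-ingressgateway -n istio-system -o jsonpath='{.status.loadBalancer.ingress[0].ip}')
INGRESS_PORT=$(kubectl get svc istio-ingressgateway -n istio-system -o jsonpath='{.spec.ports[?(@.name=="http2")].port}')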

curl -v -H "Host: ${SERVICE_HOSTNAME}" -H "Content-Type: application/json" "http://${INGRESS_HOST}:${INGRESS_PORT}/v1/models/sklearn-iris:predict" -d @./a.json
curl -v -H "Host: ${SERVICE_HOSTNAME}" -H "Content-Type: application/json" "http://127.0.0.1:${INGRESS_PORT}/v1/models/sklearn-iris:predict" -d @./a.json