
substratusai/runbooks


Runbooks - Fine-tune LLMs on K8s with notebooks


NOTICE: Changes coming! runbooks.git (previously substratus.git) will be refactored to focus on Notebooks on K8s.

🎵 Fine-tune LLM models with no/low code
📔 Provide a Colab style seamless Notebook experience
☁️ Provide a unified ML platform across clouds
⬆️ Easy to install with minimal dependencies

Looking for serving?
🚀 substratusai/lingo: Serve popular OSS LLM models in minutes on CPUs or GPUs

Support the project by adding a star on GitHub! ❤️

Quickstart

Create a local Kubernetes cluster using Kind.

kind create cluster --name substratus --config - <<EOF
apiVersion: kind.x-k8s.io/v1alpha4
kind: Cluster
nodes:
- role: control-plane
  extraPortMappings:
  - containerPort: 30080
    hostPort: 30080
EOF

Install Substratus.

kubectl apply -f https://raw.githubusercontent.com/substratusai/substratus/main/install/kind/manifests.yaml

Import a small open-source LLM.

kubectl apply -f https://raw.githubusercontent.com/substratusai/substratus/main/examples/facebook-opt-125m/base-model.yaml

The manifest it applies:

apiVersion: substratus.ai/v1
kind: Model
metadata:
  namespace: default
  name: facebook-opt-125m
spec:
  image: substratusai/model-loader-huggingface
  params:
    name: facebook/opt-125m
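The manifest is the whole interface: `spec.image` selects the loader and `spec.params.name` names the Hugging Face repo to pull. To import a different base model, only the model ID (and the resource name) should need to change — a sketch, assuming other public Hugging Face model IDs work the same way (facebook/opt-350m is used here purely as an illustration):

```yaml
apiVersion: substratus.ai/v1
kind: Model
metadata:
  namespace: default
  name: facebook-opt-350m     # hypothetical variant of the example above
spec:
  image: substratusai/model-loader-huggingface
  params:
    name: facebook/opt-350m   # any public Hugging Face model ID
```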

Serve the LLM.

kubectl apply -f https://raw.githubusercontent.com/substratusai/substratus/main/examples/facebook-opt-125m/base-server.yaml

The manifest it applies:

apiVersion: substratus.ai/v1
kind: Server
metadata:
  name: facebook-opt-125m
spec:
  image: substratusai/model-server-basaran
  model:
    name: facebook-opt-125m

Check on the progress of the Model and the Server.

kubectl get ai

When they report a Ready status, start a port-forward.

kubectl port-forward service/facebook-opt-125m-server 8080:8080

Open your browser to http://localhost:8080/ or curl the LLM's API.

PS: Because of the small size of this particular LLM, expect comically bad answers to your prompts.

curl http://localhost:8080/v1/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "facebook-opt-125m",
    "prompt": "Who was the first president of the United States? ",
    "max_tokens": 10
  }'
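The completion comes back as JSON; Basaran exposes an OpenAI-compatible completions API, so the generated text sits in `choices[0].text`. A minimal sketch of pulling just that field out with a Python one-liner, shown here against a sample payload since the exact response shape is an assumption:

```shell
# Stand-in for live curl output: an OpenAI-style completions response.
response='{"object":"text_completion","choices":[{"index":0,"text":" George Washington was the first"}]}'

# Extract only the generated text from choices[0].text.
echo "$response" | python3 -c 'import sys, json; print(json.load(sys.stdin)["choices"][0]["text"])'
```

In practice, pipe the output of the `curl` command above straight into the same one-liner.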

Delete the local cluster.

kind delete cluster --name substratus

If you want to try out a more capable LLM running on substantial hardware, try Kind with GPU support, or try deploying Substratus in GKE.

Docs

Creators

Feel free to contact any of us: