A high-throughput and memory-efficient inference and serving engine for LLMs
-
Updated
May 15, 2024 - Python
A high-throughput and memory-efficient inference and serving engine for LLMs
An orchestration platform for the development, production, and observation of data assets.
An awesome & curated list of best LLMOps tools for developers
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
This repo automates training of a Deep-AR algorithm using a Sagemaker pipeline.
Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, persist, and execute on your own infrastructure.
🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.
Production Grade Nifi & Nifi Registry. Deploy for VM (Virtual Machine) with Terraform + Ansible, Helm & Helmfile for Kubernetes (EKS)
Turns Data and AI algorithms into production-ready web applications in no time.
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
Extensible Python SDK for developing Flyte tasks and workflows. Simple to get started and learn and highly extensible.
Serve, optimize and scale PyTorch models in production
The most flexible way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Inference Graph/Pipelines, Compound AI systems, Multi-Modal, RAG as a Service, and more!
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.
Sample code and notebooks for Vertex AI, the end-to-end machine learning platform on Google Cloud
PostgreSQL vector database extension for building AI applications
🦋 A personal research and development (R&D) lab that facilitates the sharing of knowledge.
AI Observability & Evaluation
An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
Add a description, image, and links to the mlops topic page so that developers can more easily learn about it.
To associate your repository with the mlops topic, visit your repo's landing page and select "manage topics."