Valyu LLManager simplifies and scales LLM application deployment, reducing infrastructure complexity and costs.
A framework for few-shot evaluation of autoregressive language models.
Automating the deployment of the Takeoff Server on AWS for LLMs
Deep learning environment setups
Okik is a command-line interface (CLI) tool for LLM, RAG and model serving.
Streaming of LLM responses in real time using FastAPI and Streamlit.
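The core of that streaming pattern can be sketched in plain Python: the server exposes a generator that yields small chunks as the model produces them, and the client renders each chunk on arrival. This is a minimal sketch under assumptions — the whitespace split stands in for a real model's token stream, and in an actual app the generator would be wrapped in FastAPI's `StreamingResponse` and consumed by Streamlit.

```python
def stream_tokens(text):
    """Yield a response chunk by chunk, simulating token-by-token LLM output.

    Hypothetical stand-in for a model's token stream: a real implementation
    would yield pieces from the model's incremental decoding loop, and a
    FastAPI endpoint would pass this generator to StreamingResponse so the
    client sees output before generation finishes.
    """
    for word in text.split():
        yield word + " "


# A client consumes the stream incrementally instead of waiting for the
# full response, which is what makes the UI feel responsive.
chunks = list(stream_tokens("hello from a streaming llm endpoint"))
full = "".join(chunks)
```

The same generator works unchanged whether the transport is server-sent events, chunked HTTP, or a WebSocket; only the wrapper around it differs.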
Building static web applications using large language models: converts hand-sketched documents, images, and screenshots into proper web pages.
Internet LLM: access your Ollama (or any other local LLM) instance from across the internet.
Run any large language model on your local machine with this repository.
A Production-Ready, Scalable RAG-powered LLM-based Context-Aware QA App
A guide on how to run LLMs on Intel CPUs.
Lightweight and extensible LLM Inference serving benchmark tool written in Rust.
An Easy-to-use Knowledge Editing Framework for LLMs.
Hinglish chatbot powered by Azure Cognitive Services, Google Translate, and OpenAI.
Deno LLM API Service
This repository demonstrates LLM execution on CPUs using packages like llamafile, emphasizing low latency, high throughput, and cost efficiency for inference and serving.
A comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) 2024.
A self-hosted personal chatbot API with FastAPI. It allows you to interact with the Llama2 LLM (and other open-source LLMs) to have natural language conversations, generate text, and perform various language-related tasks.