gptq

Here are 18 public repositories matching this topic...

SujanNeupane42 / LLM_Quantization

Quantizing LLMs using GPTQ

nlp machine-learning quantization huggingface llms gptq

Updated Dec 31, 2023
Jupyter Notebook

ElDokmak / LLMs-variety

Hands-on work with several LLMs

openai llama mamba mistral groq huggingface-transformers llm langchain llama-index gptq mixtral

Updated May 17, 2024
Jupyter Notebook

SJD1882 / LLMCheatSheet

Personal GitHub repository for stashing resources on Large Language Models (LLMs), including Jupyter Notebooks on open-source LLMs, use cases with LangChain, and R&D paper reviews.

python deep-learning literature-review colab-notebook large-language-models llamacpp gptq

Updated Jun 20, 2023
Jupyter Notebook

SujanNeupane42 / NEPSE-Chatbot-Using-Retrieval-augmented-generation-and-reranking

This project will develop a NEPSE chatbot using an open-source LLM, incorporating sentence transformers, a vector database, and reranking.

python flask faiss reranking-mechanism vector-database sentence-transformers llm langchain gptq retrieval-augmented-generation

Updated Dec 31, 2023
Jupyter Notebook

seyf1elislam / LocalLLM_OneClick_Colab

Run GGUF LLM models in the latest version of TextGen-webui

python colab-notebook llm llms gptq localllm exllama gguf localllama

Updated Jun 3, 2024
Jupyter Notebook

Aqirito / A.L.I.C.E

A.L.I.C.E (Artificial Labile Intelligence Cybernated Existence). A REST API for an AI companion, intended for building more complex systems

text-to-speech anime rest-api text-generation artificial-intelligence tts waifu otaku pygmalion fastapi huggingface-transformers genshin-impact vits llm llms langchain gptq langchain-python exllama

Updated Dec 3, 2023
Python

BobaZooba / shurale

Conversational AI model for open-domain dialogs

Updated Nov 15, 2023
Python

This repository is for profiling, extracting, visualizing and reusing generative AI weights to hopefully build more accurate AI models and audit/scan weights at rest to identify knowledge domains for risk(s).

ai deep-learning blender tiff transformers weights image-to-image blender-python llm stable-diffusion foundational-models generative-ai safetensors blip2 gptq

Updated Dec 18, 2023
Python

tripathiarpan20 / self-improvement-4all

Private self-improvement coaching with open-source LLMs

python transformers faiss langchain text-generation-webui gptq

Updated Mar 7, 2024
Python

chinoll / chatsakura

ChatSakura: Open-source multilingual conversational large language model.

bloom transformers pytorch gradio llm chatgpt bloomz instruct-gpt gptq

Updated Apr 2, 2023
Python

ziwang-com / zero-lora

zero: zero-training LLM parameter tuning

llama gpt lora llm gptq

Updated Jul 20, 2023

abhinand5 / gptq_for_langchain

A guide to using GPTQ models with LangChain

ai gpt quantization language-model llm langchain gptq wizardlm

Updated Aug 19, 2023
Jupyter Notebook

intel / auto-round

SOTA weight-only quantization algorithm for LLMs. This is the official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs"

rounding quantization awq int4 gptq neural-compressor weight-only