LLMs 大型语言模型

Model	作者l	Size l	类型 l	开源l
LLaMa	Meta AI	7B-65B	Decoder	open
OPT	Meta AI	125M-175B	Decoder	open
T5	Google	220M-11B	Encoder-Decoder	open
mT5	Google	235M-13B	Encoder-Decoder	open
UL2	Google	20B	Encoder-Decoder	open
PaLM	Google	540B	Decoder	no
LaMDA	Google	2B-137B	Decoder	no
FLAN-T5	Google	同T5	Encoder-Decoder	open
FLAN-UL2	Google	同U2	Encoder-Decoder	open
FLAN-PaLM	Google	同PaLM	Decoder	no
FLAN	Google	同LaMDA	Decoder	no
BLOOM	BigScience	176B	Decoder	open
T0	BigScience	3B	Decoder	open
BLOOMZ	BigScience	同BLOOM	Decoder	open
mT0	BigScience	同T0	Decoder	open
GPT-Neo	EleutherAI	125M-2.7B	Decoder	open
GPT-NeoX	EleutherAI	20B	Decoder	open
GPT3	OpenAI	175B (davinci)	Decoder	no
GPT4	OpenAI	unknown	OpenAI	no
InstructGPT	OpenAI	1.3B	Decoder	no
Alpaca	Stanford	同LLaMa	Decoder	open

- 相关仓库 - [github.com/ggerganov/whisper.cpp](https://github.com/ggerganov/whisper.cpp) Port of OpenAI's Whisper model in C/C++ - [github.com/openai/whisper](https://github.com/openai/whisper) Robust Speech Recognition via Large-Scale Weak Supervision - [github.com/microsoft/visual-chatgpt](https://github.com/microsoft/visual-chatgpt) Official repo for the paper: Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models - [github.com/hwchase17/langchain](https://github.com/hwchase17/langchain) Building applications with LLMs through composability - [github.com/amazon-science/alexa-teacher-models](https://github.com/amazon-science/alexa-teacher-models) Alexa Teacher Models - [github.com/ggerganov/llama.cpp](https://github.com/ggerganov/llama.cpp) Port of Facebook's LLaMA model in C/C++ - [github.com/microsoft/JARVIS](https://github.com/microsoft/JARVIS) JARVIS, a system to connect LLMs with ML community.

Provide feedback