Skip to content

Latest commit

 

History

History
188 lines (188 loc) · 3.28 KB

LLMs.md

File metadata and controls

188 lines (188 loc) · 3.28 KB

LLMs 大型语言模型

  • 大模型(大型语言模型,LLMs)是当下AI和NLP研究与产业中最重要的方向之一。
  • 模型一览
Model 作者l Size l 类型 l 开源l
LLaMa Meta AI 7B-65B Decoder open
OPT Meta AI 125M-175B Decoder open
T5 Google 220M-11B Encoder-Decoder open
mT5 Google 235M-13B Encoder-Decoder open
UL2 Google 20B Encoder-Decoder open
PaLM Google 540B Decoder no
LaMDA Google 2B-137B Decoder no
FLAN-T5 Google 同T5 Encoder-Decoder open
FLAN-UL2 Google 同U2 Encoder-Decoder open
FLAN-PaLM Google 同PaLM Decoder no
FLAN Google 同LaMDA Decoder no
BLOOM BigScience 176B Decoder open
T0 BigScience 3B Decoder open
BLOOMZ BigScience 同BLOOM Decoder open
mT0 BigScience 同T0 Decoder open
GPT-Neo EleutherAI 125M-2.7B Decoder open
GPT-NeoX EleutherAI 20B Decoder open
GPT3 OpenAI 175B (davinci) Decoder no
GPT4 OpenAI unknown OpenAI no
InstructGPT OpenAI 1.3B Decoder no
Alpaca Stanford 同LLaMa Decoder open
- 相关仓库 - [github.com/ggerganov/whisper.cpp](https://github.com/ggerganov/whisper.cpp) Port of OpenAI's Whisper model in C/C++ - [github.com/openai/whisper](https://github.com/openai/whisper) Robust Speech Recognition via Large-Scale Weak Supervision - [github.com/microsoft/visual-chatgpt](https://github.com/microsoft/visual-chatgpt) Official repo for the paper: Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models - [github.com/hwchase17/langchain](https://github.com/hwchase17/langchain) Building applications with LLMs through composability - [github.com/amazon-science/alexa-teacher-models](https://github.com/amazon-science/alexa-teacher-models) Alexa Teacher Models - [github.com/ggerganov/llama.cpp](https://github.com/ggerganov/llama.cpp) Port of Facebook's LLaMA model in C/C++ - [github.com/microsoft/JARVIS](https://github.com/microsoft/JARVIS) JARVIS, a system to connect LLMs with ML community.