
[EPIC] Model support dashboard (v2) #1126

Open
44 of 88 tasks
mudler opened this issue Oct 2, 2023 · 9 comments
Assignees
Labels
enhancement (New feature or request) · high prio · new-backend · roadmap · up for grabs (Tickets that no-one is currently working on)

Comments

@mudler
Owner

mudler commented Oct 2, 2023

This epic is the main tracker for all the backend additions that should be part of LocalAI v2, and for ongoing efforts.

The objective is to release a v2 that deprecates old, now-superseded models and adds a new set. To achieve this, my idea is to clean up the current state and start pinning dependencies for all the backends that require specific environment settings (the python-based ones).

Some key points:

  • The go-llama backend will be sunset in favor of the C++ version, which is directly tied to llama.cpp and easier to pin to specific llama.cpp versions
  • we will deprecate some (if not all) of the duplicated backend architectures (like falcon, gpt-2, gpt-neo-x, etc.), which are now supported by llama.cpp
  • the python backends will use conda environments to pin specific versions (see the sketch after this list)
  • have a common base point for backends in several languages (rust, c++, python, golang, ...)
  • re-order the internal layout to accommodate more backends as needed
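
As a rough sketch of what the pinning could look like, a per-backend conda environment file might be shaped like this (the backend name and every version pin here are illustrative assumptions, not the actual LocalAI files):

```yaml
# Hypothetical environment.yml for a python backend (e.g. diffusers).
# The name and all version pins below are illustrative, not the real ones.
name: diffusers
channels:
  - defaults
dependencies:
  - python=3.10
  - pip
  - pip:
      - torch==2.0.1
      - diffusers==0.21.4
      - transformers==4.34.0
```

Each python backend would then run in its own environment (created with `conda env create -f environment.yml` and invoked via `conda run -n diffusers ...`), so the version requirements of one backend cannot clash with another's.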

Some backends will be deprecated as superseded; as a consequence, some repositories will be archived (exact list TBD).

Backends:

After conda

We still need to test on master:

  • Vall-E-X
  • diffusers
  • bark
  • autogptq
  • exllama
  • vllm

Some rough first steps required:

Re-layout

After the re-layout we can add the new backends listed above without any clashes, and:

@mudler mudler added the enhancement New feature or request label Oct 2, 2023
@mudler mudler self-assigned this Oct 2, 2023
@mudler mudler changed the title [EPIC Backends to add → [EPIC] Backends to add Oct 2, 2023
@Aisuko Aisuko self-assigned this Oct 2, 2023
@localai-bot
Contributor

⚠️⚠️⚠️⚠️⚠️

Hi! I'm a bot running with LocalAI (a crazy experiment of @mudler) - please be aware that I might hallucinate sometimes!

but... I can also be funny or helpful 😸 and, generally speaking, I can provide good tips or point to the places in the documentation or the code worth looking at, based on what you wrote in the issue.

Don't engage in conversation with me;
I don't support replying (yet)!

⚠️⚠️⚠️⚠️⚠️

Issue description:
The issue is related to the LocalAI project and its implementation of the Image endpoint. The user wants to create a cute baby sea otter image with the given prompt, size, and number of images. However, the code provided in the readInput function is not able to parse the input correctly and returns an error.

Proposed solution:
The issue seems to be related to the parsing of the input parameters. The user can try modifying the code in the readInput function to handle the specific format of the prompt, size, and number of images provided by the OpenAI API. Alternatively, they can try using a different API that provides more flexibility in the input format or use a different tool to generate the image.

If the issue persists, the user can also check if there are any updates available for the LocalAI project or report the issue on the GitHub repository to get help from the community.

Sources:

@mudler mudler changed the title [EPIC] Backends to add → [EPIC] Backends to add (v2) Oct 2, 2023
@Aisuko
Collaborator

Aisuko commented Oct 14, 2023

I am thinking here: should we have an outline like the one below?

  • C++
    • llama.cpp-series LLMs (like Qwen, chatglm)
  • Python
    • Huggingface
    • autoGPT
  • Rust
    • rustformers
    • candle

If we keep this in mind, we can map each model to a specific backend. For example, I saw that the Qwen LLM's C++ implementation is described as "working in the same way as llama.cpp", so maybe it can be loaded using llama.cpp and should be compatible with our C++ backend (sketched at the end of this comment).

I also suggest adding labels related to each backend series, so we can tell at a glance which of our backends a new model is compatible with.
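
To make the mapping concrete, a llama.cpp-compatible model could be wired up with a small model config along these lines (a minimal sketch; the backend identifier, field names, and model file are assumptions for illustration, not a verified LocalAI config):

```yaml
# Hypothetical LocalAI model definition mapping a llama.cpp-compatible
# model (e.g. Qwen) to the llama backend. All values are illustrative.
name: qwen
backend: llama
parameters:
  model: qwen-7b-q4_0.gguf
context_size: 4096
```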

@mudler mudler changed the title [EPIC] Backends to add (v2) → [EPIC] Backends for v2 Oct 17, 2023
@mudler
Owner Author

mudler commented Nov 4, 2023

The conda branch was merged in #1144. I'm now looking into bringing the llama.cpp backend on par with go-llama and also adding llava support to it.

I'm going to refactor and re-layout things in the new backend directory too

@Aisuko
Collaborator

Aisuko commented Nov 5, 2023

@mudler thank you for the mention. There are some questions (the second and third ones) that may need your help: #1180 (comment). What I am thinking is that we can use a tiny model to test the Rust backend features and make sure everything is OK; then maybe we can merge it.

If everything is OK, we can add other LLMs. I plan to support Llama2 (60% finished, but it still has an issue), whisper, and also the ONNX format.

@Aisuko Aisuko pinned this issue Nov 5, 2023
@mudler
Owner Author

mudler commented Nov 13, 2023

Breaking re-layout PR: #1279

@kno10
Contributor

kno10 commented Nov 27, 2023

Caching/preloading of transformers and similar models: these are currently loaded automatically on startup into /root/.cache/huggingface/. It seems to be enough to set TRANSFORMERS_CACHE in the environment to the models folder, so maybe this only requires a documentation addition (a minimal example follows).
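
A minimal sketch of that workaround, assuming a containerized setup with the models folder mounted at /models (the service name and paths are illustrative):

```yaml
# Hypothetical docker-compose override: point the transformers cache at
# the mounted models folder so weights are not re-downloaded into
# /root/.cache/huggingface/ on every startup.
services:
  api:
    environment:
      - TRANSFORMERS_CACHE=/models
    volumes:
      - ./models:/models
```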

@muka
Contributor

muka commented Dec 27, 2023

I may start looking into #1273 while this progresses. What do you think?

@mudler
Owner Author

mudler commented Dec 27, 2023

I may start looking into #1273 while this progresses. What do you think?

Please feel free to go ahead; there are many pieces involved here, and any help is more than appreciated 👍

@mudler mudler changed the title [EPIC] Backends for v2 → [EPIC] Model support dashboard (v2) Jan 24, 2024
@mudler
Owner Author

mudler commented Mar 4, 2024

Caching/preloading of transformers and similar models: these are currently loaded automatically on startup into /root/.cache/huggingface/. It seems to be enough to set TRANSFORMERS_CACHE in the environment to the models folder, so maybe this only requires a documentation addition.

In #1746 I'm taking care of automatically binding the HF cache variables to the models directory if they are not set already.
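
A minimal sketch of that behaviour in Go (illustrative only, not the actual #1746 implementation; the function name and the exact set of variables are assumptions):

```go
package main

import (
	"os"
	"path/filepath"
)

// bindHFCache points the standard Hugging Face cache variables at the
// models directory, but only when the user has not set them already.
func bindHFCache(modelsDir string) {
	for _, v := range []string{"HF_HOME", "TRANSFORMERS_CACHE", "HUGGINGFACE_HUB_CACHE"} {
		if os.Getenv(v) == "" {
			os.Setenv(v, filepath.Join(modelsDir, "huggingface"))
		}
	}
}

func main() {
	bindHFCache("/models")
}
```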
