Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(transformers): support also text generation #1630

Merged
merged 2 commits into from
Jan 23, 2024
Merged

Conversation

mudler
Copy link
Owner

@mudler mudler commented Jan 23, 2024

Description

Related to #1126. Up to now this backend was only supporting embeddings.

this PR adds text generation support also with transformers AutoModel/AutoTokenizer.

Notes for Reviewers

Signed commits

  • Yes, I signed my commits.

Copy link

netlify bot commented Jan 23, 2024

Deploy Preview for localai canceled.

Name Link
🔨 Latest commit 50bc9a4
🔍 Latest deploy log https://app.netlify.com/sites/localai/deploys/65affe37b59c00000860b0b7

@mudler mudler merged commit 5e335ea into master Jan 23, 2024
24 checks passed
@mudler mudler deleted the transformers_automodel branch January 23, 2024 22:07
@mudler mudler added the enhancement New feature or request label Jan 29, 2024
truecharts-admin added a commit to truecharts/charts that referenced this pull request Jan 29, 2024
….0@b689c91 by renovate (#17756)

This PR contains the following updates:

| Package | Update | Change |
|---|---|---|
| [docker.io/localai/localai](https://togithub.com/mudler/LocalAI) |
minor | `v2.6.1` -> `v2.7.0` |

---

> [!WARNING]
> Some dependencies could not be looked up. Check the Dependency
Dashboard for more information.

---

### Release Notes

<details>
<summary>mudler/LocalAI (docker.io/localai/localai)</summary>

### [`v2.7.0`](https://togithub.com/mudler/LocalAI/releases/tag/v2.7.0)

[Compare
Source](https://togithub.com/mudler/LocalAI/compare/v2.6.1...v2.7.0)

<!-- Release notes generated using configuration in .github/release.yml
at master -->

This release adds support to the transformer backend for LLM as well!

For now instance you can run codellama-7b with transformers with:

docker run -ti -p 8080:8080 --gpus all
localai/localai:v2.7.0-cublas-cuda12 codellama-7b

In the quickstart there are more examples available
https://localai.io/basics/getting_started/#running-models.

Note: As llama.cpp is ongoing with changes that could possible cause
breakage, this release does not includes changes from
[ggerganov/llama.cpp#5138
(the future versions will).

#### What's Changed

##### Bug fixes 🐛

- fix(paths): automatically create paths by
[@&#8203;mudler](https://togithub.com/mudler) in
[mudler/LocalAI#1650

##### Exciting New Features 🎉

- feat(transformers): support also text generation by
[@&#8203;mudler](https://togithub.com/mudler) in
[mudler/LocalAI#1630
- transformers: correctly load automodels by
[@&#8203;mudler](https://togithub.com/mudler) in
[mudler/LocalAI#1643
- feat(startup): fetch model definition remotely by
[@&#8203;mudler](https://togithub.com/mudler) in
[mudler/LocalAI#1654

##### 👒 Dependencies

- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1642
- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1644
- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1652
- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1655

##### Other Changes

- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1632
- ⬆️ Update docs version mudler/LocalAI by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1631

**Full Changelog**:
mudler/LocalAI@v2.6.1...v2.6.2

</details>

---

### Configuration

📅 **Schedule**: Branch creation - "before 10pm on monday" in timezone
Europe/Amsterdam, Automerge - At any time (no schedule defined).

🚦 **Automerge**: Enabled.

♻ **Rebasing**: Whenever PR becomes conflicted, or you tick the
rebase/retry checkbox.

🔕 **Ignore**: Close this PR and you won't be reminded about this update
again.

---

- [ ] <!-- rebase-check -->If you want to rebase/retry this PR, check
this box

---

This PR has been generated by [Renovate
Bot](https://togithub.com/renovatebot/renovate).

<!--renovate-debug:eyJjcmVhdGVkSW5WZXIiOiIzNy4xNTYuNCIsInVwZGF0ZWRJblZlciI6IjM3LjE1Ni40IiwidGFyZ2V0QnJhbmNoIjoibWFzdGVyIn0=-->
GabrielBarzen pushed a commit to GabrielBarzen/charts that referenced this pull request Feb 2, 2024
….0@b689c91 by renovate (truecharts#17756)

This PR contains the following updates:

| Package | Update | Change |
|---|---|---|
| [docker.io/localai/localai](https://togithub.com/mudler/LocalAI) |
minor | `v2.6.1` -> `v2.7.0` |

---

> [!WARNING]
> Some dependencies could not be looked up. Check the Dependency
Dashboard for more information.

---

### Release Notes

<details>
<summary>mudler/LocalAI (docker.io/localai/localai)</summary>

### [`v2.7.0`](https://togithub.com/mudler/LocalAI/releases/tag/v2.7.0)

[Compare
Source](https://togithub.com/mudler/LocalAI/compare/v2.6.1...v2.7.0)

<!-- Release notes generated using configuration in .github/release.yml
at master -->

This release adds support to the transformer backend for LLM as well!

For now instance you can run codellama-7b with transformers with:

docker run -ti -p 8080:8080 --gpus all
localai/localai:v2.7.0-cublas-cuda12 codellama-7b

In the quickstart there are more examples available
https://localai.io/basics/getting_started/#running-models.

Note: As llama.cpp is ongoing with changes that could possible cause
breakage, this release does not includes changes from
[ggerganov/llama.cpp#5138
(the future versions will).

#### What's Changed

##### Bug fixes 🐛

- fix(paths): automatically create paths by
[@&truecharts#8203;mudler](https://togithub.com/mudler) in
[mudler/LocalAI#1650

##### Exciting New Features 🎉

- feat(transformers): support also text generation by
[@&truecharts#8203;mudler](https://togithub.com/mudler) in
[mudler/LocalAI#1630
- transformers: correctly load automodels by
[@&truecharts#8203;mudler](https://togithub.com/mudler) in
[mudler/LocalAI#1643
- feat(startup): fetch model definition remotely by
[@&truecharts#8203;mudler](https://togithub.com/mudler) in
[mudler/LocalAI#1654

##### 👒 Dependencies

- ⬆️ Update ggerganov/llama.cpp by
[@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1642
- ⬆️ Update ggerganov/llama.cpp by
[@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1644
- ⬆️ Update ggerganov/llama.cpp by
[@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1652
- ⬆️ Update ggerganov/llama.cpp by
[@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1655

##### Other Changes

- ⬆️ Update ggerganov/llama.cpp by
[@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1632
- ⬆️ Update docs version mudler/LocalAI by
[@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1631

**Full Changelog**:
mudler/LocalAI@v2.6.1...v2.6.2

</details>

---

### Configuration

📅 **Schedule**: Branch creation - "before 10pm on monday" in timezone
Europe/Amsterdam, Automerge - At any time (no schedule defined).

🚦 **Automerge**: Enabled.

♻ **Rebasing**: Whenever PR becomes conflicted, or you tick the
rebase/retry checkbox.

🔕 **Ignore**: Close this PR and you won't be reminded about this update
again.

---

- [ ] <!-- rebase-check -->If you want to rebase/retry this PR, check
this box

---

This PR has been generated by [Renovate
Bot](https://togithub.com/renovatebot/renovate).

<!--renovate-debug:eyJjcmVhdGVkSW5WZXIiOiIzNy4xNTYuNCIsInVwZGF0ZWRJblZlciI6IjM3LjE1Ni40IiwidGFyZ2V0QnJhbmNoIjoibWFzdGVyIn0=-->
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant