Integrate mistral.rs LLM #13105

EricLBuehler · 2024-04-25T12:15:12Z

Description

In this PR, I have added support for the mistral.rs LLM inference platform via a new integration. mistral.rs is a new LLM inference platform with key features such as prefix caching, optimized X-LoRA support, LoRA support via weight merging and grammar support.

New Package?

Did I fill in the tool.llamahub section in the pyproject.toml and provide a detailed README.md for my new integration or package?

Yes
No

Version Bump?

Did I bump the version in the pyproject.toml file of the package I am updating? (Except for the llama-index-core package)

Yes
No

Type of Change

Please delete options that are not relevant.

New feature (non-breaking change which adds functionality)

How Has This Been Tested?

Please describe the tests that you ran to verify your changes. Provide instructions so we can reproduce. Please also list any relevant details for your test configuration

Added new unit/integration tests
Added new notebook (that tests end-to-end)
I stared at the code and made sure it makes sense

Suggested Checklist:

I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
I have added Google Colab support for the newly added notebooks.
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes
I ran make format; make lint to appease the lint gods

review-notebook-app · 2024-04-25T12:15:18Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

nerdai

Awesome to see rust-based LLM inference libraries!

Took a look, and it seems there is an excessive amount of lint/fmt type of changes in this PR. Not sure they're entirely needed as in that our own static checks should pass with out these.

EricLBuehler · 2024-04-25T14:47:54Z

Thank you! I've removed formatting the whole codebase and just formatted my new changes.

nerdai · 2024-04-25T15:11:42Z

Thank you! I've removed formatting the whole codebase and just formatted my new changes.

thanks!

nerdai

Left a few comments in my first pass :)

llama-index-integrations/llms/llama-index-llms-mistral-rs/pyproject.toml

llama-index-integrations/llms/llama-index-llms-mistral-rs/llama_index/llms/mistral_rs/base.py

nerdai · 2024-04-25T15:20:41Z

@EricLBuehler do you mind adding me to your fork? Looks like we need to do some pants related stuff (i.e. run pants tailor :: in the root of your project)

EricLBuehler · 2024-04-25T15:24:15Z

@nerdai, I addressed your comments and have added you to the repo.

llama-index-integrations/llms/llama-index-llms-mistral-rs/README.md

llama-index-integrations/llms/llama-index-llms-mistral-rs/tests/test_llms_mistral-rs.py

nerdai · 2024-04-26T00:35:55Z

@EricLBuehler looks like we're running into error still:

E   SyntaxError: keyword argument repeated: logprobs

EricLBuehler · 2024-04-26T00:41:18Z

@nerdai, sorry for that mistake. It should be fixed now.

nerdai · 2024-04-26T00:45:55Z

@nerdai, sorry for that mistake. It should be fixed now.

All good -- thanks for the quick fix!

EricLBuehler · 2024-04-26T01:28:04Z

It seems like the CI tests are failing because this integration depends on the mistralrs library. Would it be best if I update the CI to install and build mistralrs?

nerdai · 2024-04-26T02:47:19Z

It seems like the CI tests are failing because this integration depends on the mistralrs library. Would it be best if I update the CI to install and build mistralrs?

yes, please ensure all the required deps are listed in the pyproject.toml. Best to run:

poetry add mistralrs

have you published mistralrs to pypi yet?

EricLBuehler · 2024-04-27T10:21:41Z

@nerdai, I just released mistralrs on pypi. However, it requires a Rust toolchain to build. Can we update the CI to install the Rust toolchain?

nerdai · 2024-04-27T12:33:21Z

@nerdai, I just released mistralrs on pypi. However, it requires a Rust toolchain to build. Can we update the CI to install the Rust toolchain?

Ah okay. Can it work with just he standard rust installation?

EricLBuehler · 2024-04-27T14:18:08Z

@nerdai, yes, it can. It depends on openssl though.

nerdai · 2024-04-28T02:07:02Z

@nerdai, yes, it can. It depends on openssl though.

Sorry @EricLBuehler not sure if I'm following. To my knowledge, Rust Toolchain is installed in our github runners by default (source).

Can we not just do a poetry add mistralrs so that it gets added as a dep in pyproject.toml?

EricLBuehler · 2024-04-28T02:38:12Z

@nerdai, thanks for clarifying! I have added it as a dependency now.

EricLBuehler · 2024-05-02T01:46:20Z

Hi @nerdai! I have updated this PR to use our latest PyPi release. Additionally, I made sure the tests pass by running the following commands:

make format
make lint
pants tailor --check ::
poetry run make -s test

I think that the CI tests should pass now.

nerdai · 2024-05-02T04:49:07Z

@EricLBuehler we're getting close! Looks like tests are still failing. From the traceback captured in the logs, i see this:

    ) -> list[dict[str, str]]:
E   TypeError: 'type' object is not subscriptable

Maybe we need to do the following:

change list to List in all such occurrences in base.py
change dict to Dict in all such occurrences in base.py

where List and Dict are both from typing module.

EricLBuehler · 2024-05-02T12:28:22Z

@nerdai, that should be fixed now! Not sure why the tests I ran locally didn't catch that though.

nerdai · 2024-05-02T14:26:21Z

@nerdai, that should be fixed now! Not sure why the tests I ran locally didn't catch that though.

Happens to me too sometimes. I think there's a mismatch between versions perhaps on python or other testing/formatting dependencies. 🤔

nerdai · 2024-05-02T14:52:07Z

@EricLBuehler time to 🛳️!

Thanks for this :)

docs/BUILD

EricLBuehler · 2024-05-02T15:22:07Z

@nerdai thank you!

dosubot bot added the size:L This PR changes 100-499 lines, ignoring generated files. label Apr 25, 2024

nerdai reviewed Apr 25, 2024

View reviewed changes

EricLBuehler force-pushed the main branch from d65d218 to b6672dd Compare April 25, 2024 14:47

EricLBuehler force-pushed the main branch from 6ea135e to b6672dd Compare April 25, 2024 14:58

Integrate

09f6c90

EricLBuehler force-pushed the main branch from b6672dd to 09f6c90 Compare April 25, 2024 15:03

dosubot bot added size:XL This PR changes 500-999 lines, ignoring generated files. and removed size:L This PR changes 100-499 lines, ignoring generated files. labels Apr 25, 2024

nerdai reviewed Apr 25, 2024

View reviewed changes

Changes based on comments

2ab16a6

EricLBuehler added 2 commits April 25, 2024 12:05

Run pants tailor

27d381c

Properly extract and pass logprobs

0be07b3

nerdai reviewed Apr 25, 2024

View reviewed changes

llama-index-integrations/llms/llama-index-llms-mistral-rs/README.md Outdated Show resolved Hide resolved

nerdai reviewed Apr 25, 2024

View reviewed changes

llama-index-integrations/llms/llama-index-llms-mistral-rs/tests/test_llms_mistral-rs.py Outdated Show resolved Hide resolved

EricLBuehler added 2 commits April 25, 2024 13:37

Add a simple test

b5e6bbb

Add a usage section

90ee389

Fix silly mistake

3d4daa0

Add mistralrs as a dependancy

9202ef6

EricLBuehler added 4 commits April 29, 2024 21:02

Fix extract logprobs and update api

683ee97

Update for new version

0cf7a91

Update version

96c204b

Prettier

6c4df6b

Fix typing

0f46de1

nerdai approved these changes May 2, 2024

View reviewed changes

dosubot bot added the lgtm This PR has been approved by a maintainer label May 2, 2024

nerdai reviewed May 2, 2024

View reviewed changes

docs/BUILD Outdated Show resolved Hide resolved

Remove unnecessary in docs BUILD

52c57db

nerdai merged commit 772a575 into run-llama:main May 2, 2024
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Integrate mistral.rs LLM #13105

Integrate mistral.rs LLM #13105

EricLBuehler commented Apr 25, 2024 •

edited

review-notebook-app bot commented Apr 25, 2024

nerdai left a comment

EricLBuehler commented Apr 25, 2024

nerdai commented Apr 25, 2024

nerdai left a comment

nerdai commented Apr 25, 2024

EricLBuehler commented Apr 25, 2024

nerdai commented Apr 26, 2024

EricLBuehler commented Apr 26, 2024

nerdai commented Apr 26, 2024

EricLBuehler commented Apr 26, 2024

nerdai commented Apr 26, 2024 •

edited

EricLBuehler commented Apr 27, 2024

nerdai commented Apr 27, 2024

EricLBuehler commented Apr 27, 2024 •

edited

nerdai commented Apr 28, 2024

EricLBuehler commented Apr 28, 2024

EricLBuehler commented May 2, 2024 •

edited

nerdai commented May 2, 2024

EricLBuehler commented May 2, 2024

nerdai commented May 2, 2024

nerdai commented May 2, 2024

EricLBuehler commented May 2, 2024

Integrate mistral.rs LLM #13105

Integrate mistral.rs LLM #13105

Conversation

EricLBuehler commented Apr 25, 2024 • edited

Description

New Package?

Version Bump?

Type of Change

How Has This Been Tested?

Suggested Checklist:

review-notebook-app bot commented Apr 25, 2024

nerdai left a comment

Choose a reason for hiding this comment

EricLBuehler commented Apr 25, 2024

nerdai commented Apr 25, 2024

nerdai left a comment

Choose a reason for hiding this comment

nerdai commented Apr 25, 2024

EricLBuehler commented Apr 25, 2024

nerdai commented Apr 26, 2024

EricLBuehler commented Apr 26, 2024

nerdai commented Apr 26, 2024

EricLBuehler commented Apr 26, 2024

nerdai commented Apr 26, 2024 • edited

EricLBuehler commented Apr 27, 2024

nerdai commented Apr 27, 2024

EricLBuehler commented Apr 27, 2024 • edited

nerdai commented Apr 28, 2024

EricLBuehler commented Apr 28, 2024

EricLBuehler commented May 2, 2024 • edited

nerdai commented May 2, 2024

EricLBuehler commented May 2, 2024

nerdai commented May 2, 2024

nerdai commented May 2, 2024

EricLBuehler commented May 2, 2024

EricLBuehler commented Apr 25, 2024 •

edited

nerdai commented Apr 26, 2024 •

edited

EricLBuehler commented Apr 27, 2024 •

edited

EricLBuehler commented May 2, 2024 •

edited