New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sync release 0.4.9 to dev #2407

Merged

Van-QA merged 3 commits into dev from main

Mar 19, 2024

Commits on Mar 17, 2024

Merge pull request #2398 from janhq/dev
```
Release cut 0.4.9
```
louis-jan committed Mar 17, 2024
Configuration menu
View commit details

Copy full SHA for 60cf8de

Browse repository at this point
Copy the full SHA

60cf8de View commit details

Browse the repository at this point in the history

Commits on Mar 18, 2024

fix: move tensorrt executable to engine (#2400 )

* fix: move tensorrt executable to engine

Signed-off-by: James <[email protected]>

* some update

Signed-off-by: hiro <[email protected]>

* chore: bump tensorrt version

* fix: wrong destroy path

* fix: install extensions in parallel

* chore: update path for tensorrt engine (#2404)

Signed-off-by: James <[email protected]>
Co-authored-by: James <[email protected]>

---------

Signed-off-by: James <[email protected]>
Signed-off-by: hiro <[email protected]>
Co-authored-by: James <[email protected]>
Co-authored-by: hiro <[email protected]>
Co-authored-by: Louis <[email protected]>

4 people committed Mar 18, 2024

Commits on Mar 19, 2024

Release/v0.4.9 (#2421 )

* fix: turn off experimental settings should also turn off quick ask (#2411)

* fix: app glitches 1s generating response before starting model (#2412)

* fix: disable experimental feature should also disable vulkan (#2414)

* fix: model load stuck on windows when can't get CPU core count (#2413)

Signed-off-by: James <[email protected]>
Co-authored-by: James <[email protected]>

* feat: TensorRT-LLM engine update support (#2415)

* fix: engine update

* chore: add remove prepopulated models

Signed-off-by: James <[email protected]>

* update tinyjensen url

Signed-off-by: James <[email protected]>

* update llamacorn

Signed-off-by: James <[email protected]>

* update Mistral 7B Instruct v0.1 int4

Signed-off-by: James <[email protected]>

* update tensorrt

Signed-off-by: James <[email protected]>

* update

Signed-off-by: hiro <[email protected]>

* update

Signed-off-by: James <[email protected]>

* prettier

Signed-off-by: James <[email protected]>

* update mistral config

Signed-off-by: James <[email protected]>

* fix some lint

Signed-off-by: James <[email protected]>

---------

Signed-off-by: James <[email protected]>
Signed-off-by: hiro <[email protected]>
Co-authored-by: James <[email protected]>
Co-authored-by: hiro <[email protected]>

* Tensorrt LLM disable turing support (#2418)

Co-authored-by: Hien To <[email protected]>

* chore: add prompt template tensorrtllm (#2375)

* chore: add prompt template tensorrtllm

* Add Prompt template for mistral and correct model metadata

---------

Co-authored-by: Hien To <[email protected]>

* fix: correct tensorrt mistral model.json (#2419)

---------

Signed-off-by: James <[email protected]>
Signed-off-by: hiro <[email protected]>
Co-authored-by: Louis <[email protected]>
Co-authored-by: James <[email protected]>
Co-authored-by: hiro <[email protected]>
Co-authored-by: hiento09 <[email protected]>
Co-authored-by: Hien To <[email protected]>

6 people committed Mar 19, 2024