Skip to content

Releases: xorbitsai/inference

v0.11.2.post1

24 May 11:52
ac8f334
Compare
Choose a tag to compare

What's new in 0.11.2.post1 (2024-05-24)

These are the changes in inference v0.11.2.post1, a hotfix version of v0.11.2.

Bug fixes

  • BUG: fix launch model error when use torch 2.3.0 by @amumu96 in #1543

Full Changelog: v0.11.2...v0.11.2.post1

v0.11.2

24 May 09:10
77e79f8
Compare
Choose a tag to compare

What's new in 0.11.2 (2024-05-24)

These are the changes in inference v0.11.2.

New features

Enhancements

Bug fixes

  • BUG: Fix start worker failed due to None device name by @codingl2k1 in #1539
  • BUG: Fix gpu_idx allocate error when set replica > 1 by @amumu96 in #1528

Others

Full Changelog: v0.11.1...v0.11.2

v0.11.1

17 May 07:17
55a0200
Compare
Choose a tag to compare

What's new in 0.11.1 (2024-05-17)

These are the changes in inference v0.11.1.

New features

  • FEAT: support Yi-1.5 series by @qinxuye in #1489
  • FEAT: [UI] embedding and rerank support the specified GPU and CPU. by @yiboyasss in #1491

Enhancements

Bug fixes

Documentation

New Contributors

Full Changelog: v0.11.0...v0.11.1

v0.11.0

11 May 09:41
21be5ab
Compare
Choose a tag to compare

What's new in 0.11.0 (2024-05-11)

These are the changes in inference v0.11.0.

Break Changes

v0.11.0 introduced break change when launching model that model_engine should be specified, refer to Model Engine for more information

New features

Enhancements

Bug fixes

Tests

  • TST: Pin huggingface-hub to pass CI since it has some break changes by @ChengjieLi28 in #1427

Documentation

Others

  • BUG:Fix mertics is empty when call /v1/chat/completions by @amumu96 in #1406

New Contributors

Full Changelog: v0.10.3...v0.11.0

v0.10.3

24 Apr 02:57
2ba72b0
Compare
Choose a tag to compare

What's new in 0.10.3 (2024-04-24)

These are the changes in inference v0.10.3.

New features

Enhancements

Bug fixes

  • BUG: Fix Launching embedding or reranking models from commandline fails due to PEFT by @hainaweiben in #1343
  • BUG: Fix extra parameters issue when auto-recovering models by @ChengjieLi28 in #1348
  • BUG: Fix old rerank models use flag rerank issue by @codingl2k1 in #1350

Documentation

New Contributors

Full Changelog: v0.10.2.post1...v0.10.3

v0.10.2.post1

19 Apr 06:48
5001715
Compare
Choose a tag to compare

What's new in 0.10.2.post1 (2024-04-19)

These are the changes in inference v0.10.2.post1.

Bug fixes

Full Changelog: v0.10.2...v0.10.2.post1

v0.10.2

19 Apr 06:19
f19e85b
Compare
Choose a tag to compare

What's new in 0.10.2 (2024-04-19)

These are the changes in inference v0.10.2.

New features

Enhancements

Others

  • Revert "REF: support query for engine feature" by @qinxuye in #1329

Full Changelog: v0.10.1...v0.10.2

v0.10.1

12 Apr 02:47
e3a947e
Compare
Choose a tag to compare

What's new in 0.10.1 (2024-04-12)

These are the changes in inference v0.10.1.

New features

Enhancements

Bug fixes

Tests

Documentation

New Contributors

Full Changelog: v0.10.0...v0.10.1

v0.10.0

29 Mar 04:56
2857ec4
Compare
Choose a tag to compare

What's new in 0.10.0 (2024-03-29)

These are the changes in inference v0.10.0.

New features

Enhancements

Bug fixes

  • BUG: Fix the incorrect model interface address caused a 307 redirect to HTTP, blocking the request and preventing the display of the model list. by @wertycn in #1182
  • BUG: fix doc fail introduced by #1171 & update readme by @qinxuye in #1203
  • BUG: Increase validator types for thie 'input' parameter of embeddings to match OpenAI API by @Minamiyama in #1201

Documentation

New Contributors

Full Changelog: v0.9.4...v0.10.0

v0.9.4

21 Mar 07:06
2c9465a
Compare
Choose a tag to compare

What's new in 0.9.4 (2024-03-21)

These are the changes in inference v0.9.4.

New features

Enhancements

  • ENH: vLLM latest models support by @1572161937 in #1155

Bug fixes

Documentation

Others

New Contributors

Full Changelog: v0.9.3...v0.9.4