Skip to content

v0.2.7

Compare
Choose a tag to compare
@github-actions github-actions released this 04 Jan 01:36
· 1053 commits to main since this release
2e0b6e7

Major Changes

  • Up to 70% throughput improvement for distributed inference by removing serialization/deserialization overheads
  • Fix tensor parallelism support for Mixtral + GPTQ/AWQ

What's Changed

New Contributors

Full Changelog: v0.2.6...v0.2.7