Skip to content

v0.3.0

Compare
Choose a tag to compare
@github-actions github-actions released this 31 Jan 08:07
· 973 commits to main since this release
1af090b

Major Changes

  • Experimental multi-lora support
  • Experimental prefix caching support
  • FP8 KV Cache support
  • Optimized MoE performance and Deepseek MoE support
  • CI tested PRs
  • Support batch completion in server

What's Changed

New Contributors

Full Changelog: v0.2.7...v0.3.0