Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[Misc] Logits processor plugins
#4769 opened May 11, 2024 by NadavShmayo Loading…
[Misc]Easier access to the nccl library
#4767 opened May 11, 2024 by Cyuchuan Loading…
[Bugfix] Fix call to init_logger in openai server
#4765 opened May 11, 2024 by NadavShmayo Loading…
[Core][Bugfix]: fix prefix caching for blockv2
#4764 opened May 11, 2024 by leiwen83 Loading…
[Core][Distributed] add fast broadcast for tensor dict
#4757 opened May 11, 2024 by youkaichao Loading…
1 task
Add TensorizerArgs to client api server
#4752 opened May 10, 2024 by vrdn-23 Loading…
[Misc] Enhance attention selector
#4751 opened May 10, 2024 by WoosukKwon Loading…
[CI/Build] use setuptools-scm to set __version__
#4738 opened May 10, 2024 by dtrifiro Loading…
[Misc] Added devcontainer to help vscode dev setup
#4720 opened May 9, 2024 by ElefHead Loading…
[CORE] Improvement in ranks code
#4718 opened May 9, 2024 by SwapnilDreams100 Loading…
add TypeLogitsProcessor
#4712 opened May 9, 2024 by eitanturok Draft
Remove Ray health check
#4693 opened May 8, 2024 by Yard1 Loading…
[Core] Implement sharded state loader
#4690 opened May 8, 2024 by aurickq Loading…
ProTip! Updated in the last three days: updated:>2024-05-09.