Pull requests: vllm-project/vllm
Pull requests list
[CI/Build] Enable entrypoints tests to be run in a single command
#4759
opened May 11, 2024 by
DarkLight1337
[Frontend] Re-enable custom roles in Chat Completions API
#4758
opened May 11, 2024 by
DarkLight1337
[Core][Distributed] add fast broadcast for tensor dict
#4757
opened May 11, 2024 by
youkaichao
1 task
[Core][Distributed] refactor custom allreduce to support multiple tp groups
action-required
#4754
opened May 10, 2024 by
youkaichao
1 task
[CI/Build] Enforce style for C++ and CUDA code with clang-format
#4722
opened May 9, 2024 by
mgoin
[CI/Build] Tweak Marlin Nondeterminism Issues
#4713
opened May 9, 2024 by
robertgshaw2-neuralmagic
[Core][Hash][Automatic Prefix caching] Accelerating the hashing function by avoiding deep copies
#4696
opened May 8, 2024 by
KuntaiDu
[Frontend] OpenAI API server: Do not add bos token by default when encoding
#4688
opened May 8, 2024 by
bofenghuang