Pull requests: intel/xFasterTransformer
Pull requests list
[COMM] Fix bugs of core dump && hang when running cross nodes
#423
opened May 24, 2024 by
abenmao
[Distribute] Add distribute support for continuous batching api.
continuous batching
enhancement
New feature or request
[Kernel] Less compute for Self-Attention (Q * K)
enhancement
New feature or request
performance
performance related.
#420
opened May 24, 2024 by
pujiang2018
Add --padding and fix bug
benchmark
performance or accuracy benchmark
bug
Something isn't working
#418
opened May 23, 2024 by
yangkunx
[Kernel] Add FP16 MHA and MLP kernels.
enhancement
New feature or request
#415
opened May 21, 2024 by
changqi1
[Kernel] Add GPU kernels.
enhancement
New feature or request
gpu
Related to GPU
#372
opened May 7, 2024 by
changqi1
[Eval] Add eval test with opencompass.
benchmark
performance or accuracy benchmark
enhancement
New feature or request
Update AWQ GPTQ quantization guide
documentation
Improvements or additions to documentation
#306
opened Apr 10, 2024 by
miaojinc