Pull requests: intel/xFasterTransformer

Pull requests list

[xDNN] Release v1.5.1.
#422 opened May 24, 2024 by changqi1
[Kernel] Less compute for Self-Attention (Q * K)
Labels: enhancement, performance
#420 opened May 24, 2024 by pujiang2018
Add --padding and fix bug
Labels: benchmark, bug
#418 opened May 23, 2024 by yangkunx
[Kernel] Add FP16 MHA and MLP kernels.
Labels: enhancement
#415 opened May 21, 2024 by changqi1
[Kernel] Add GPU kernels.
Labels: enhancement, gpu
#372 opened May 7, 2024 by changqi1
[Model] Achieve whole pipeline parallel.
Labels: enhancement, gpu
#355 opened Apr 28, 2024 by changqi1 Draft
[Eval] Add eval test with opencompass.
Labels: benchmark, enhancement
#325 opened Apr 17, 2024 by marvin-Yu Draft
Update AWQ GPTQ quantization guide
Labels: documentation
#306 opened Apr 10, 2024 by miaojinc
[Kernel] Add oneDNN GPU kernels.
Labels: gpu, performance
#253 opened Feb 29, 2024 by changqi1 Draft
[Kernel] Add oneDNN GPU kernels.
Labels: gpu, performance
#236 opened Feb 21, 2024 by changqi1 Draft