intel / xFasterTransformer Public

Fork 44
Star 245

Pull requests: intel/xFasterTransformer

12 Open 337 Closed

[COMM] Fix bugs of core dump && hang when running cross nodes

#423 opened May 24, 2024 by abenmao

[xDNN] Release v1.5.1.

#422 opened May 24, 2024 by changqi1

[Distribute] Add distribute support for continuous batching api.

Labels: continuous batching; enhancement (New feature or request)

#421 opened May 24, 2024 by Duyi-Wang • Draft

1

[Kernel] Less compute for Self-Attention (Q * K)

Labels: enhancement (New feature or request); label described as "performance related."

#420 opened May 24, 2024 by pujiang2018

Add --padding and fix bug

Labels: benchmark (performance or accuracy benchmark); label described as "Something isn't working"

#418 opened May 23, 2024 by yangkunx

[Kernel] Add FP16 MHA and MLP kernels.

Labels: enhancement (New feature or request)

#415 opened May 21, 2024 by changqi1

[Kernel] Add GPU kernels.

Labels: enhancement (New feature or request); gpu (Related to GPU)

#372 opened May 7, 2024 by changqi1

5

[Model] Achieve whole pipeline parallel.

Labels: enhancement (New feature or request); gpu (Related to GPU)

#355 opened Apr 28, 2024 by changqi1 • Draft

1

[Eval] Add eval test with opencompass.

Labels: benchmark (performance or accuracy benchmark); enhancement (New feature or request)

#325 opened Apr 17, 2024 by marvin-Yu • Draft

Update AWQ GPTQ quantization guide

Labels: documentation (Improvements or additions to documentation)

#306 opened Apr 10, 2024 by miaojinc

[Kernel] Add oneDNN GPU kernels.

Labels: gpu (Related to GPU); label described as "performance related."

#253 opened Feb 29, 2024 by changqi1 • Draft

[Kernel] Add oneDNN GPU kernels.

Labels: gpu (Related to GPU); label described as "performance related."

#236 opened Feb 21, 2024 by changqi1 • Draft
