Skip to content

Issues: vectorch-ai/ScaleLLM

ScaleLLM Roadmap
#84 opened Mar 16, 2024 by guocuimi
Open 3
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Assignee
Filter by who’s assigned
Sort

Issues list

Introducing the Mamba model
#165 opened Apr 28, 2024 by guocuimi
Structural Decoding: Json format
#154 opened Apr 28, 2024 by guocuimi
Structural Decoding: Json format
#153 opened Apr 28, 2024 by guocuimi
GPU Arch: Turing architecture (sm75) enhancement New feature or request
#152 opened Apr 28, 2024 by guocuimi
Adding support for Apple chips enhancement New feature or request
#151 opened Apr 28, 2024 by guocuimi
Introducing multi-modal models (LLaVA model) enhancement New feature or request
#150 opened Apr 28, 2024 by guocuimi
Implementing MoE (Mixture of Experts) kernels performance Improvements to performance
#149 opened Apr 28, 2024 by guocuimi
Exploring the feasibility of adopting the flashinfer library performance Improvements to performance
#147 opened Apr 28, 2024 by guocuimi
Exploring lookahead decoding support enhancement New feature or request
#146 opened Apr 28, 2024 by guocuimi
cuda graph capture may occasionally become stuck with multiple gpus. bug Something isn't working enhancement New feature or request
#131 opened Apr 18, 2024 by guocuimi
ScaleLLM Roadmap roadmap
#84 opened Mar 16, 2024 by guocuimi
9 of 31 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.