Issues: intel/auto-round
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
hook AutoHfQuantizer of transformers to support different backends and mixed precision quantization
enhancement
New feature or request
#109
opened May 16, 2024 by
wenhuach21
large discrepancy between GPTQ model and qdq model at W2 asym
#108
opened May 15, 2024 by
wenhuach21
support F.linear and matmul in some moe models
enhancement
New feature or request
#66
opened Mar 28, 2024 by
wenhuach21
ProTip!
Mix and match filters to narrow down what you’re looking for.