llama2 '8da4w-gptq' quantization fails #3632
cc @HDCharles to check if there is anything obvious here
I can export llama2 with -qmode=8da4w with no problem, but when I try -qmode=8da4w-gptq, the export fails.
installed packages
executorch 0.3.0a0+aaa2f2e
torch 2.4.0.dev20240507+cpu
torchao 0.1
torchtune 0.1.1
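
For anyone comparing environments, here is a minimal sketch that prints the installed versions of the same four packages. It uses only the standard-library importlib.metadata module; the helper name installed_versions is hypothetical and not part of executorch or any other package listed above.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_versions(packages):
    """Map each distribution name to its installed version, or None if absent.

    Hypothetical helper for illustration; not part of executorch.
    """
    result = {}
    for name in packages:
        try:
            result[name] = version(name)
        except PackageNotFoundError:
            # The distribution is not installed in this environment.
            result[name] = None
    return result


if __name__ == "__main__":
    # Package names taken from the list above.
    names = ["executorch", "torch", "torchao", "torchtune"]
    for name, ver in installed_versions(names).items():
        print(f"{name} {ver or 'not installed'}")
```

Posting this output alongside the failing command makes it easier to tell whether a version mismatch is involved.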
command to reproduce
Has anyone succeeded with this? Any pointers would be much appreciated.