Inference with AWQ quantized base model + compile enabled results in the <unk> tokens #426

Open
4 tasks
thincal opened this issue Apr 19, 2024 · 1 comment
Labels
bug Something isn't working

Comments

@thincal
Contributor

thincal commented Apr 19, 2024

System Info

lorax: v0.9.0
awq: main branch
transformers: v4.39.3

Information

  • Docker
  • The CLI directly

Tasks

  • An officially supported command
  • My own modifications

Reproduction

  1. Quantize the base model with AWQ, then start lorax with the --compile option.
  2. Run inference without an adapter. The result is: <unk><unk><unk><unk>...

Inference works fine when --compile is not passed.
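
To make the symptom easy to check when comparing a run with --compile against one without it, here is a minimal sketch in Python. It is not part of the original report. The URL, port, `/generate` path, and the request and response shapes (`inputs`, `parameters`, `generated_text`) are assumptions about a locally running lorax server and may need adjusting for your deployment; `is_all_unk` and `generate` are hypothetical helper names.

```python
import json
import urllib.request

# Assumption: lorax is listening locally on this address and path.
LORAX_URL = "http://127.0.0.1:8080/generate"
UNK = "<unk>"


def is_all_unk(text: str) -> bool:
    """Return True if text contains at least one <unk> and nothing else but whitespace."""
    return UNK in text and text.replace(UNK, "").strip() == ""


def generate(prompt: str, url: str = LORAX_URL, max_new_tokens: int = 16) -> str:
    """Send a prompt with no adapter set and return the generated text.

    Assumption: the server accepts {"inputs", "parameters"} as JSON and
    replies with a JSON object containing "generated_text".
    """
    payload = json.dumps(
        {"inputs": prompt, "parameters": {"max_new_tokens": max_new_tokens}}
    ).encode("utf-8")
    request = urllib.request.Request(
        url, data=payload, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request, timeout=60) as response:
        return json.loads(response.read())["generated_text"]
```

Calling `is_all_unk(generate("some prompt"))` against a server started with --compile, and again against one started without it, should show whether only the compiled path produces the all-<unk> output.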

Expected behavior

A normal response: coherent generated text rather than <unk> tokens.

@tgaddair tgaddair added the bug Something isn't working label May 23, 2024
@tgaddair
Contributor

Hey @thincal, thanks for reporting, this sounds like an issue with AWQ + compile. We should definitely investigate.

2 participants