Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

运行quikstart进行模型推理报错 #337

Open
sunt40 opened this issue May 8, 2024 · 4 comments
Open

运行quikstart进行模型推理报错 #337

sunt40 opened this issue May 8, 2024 · 4 comments

Comments

@sunt40
Copy link

sunt40 commented May 8, 2024

This modeling file requires the following packages that were not found in your environment: flash_attn. Run pip install flash_attn

@sunt40
Copy link
Author

sunt40 commented May 8, 2024

安装不了这个包

@wangzilong1997
Copy link

。。。。mac m1 一样这个问题

@shams2023
Copy link

如何解决?

@wytitan
Copy link

wytitan commented May 15, 2024

trust_remote_code=True, use_flash_attention_2=True
我把这两个都移除了可以加载了,参考https://huggingface.co/microsoft/phi-1_5/discussions/72

我在纯CPU环境,改成了AutoModelForCausalLM.from_pretrained(model_name_or_path, device_map=device_map, torch_dtype=torch.float16)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants