Support for LoRA adapter layers for LLM inference #181
The llm-export utility (https://github.com/wangzhaode/llm-export) appears to support exporting a lora.mnn file directly during conversion via llm_export.py.
However, it seems to me that the framework does not yet support inference with the exported lora.mnn file.
Any pointers regarding this would be useful :).
@wangzhaode
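For background on what an exported LoRA file would have to encode, here is a minimal sketch of what a LoRA adapter layer computes. This is not MNN's or llm-export's API; all names and shapes are illustrative: the base weight W stays frozen and the adapter adds a low-rank correction (alpha / r) * B @ A.

```python
import numpy as np

def lora_linear(x, W, A, B, alpha, r):
    """Base linear layer plus a LoRA adapter (illustrative only).

    W: frozen base weight, shape (d_out, d_in)
    A: low-rank down-projection, shape (r, d_in)
    B: low-rank up-projection, shape (d_out, r)
    The adapter contributes (alpha / r) * B @ A @ x on top of W @ x.
    """
    base = W @ x
    adapter = (alpha / r) * (B @ (A @ x))
    return base + adapter

# Toy shapes: d_in = d_out = 8, rank r = 2
rng = np.random.default_rng(0)
d_in, d_out, r, alpha = 8, 8, 2, 16
x = rng.standard_normal(d_in)
W = rng.standard_normal((d_out, d_in))
A = rng.standard_normal((r, d_in))
B = np.zeros((d_out, r))  # B starts at zero, so the adapter is a no-op until trained
print(lora_linear(x, W, A, B, alpha, r))
```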
Comments

MNN 2.9.0 will support applying LoRA on device, but there are currently some accuracy problems caused by quantization.

Sure. Thanks @wangzhaode.

Marking as stale. No activity in 30 days.
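As an aside on the accuracy problem mentioned above: this is a rough illustration, not MNN's actual pipeline, of one way quantization can interact with a LoRA delta. Merging the delta before quantizing the weights and adding it to already-quantized weights give different results, and the gap grows as the quantization gets coarser. The int8 scheme below is a generic symmetric per-tensor quantizer chosen only for the example.

```python
import numpy as np

def quantize_int8(W):
    """Symmetric per-tensor int8 quantization (illustrative, not MNN's scheme)."""
    scale = np.abs(W).max() / 127.0
    q = np.clip(np.round(W / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
W = rng.standard_normal((64, 64)).astype(np.float32)
delta = 0.05 * rng.standard_normal((64, 64)).astype(np.float32)  # stand-in for (alpha/r) * B @ A

# Path 1: merge the LoRA delta in float precision, then quantize the merged weight
q1, s1 = quantize_int8(W + delta)
merged_then_quant = dequantize(q1, s1)

# Path 2: quantize the base weight first, then add the delta on top of the dequantized weight
q2, s2 = quantize_int8(W)
quant_then_merged = dequantize(q2, s2) + delta

# The two paths disagree; an on-device pipeline that fixes the base quantization
# in advance can therefore see a different result than offline merging.
err = np.abs(merged_then_quant - quant_then_merged).mean()
print(f"mean absolute difference between the two paths: {err:.6f}")
```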