
different inference result #453

Open
xd2333 opened this issue May 12, 2024 · 2 comments
Labels: currently fixing (Am fixing now!)

Comments

@xd2333

xd2333 commented May 12, 2024

Hi unslothai, I'm getting different inference results when using Unsloth. I've tested Qwen1.5-Chat and TinyLlama-Chat and hit the same issue: generation with Unsloth consistently produces worse output than with transformers, and I don't know why.

Here is my case:
https://colab.research.google.com/drive/1dxGKB-c3U8BYX-m2rQie8R12--0-JQMs?usp=sharing
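
For reference, a comparison along these lines (a minimal sketch, not the exact notebook code; the prompt is a placeholder) would generate with plain transformers under greedy decoding and diff the text against what the Unsloth-loaded model produces for the same prompt:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Reference generation with plain transformers, to diff against Unsloth's output.
model_id = "Qwen/Qwen1.5-7B-Chat"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

messages = [{"role": "user", "content": "Give me a short introduction to large language models."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

# Greedy decoding so the two backends are directly comparable.
output = model.generate(input_ids, max_new_tokens=128, do_sample=False)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```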

@danielhanchen added the currently fixing (Am fixing now!) label on May 13, 2024
@danielhanchen
Contributor

You're correct! It seems like max_seq_length's default of 4096 is auto scaling TinyLlama, causing bad outputs - I'll fix this asap - thanks for the report!
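
A possible workaround while the fix lands (a minimal sketch, assuming the FastLanguageModel loading API and TinyLlama's native 2048-token context): pass max_seq_length explicitly at or below the model's pretrained context so no automatic RoPE scaling is applied.

```python
from unsloth import FastLanguageModel

# Pin max_seq_length to TinyLlama's native 2048-token context so Unsloth
# does not apply RoPE scaling on top of the pretrained position embeddings
# (the 4096 default is what triggered the scaling described above).
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name = "TinyLlama/TinyLlama-1.1B-Chat-v1.0",
    max_seq_length = 2048,
    dtype = None,          # auto-detect float16 / bfloat16
    load_in_4bit = True,
)
FastLanguageModel.for_inference(model)  # switch to Unsloth's inference mode
```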

@xd2333
Author

xd2333 commented May 13, 2024

> You're correct! It seems like max_seq_length's default of 4096 is auto scaling TinyLlama, causing bad outputs - I'll fix this asap - thanks for the report!

Hi unslothai, thanks for fixing that! TinyLlama-Chat seems better now, but I found that Qwen1.5-7B-Chat still doesn't work well.

Here is the case too:
https://colab.research.google.com/drive/1dxGKB-c3U8BYX-m2rQie8R12--0-JQMs?usp=sharing#scrollTo=47OE5BgPB6Wm
