-
Notifications
You must be signed in to change notification settings - Fork 569
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
数据加载出现问题:pad_sequence(): argument 'padding_value' (position 3) must be float, not NoneType #405
Comments
像是cache生成有问题,删掉cache重新生成一下试试 |
删了重试,还是这个错误 |
不是很确定问题所在,代码在生成cache的过程中可能因为内存不足而程序失败的问题,重提就能解决,但是生成cache后训练中出现问题还没遇到 |
你实际输入的tokenizer不是我们发布的tokenizer,然后你的tokenizer.model中没有pad_token这个选项,所以会出现这个错误。 |
I think we can add the following code block to sft trainer.
|
I have created pr for this issue. |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your consideration. |
Closing the issue, since no updates observed. Feel free to re-open if you need any further assistance. |
自己训练词汇表时,怎么加入这个pad_token? |
我的打印后有如下:
但是也出现问题了; |
提交前必须检查以下项目
问题类型
其他问题
基础模型
Chinese-Alpaca-2 (7B/13B)
操作系统
Linux
详细描述问题
依赖情况(代码类问题务必提供)
运行日志或截图
The text was updated successfully, but these errors were encountered: