New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
添加--group_by_length后训练一段时间后OOM #3668
Labels
solved
This problem has been already solved.
Comments
跟这个参数没有必然关系,跟你的batch_size 和cut off you关系。 |
@codemayq 调小batch size当然可以,但我的问题是group_by_length不是按照从大到小排序了吗?为啥一开始不OOM训到中间会OOM呢 中间的序列长度不会比第一批长吧 |
group_by_length 会从大到小排序 被训练,这个有什么文档可以支撑这个说法吗? |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Reminder
Reproduction
Expected behavior
添加--group_by_length后序列长度不是从大到小排序了吗?为什么中间会OOM
System Info
No response
Others
No response
The text was updated successfully, but these errors were encountered: