Do the published training weights "7b_tiva_v0" include all three stages of training results simultaneously? #62
Comments
Hi, the released checkpoint includes the parameters trained in all three stages. See lines 27 to 28 and lines 110 to 111 in e2e2f94.
@ChocoWu Thank you for your reply. I have another question: when I train the model myself, how do I save the results of all three stages in a single weight file? Can I point every stage at the same output path, such as "7b_tiva_v0"? If so, will the results of each stage be merged or overwritten? It seems that the second stage does not use the results of the first stage, and that the third stage does not use the results of the first and second stages.
@pengxuan001, actually, each stage does use the results of the previous stage: see line 17 in e2e2f94.
If you want to save the weights trained in different stages separately, you need to specify a different save path,
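As a rough, framework-agnostic illustration of the "merged or overwritten" question, the sketch below uses plain dictionaries and `pickle` in place of real model weights. The parameter names (`input_proj`, `output_proj`, `lora`), the file names, and the helpers `save_stage` and `load_stage` are all hypothetical and are not taken from the repository. It assumes each stage writes only the parameters it trained; under that assumption, reusing one path replaces the earlier file, while separate paths let a later stage load the earlier results first.

```python
import os
import pickle
import tempfile


def save_stage(params, trained_keys, path):
    """Write only the parameters trained in this stage to `path` (hypothetical helper)."""
    subset = {k: v for k, v in params.items() if k in trained_keys}
    with open(path, "wb") as f:  # "wb" truncates: an existing file is replaced
        pickle.dump(subset, f)


def load_stage(params, path):
    """Overlay saved parameters onto `params`; keys missing from the file stay unchanged."""
    with open(path, "rb") as f:
        params.update(pickle.load(f))
    return params


workdir = tempfile.mkdtemp()
stage1_path = os.path.join(workdir, "stage_1.pkl")  # hypothetical file names
stage2_path = os.path.join(workdir, "stage_2.pkl")
shared_path = os.path.join(workdir, "shared.pkl")

# Stage 1: train the (hypothetical) input projection and save it.
stage1_model = {"input_proj": 1.0, "output_proj": 0.0, "lora": 0.0}
save_stage(stage1_model, {"input_proj"}, stage1_path)
save_stage(stage1_model, {"input_proj"}, shared_path)

# Stage 2: start from a fresh model, load stage 1, then train the output projection.
stage2_model = {"input_proj": 0.0, "output_proj": 0.0, "lora": 0.0}
load_stage(stage2_model, stage1_path)
stage2_model["output_proj"] = 2.0
save_stage(stage2_model, {"output_proj"}, stage2_path)
save_stage(stage2_model, {"output_proj"}, shared_path)  # replaces the stage 1 file

# Separate paths: both stages can be combined afterwards.
assembled = {"input_proj": 0.0, "output_proj": 0.0, "lora": 0.0}
load_stage(assembled, stage1_path)
load_stage(assembled, stage2_path)

# Shared path: only the last write survives.
with open(shared_path, "rb") as f:
    shared_contents = pickle.load(f)
```

In PyTorch the analogous calls would be `torch.save` on a filtered state dict and `load_state_dict(..., strict=False)`, so that keys absent from a partial checkpoint are left at their current values. Whether the repository's own save routine writes a partial or a full state dict should be checked in its code.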
Are there any suggestions on how to load 7b_tiva_v0 during the training stage? I tried to continue instruction training on my own data starting from the provided 7b_tiva_v0 checkpoint. However, simply setting
According to the training code, each stage saves its results to a separate file. Do the published weights "7b_tiva_v0" include the results of all three training stages? In addition, in the inference code, the input projection layer, the output projection layer, and the LoRA weights for the LLM seem to be freshly initialized rather than loaded from the existing "7b_tiva_v0" checkpoint.
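One way to check which components a checkpoint actually contains is to group its parameter names by their top-level prefix. The sketch below is only an illustration: the key names in `example_keys` are invented, and the helper `summarize_keys` is hypothetical. With a real PyTorch checkpoint, the keys would typically come from the dictionary returned by `torch.load(path, map_location="cpu")`.

```python
from collections import Counter


def summarize_keys(keys):
    """Count parameter names by their first dotted component (hypothetical helper)."""
    return Counter(k.split(".", 1)[0] for k in keys)


# Invented key names standing in for a checkpoint's state dict keys.
example_keys = [
    "input_proj.weight",
    "input_proj.bias",
    "output_proj.layers.0.weight",
    "lora.q_proj.lora_A.weight",
    "lora.q_proj.lora_B.weight",
]

summary = summarize_keys(example_keys)
for prefix, count in sorted(summary.items()):
    print(f"{prefix}: {count} tensors")
```

If a prefix that belongs to one of the stages is missing from the summary, that component was not saved in the file and would keep its initial values when the checkpoint is loaded.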