fix bug for load pretrained sharded model when train with lora #8360

jiaruipeng1994 · 2024-05-06T03:36:50Z

PR types

Bug fixes

PR changes

Others

Description

When train the lora model and load from full-parameter pretrained sharded model, we need to delete the fixed parameter in optimizer.

paddle-bot · 2024-05-06T03:36:55Z

Thanks for your contribution!

CLAassistant · 2024-05-06T03:36:57Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.

Ruipeng Jia seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you have already a GitHub account, please add the email address used for this commit to your account.
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

ZHUI · 2024-05-07T12:26:36Z

paddlenlp/trainer/trainer.py

@@ -521,6 +521,22 @@ def load_state_dict_from_checkpoint(self, resume_from_checkpoint=None):
 state_dict = self.load_state_dict_from_checkpoint_with_reshard(resume_from_checkpoint)
 if self.args.bf16:
 state_dict = self.recover_params_from_master_weights(state_dict)
+
+ for p in self.model.parameters():


这里有判断哪些参数是冻住的吗?

当时只是调通 lora 了, 确实没注意, 我再完善一下...

fix bug for load pretrained sharded model when train with lora

969c0a4

paddle-bot bot added the contributor label May 6, 2024

paddle-bot bot assigned lugimzzz May 6, 2024

ZHUI reviewed May 7, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix bug for load pretrained sharded model when train with lora #8360

fix bug for load pretrained sharded model when train with lora #8360

jiaruipeng1994 commented May 6, 2024

paddle-bot bot commented May 6, 2024

CLAassistant commented May 6, 2024

ZHUI May 7, 2024

jiaruipeng1994 May 7, 2024

fix bug for load pretrained sharded model when train with lora #8360

Are you sure you want to change the base?

fix bug for load pretrained sharded model when train with lora #8360

Conversation

jiaruipeng1994 commented May 6, 2024

PR types

PR changes

Description

paddle-bot bot commented May 6, 2024

CLAassistant commented May 6, 2024

ZHUI May 7, 2024

Choose a reason for hiding this comment

jiaruipeng1994 May 7, 2024

Choose a reason for hiding this comment