
Loss calculation always 0 #207

Open
sanipanwala opened this issue Feb 27, 2024 · 4 comments

@sanipanwala

Hello,

I'm trying to fine-tune the 34B model, but during fine-tuning I always get a loss of 0. I was able to fine-tune the 7B and 13B models, but not the 34B.

Let me know if I'm overlooking something, or please give me suggestions.

Thanks.

@jgehring
Contributor

Hi @sanipanwala, we don't provide support for fine-tuning in this repository. Which tools are you using for this? Are you sure they support the 34B model well? The exact same setting works for 7B and 13B? In any case, a loss of 0 at the start of training is a good indication that something's going wrong.
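One common way to end up with a loss of exactly 0 from the very first step (not confirmed as the cause in this thread, just a frequent culprit) is that every label position in the batch is masked out with the Hugging Face ignore index `-100`, e.g. because of a tokenizer/padding mismatch; many training loops then divide a zero sum by a guarded count and report 0.0. A minimal numpy sketch of that failure mode, with a hypothetical `masked_cross_entropy` helper standing in for the real loss:

```python
import numpy as np

IGNORE_INDEX = -100  # Hugging Face convention for masked label positions

def masked_cross_entropy(logits, labels, ignore_index=IGNORE_INDEX):
    """Cross-entropy averaged over non-masked tokens.

    logits: (n_tokens, vocab) float array
    labels: (n_tokens,) int array; positions equal to ignore_index are skipped
    """
    valid = labels != ignore_index
    n_valid = int(valid.sum())
    if n_valid == 0:
        # No supervised tokens at all: a guarded average like this one
        # reports 0.0, which silently looks like a "perfect" loss.
        return 0.0
    # Numerically stable log-softmax.
    z = logits - logits.max(axis=-1, keepdims=True)
    log_probs = z - np.log(np.exp(z).sum(axis=-1, keepdims=True))
    idx = np.where(valid)[0]
    return float(-log_probs[idx, labels[idx]].mean())

rng = np.random.default_rng(0)
logits = rng.normal(size=(8, 32))

good_labels = rng.integers(0, 32, size=8)     # at least some supervised tokens
all_masked = np.full(8, IGNORE_INDEX)         # every position masked out

print(masked_cross_entropy(logits, good_labels))  # a positive value
print(masked_cross_entropy(logits, all_masked))   # 0.0 — the symptom in this thread
```

If the 34B run uses a different tokenizer or max sequence length than the 7B/13B runs, it is worth printing `(labels != -100).sum()` for one batch before training.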

@jgehring jgehring self-assigned this Feb 28, 2024
@sanipanwala
Author

@jgehring I mean I'm using the "codellama/CodeLlama-34b-hf" model and running a normal Python script, and yes, the same configuration works with 7B and 13B.

Thanks.

@sssszh

sssszh commented Apr 3, 2024

@sanipanwala
Hi, have you solved this problem yet?

I ran into the same problem when trying to PEFT fine-tune CodeLlama-7B (using LlamaForSequenceClassification): the loss is always 0 during fine-tuning.

Thanks!
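Since both reports here describe a loss stuck at 0 from the first step, a quick single-batch diagnostic can narrow things down before blaming the trainer: check whether the logits are finite (fp16 overflow shows up as inf/nan), how many labels are actually supervised, and what dtype the forward pass produced. A generic numpy sketch (the `diagnose` helper and the values fed to it are hypothetical stand-ins for one real batch):

```python
import numpy as np

def diagnose(logits, labels, ignore_index=-100):
    """Summarize one batch's logits/labels before training starts."""
    return {
        "finite_logits": bool(np.isfinite(logits).all()),   # False => overflow/NaN
        "n_supervised": int((labels != ignore_index).sum()),  # 0 => loss can be 0
        "logit_dtype": str(logits.dtype),                     # float16 is fragile
    }

# Synthetic stand-ins for a real batch pulled from the dataloader.
logits = np.zeros((4, 10), dtype=np.float16)
labels = np.full(4, -100)
print(diagnose(logits, labels))
# {'finite_logits': True, 'n_supervised': 0, 'logit_dtype': 'float16'}
```

A report like the one above (zero supervised labels, half-precision logits) would point at data preparation or dtype choice rather than the model itself.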

@sanipanwala
Author

Hi @sssszh ,

No, I haven't found any solution yet.

Thanks,
Sani
