
Does it support Arabic? #10

Open
Qt4arab opened this issue Feb 7, 2024 · 3 comments

Comments

@Qt4arab commented Feb 7, 2024

I have a 50k high-quality Arabic dataset. Is it possible to train the model on Arabic?

@sidroopdaska (Contributor)

See the comment here: #6

@vatsalaggarwal (Contributor)

I've added some initial pointers to this here: #70 (comment)

@lucapericlp (Contributor) commented Mar 14, 2024

Hey @Qt4arab, we've just published an initial approach for finetuning the last N transformer blocks of the first-stage LLM. It's best to play around with the hyperparameters in finetune_params.py, as we didn't determine the optimal set. Let us know if you run into any issues, or if you're up for contributing improvements (via a param sweep or otherwise)! A rough sketch of the idea is below.
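
To make the approach concrete: freeze every parameter, then re-enable gradients for only the last N transformer blocks and optimize those. This is a minimal PyTorch sketch of that idea; the `TinyLM` class, its attribute names, and the learning rate are illustrative placeholders, not the repo's actual code (the real knobs live in finetune_params.py).

```python
import torch
import torch.nn as nn

def freeze_all_but_last_n(model: nn.Module, blocks: nn.ModuleList, n: int) -> None:
    """Freeze every parameter, then re-enable grads for the last n blocks."""
    for p in model.parameters():
        p.requires_grad = False
    for block in blocks[-n:]:
        for p in block.parameters():
            p.requires_grad = True

# Toy stand-in for the first-stage LLM (names are hypothetical).
class TinyLM(nn.Module):
    def __init__(self, dim: int = 64, depth: int = 8):
        super().__init__()
        self.embed = nn.Embedding(256, dim)
        self.blocks = nn.ModuleList(
            nn.TransformerEncoderLayer(dim, nhead=4, batch_first=True)
            for _ in range(depth)
        )
        self.head = nn.Linear(dim, 256)

model = TinyLM()
freeze_all_but_last_n(model, model.blocks, n=2)

# Hand only the unfrozen parameters to the optimizer; the learning rate
# here is a placeholder, not a recommended value.
optimizer = torch.optim.AdamW(
    (p for p in model.parameters() if p.requires_grad), lr=3e-5
)
```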

The next step to improve finetuning effectiveness is to add LoRA adapters for the first-stage LLM, which is being worked on here.
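
For background, a LoRA adapter keeps a pretrained weight matrix frozen and learns a small low-rank additive update on top of it, which cuts the trainable parameter count dramatically. The sketch below shows only the core idea; `LoRALinear` and its rank/alpha defaults are illustrative assumptions, not the implementation from the linked work.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """A frozen nn.Linear plus a trainable low-rank update: y = Wx + (alpha/r) * B(Ax)."""
    def __init__(self, base: nn.Linear, rank: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False
        # A starts small and B at zero, so training begins at the base weights.
        self.lora_a = nn.Parameter(torch.randn(rank, base.in_features) * 0.01)
        self.lora_b = nn.Parameter(torch.zeros(base.out_features, rank))
        self.scale = alpha / rank

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + (x @ self.lora_a.T @ self.lora_b.T) * self.scale

# Example: wrap a projection layer (shapes are hypothetical).
proj = nn.Linear(512, 512)
adapted = LoRALinear(proj, rank=8)
out = adapted(torch.randn(2, 16, 512))  # (batch, seq, dim) -> same shape
```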
