
test model on other languages #56

Open
wonderfultina opened this issue Nov 7, 2019 · 5 comments

@wonderfultina

Hi,
I want to ask a question: if I want to take a model trained on English and use it to test on other languages, how do I run the code?

@PiotrCzapla
Member

Do you mean in the form of zero-shot transfer learning?

If so, we use LASER for that. First we train LASER to obtain zero-shot predictions for the other languages.
Then we use those zero-shot predictions to train a regular MultiFiT model (pretrained in the language that we are testing on). The unsupervised pretraining removes noise from the LASER zero-shot predictions and improves the results.
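A minimal sketch of that regime, for readers who want the shape of it in code: the LASER embeddings here come from the public laserembeddings package, and finetune_multifit is a hypothetical placeholder for MultiFiT fine-tuning, not this repo's actual API.

```python
# Hedged sketch of the LASER-teacher / MultiFiT-student regime above.
# Requires: pip install laserembeddings scikit-learn
#           python -m laserembeddings download-models
from laserembeddings import Laser
from sklearn.linear_model import LogisticRegression

laser = Laser()

# Toy labelled source data (EN) and unlabelled target data (DE).
en_texts, en_labels = ["great record", "awful noise"], [1, 0]
de_texts = ["tolles Album", "schreckliches Geräusch"]

# 1. Train a classifier on LASER embeddings of the labelled EN data.
clf = LogisticRegression(max_iter=1000)
clf.fit(laser.embed_sentences(en_texts, lang="en"), en_labels)

# 2. Zero-shot step: LASER embeddings are cross-lingual, so the
#    EN-trained classifier can pseudo-label the German texts directly.
pseudo_labels = clf.predict(laser.embed_sentences(de_texts, lang="de"))

# 3. Train a regular MultiFiT classifier (pretrained on German) on the
#    pseudo-labels; the unsupervised pretraining denoises them.
# finetune_multifit(de_texts, pseudo_labels, pretrained_lm="de")  # hypothetical
```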

@wonderfultina
Author

I understand now, thank you.

@vhargitai

Hi @PiotrCzapla , have you or your colleagues already pretrained this model on English Wikipedia?

If not, would using prepare_wiki-en.sh to grab wikitext-103 and then running postprocess_wikitext.py on it be identical to the dataset preparation you did for the other languages in the MultiFiT paper?

I'd like to reproduce the monolingual supervised training procedure in the MultiFiT paper for English language classification. Thanks in advance!
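For concreteness, here is the sequence this question describes, as a hedged sketch; the path handed to postprocess_wikitext.py is an assumed example, not taken from the scripts themselves.

```python
# Hedged sketch of the wikitext-103 preparation asked about above.
import subprocess

# Download and extract wikitext-103 with the repo's shell script.
subprocess.run(["bash", "prepare_wiki-en.sh"], check=True)

# Post-process the extracted text the same way as for the other languages.
# The input path is an assumption for illustration.
subprocess.run(
    ["python", "postprocess_wikitext.py", "data/wiki/wikitext-103"],
    check=True,
)
```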

@mhajiaghayi

Do you mean in the form of zero-shot transfer learning?
If so, we use LASER for that. First we train LASER to obtain zero-shot predictions for the other languages.
Then we use those zero-shot predictions to train a regular MultiFiT model (pretrained in the language that we are testing on). The unsupervised pretraining removes noise from the LASER zero-shot predictions and improves the results.

Q) In this case, you don't have a single model with a fixed tokenization that does zero-shot embedding for other languages. Am I right?

@iNeil77

iNeil77 commented Jul 27, 2020

Do you mean in the form of zero-shot transfer learning?

If so, we use LASER for that. First we train LASER to obtain zero-shot predictions for the other languages.
Then we use those zero-shot predictions to train a regular MultiFiT model (pretrained in the language that we are testing on). The unsupervised pretraining removes noise from the LASER zero-shot predictions and improves the results.

In the CLS-DE notebook I only see the classifier fine-tuning happening with DE music (data, label) pairs. But if I understand you correctly, shouldn't the LASER classifier first be fine-tuned on EN music data before it can act as a teacher for the DE classifier? I don't see that in the notebook. Am I misunderstanding the training regime?
