Highly overfit to training dataset #46

kikirizki · 2022-10-24T08:58:22Z

Hi @baudm thank you for your great works, I trained parseq-tiny model with Focused Scene Text Dataset + Incidental Scene Text Dataset, from (https://rrc.cvc.uab.es/), it contains around 4000+ images, after I trained for 300 epochs with default hyperparameter from this repo, it perform very well for training dataset and perform very poor for new unseen data, It seem the language model part overfit because when I tried new dataset, the wrong output usually are text that available in the training dataset, what do you think, do I need more dataset

baudm · 2022-11-03T09:43:56Z

A dataset that small + training schedule that long would definitely result in overfitting.

Don't use the default hyperparameters.
Try decoding with decode_ar=False and refine_iters=0.

kikirizki · 2022-11-03T10:21:31Z

@baudm Thank you so much for your response I will try it

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Highly overfit to training dataset #46

Highly overfit to training dataset #46

kikirizki commented Oct 24, 2022

baudm commented Nov 3, 2022

kikirizki commented Nov 3, 2022

Highly overfit to training dataset #46

Highly overfit to training dataset #46

Comments

kikirizki commented Oct 24, 2022

baudm commented Nov 3, 2022

kikirizki commented Nov 3, 2022