Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Highly overfit to training dataset #46

Open
kikirizki opened this issue Oct 24, 2022 · 2 comments
Open

Highly overfit to training dataset #46

kikirizki opened this issue Oct 24, 2022 · 2 comments

Comments

@kikirizki
Copy link

Hi @baudm thank you for your great works, I trained parseq-tiny model with Focused Scene Text Dataset + Incidental Scene Text Dataset, from (https://rrc.cvc.uab.es/), it contains around 4000+ images, after I trained for 300 epochs with default hyperparameter from this repo, it perform very well for training dataset and perform very poor for new unseen data, It seem the language model part overfit because when I tried new dataset, the wrong output usually are text that available in the training dataset, what do you think, do I need more dataset

@baudm
Copy link
Owner

baudm commented Nov 3, 2022

A dataset that small + training schedule that long would definitely result in overfitting.

  1. Don't use the default hyperparameters.
  2. Try decoding with decode_ar=False and refine_iters=0.

@kikirizki
Copy link
Author

@baudm Thank you so much for your response I will try it

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants