Skip to content

ch06 model overfitting comment #145

Closed Answered by rasbt
nicolaleo asked this question in Q&A
Discussion options

You must be logged in to vote

Thanks for the comment. You raise a good point. I am not sure if a larger dev set could potentially solve it (it might, but that's a good hypothesis). I'd say the overfitting to the training set (and to the validation/dev set) could also come from tweaking hyperparameters slightly to get a good validation set performance.

I modified the note as follows:

  • However, based on the slightly lower test set performance, we can see that the model overfits the training data to a very small degree, as well as the validation data that has been used for tweaking some of the hyperparameters, such as the learning rate

(However, these are only short notes in the Jupyter notebook; I will say more in th…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by rasbt
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants