Skip to content

What is actually train loss and train accuracy? #281

Answered by AlexHarn
AaronBlare asked this question in Q&A
Discussion options

You must be logged in to vote

Hi! I would not expect them to be the same. I would expect the test metrics to be better. I see two main reasons for differences in test vs train metrics when using the same data:

  1. The train metrics are averaged over the epoch while the test metrics are calculated with the weights at the end of the epoch. Since the model is being trained and improving with each batch during an epoch, it is performing better at the end of the epoch. Therefore the average over the training epoch (train metrics) will be worse.

  2. Things like dropout and batch normalization behave differently during training and testing. The example net in this repository does not use dropout, but it uses batch norm. The para…

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@AaronBlare
Comment options

Answer selected by AaronBlare
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants