Why is performance highly degraded when performing model.eval()? #79
Comments
I found that the BNs are set to requires_grad = False, but they will still update their running_mean and running_var. So what's the point of doing this?
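This behavior is easy to reproduce: setting requires_grad = False only freezes the affine parameters (gamma/beta); in train mode, BatchNorm still updates its running statistics on every forward pass. A minimal sketch (not from this repo, just illustrating the PyTorch behavior the comment describes):

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

bn = nn.BatchNorm1d(4)
# Freeze the learnable affine parameters (gamma and beta):
# they will no longer receive gradients.
for p in bn.parameters():
    p.requires_grad = False

before = bn.running_mean.clone()
bn.train()                       # train mode: running stats are still tracked
_ = bn(torch.randn(8, 4) + 5.0)  # forward pass with non-zero-mean data
after = bn.running_mean.clone()

print(torch.allclose(before, after))  # False: running_mean moved toward the batch mean
```

So freezing gradients and freezing the running statistics are independent; the latter is controlled by the module's train/eval mode (or track_running_stats), not by requires_grad.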
It seems that the gamma and beta in BatchNorm are still updated (we also found this before), but we cannot control it. Some degradation on the source domain is natural compared to a model without target-domain alignment. However, it should not produce something super bad, since there is still a supervised loss on the source.
No, the result is super bad. Maybe this is due to the large domain gap. In the last iterations we train the model on the target domain, so the BatchNorm statistics (running_mean, running_var) adapt to the target domain at the same time. When we then test on the source domain, the BatchNorm statistics no longer match that domain, and as a result performance drops.
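One common workaround for this (a sketch of a general PyTorch technique, not something from this repo) is to put only the BatchNorm layers into eval mode during target-domain updates, so their running statistics stop drifting away from the source domain while the rest of the model keeps training:

```python
import torch
import torch.nn as nn

def freeze_bn_stats(model: nn.Module) -> None:
    """Put only the BatchNorm layers into eval mode so their
    running_mean / running_var stop updating, while all other
    layers remain in whatever mode the model is in."""
    for m in model.modules():
        if isinstance(m, (nn.BatchNorm1d, nn.BatchNorm2d, nn.BatchNorm3d)):
            m.eval()

# Usage sketch: call after model.train(), before target-domain batches.
model = nn.Sequential(nn.Linear(4, 4), nn.BatchNorm1d(4))
model.train()
freeze_bn_stats(model)  # target batches no longer shift the BN statistics
```

Whether freezing the stats is the right trade-off depends on the method: some adaptation approaches deliberately rely on target-domain BN statistics, in which case source-domain evaluation under model.eval() will suffer exactly as described above.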
I am adapting from a light dataset to a darker one. When I run the test on the source domain, I find that performance is highly degraded when performing model.eval(), but this doesn't happen on the target domain. It is quite weird. My PyTorch version is 1.0.