Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Initial Weights Initialization #92

Open
Nadavc220 opened this issue Nov 24, 2020 · 3 comments
Open

Initial Weights Initialization #92

Nadavc220 opened this issue Nov 24, 2020 · 3 comments

Comments

@Nadavc220
Copy link

When training the model you are initialzing the model weights with the weights found here:
'http://vllab.ucmerced.edu/ytsai/CVPR18/DeepLab_resnet_pretrained_init-f81d91e8.pth'

The paper states that the initial baseline is a Deeplab pre-trained on ImageNet dataset. Are these weights the ImageNet pre trained or are you using a GTA5 pretrained network to initialize the model?

Thanks.

@wasidennis
Copy link
Owner

@Nadavc220 sorry for the confusion. In this repo, the pre-trained weight for the VGG backbone is on ImageNet, while the one for the ResNet is on ImageNet + COCO (obtained from DeepLab). In practice, we found that using ImageNet pre-trained weight converges slower but will eventually achieve a similar result as ImageNet + COCO. Similarly, if using a GTA5 pre-trained weight, it should also achieve a similar result with faster convergency.

@Nadavc220
Copy link
Author

Thanks for the quick response.
In general, don't you thing there is a difference initializing the net with ImageNet + COCO than initializing with a GTA5 trained net other than convergence speed? Did you test this theory or is it just an idea?

Thanks

@wasidennis
Copy link
Owner

@Nadavc220 This is a good point! Internally, the learning behavior would be quite different as we need to consider their domain gaps. For example, initializing from GTA5 could provide a more stable training procedure, as GTA5 is already a driving-scene dataset. However, since GTA5 also has a large domain gap to Cityscapes, pre-training on GTA5 for too many iterations would not be also a good practice (fitting too much in the GTA5 data distribution). This is something we already empirically tried, but of course it is still an open research problem to explore. To make it simpler, we just use the ImageNet (+COCO) as the pre-trained weight.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants