Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

some other networks do not converge in the second stage. #57

Open
wqz960 opened this issue Aug 3, 2019 · 3 comments
Open

some other networks do not converge in the second stage. #57

wqz960 opened this issue Aug 3, 2019 · 3 comments

Comments

@wqz960
Copy link

wqz960 commented Aug 3, 2019

Hi! Thank you for your excellent work, I just replaced the network which is vgg in your original network with Resnet or some others for my interest. during the first-stage training, the network shows almost the same performance with vgg. But in the second-stage training. it seems that the network no longer converges, and the effect is even worse than that in the first stage. According to the concept of heatmaps and feature maps in your paper, there should be some improvement in the accuracy in the second stage, but during my training in the second stage, the validation accuracy has decreased. By the way, in the second stage, I deleted the face alignment operation, the connected section only contains the heatmaps and feature map process second. whether this is the cause of the problem? In my opinion, it does not matter. Looking forward for your responds!!!

Thanks!!!

@MarekKowalski
Copy link
Owner

MarekKowalski commented Aug 3, 2019

Hi,

Do you mean that the second stage no longer gets a face image as input? If that's the case then it probably won't work.
You can provide the second stage with a non-aligned image, essentially the same as intput to the first stage. However, if you do that, remember that the heatmap should also be generated from landmarks that have not been processed by the connection layers.

If that's what you are doing then it should work fine.

Best regards,

Marek

@wqz960
Copy link
Author

wqz960 commented Aug 3, 2019

hi! Thank you for your quick respond. The feed to the second stage is a cropped face image which is not aligned, the feature map and the heat maps generate from the non-aligned landmarks . But in that case the network doesn’t converge. By the way, the input size, I have modified to 2562563, does the different input sizes cause the problem? can you provide me some advice to solve this situation? Thank you!

@wqz960
Copy link
Author

wqz960 commented Aug 4, 2019

sorry to take your time again! I saw some other network including yours, when the images change from the original size to 112 size, the groundtrue landmarks should minus 1 because of the affine tranforms. when should I minus 1 during training? when changing the original size to 256 size, whether the groundtrue need to minus 1?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants