Initialization of conjugate gradient in competitive_optimizer #255

Open
f-t-s opened this issue May 12, 2020 · 1 comment
f-t-s commented May 12, 2020

Hi,

I saw that you have implemented competitive gradient descent in your dev branch; thanks for giving it a try, @mikkel!
I just wanted to mention that you seem to have the same bug we recently found in our PyTorch implementation, which we will correct shortly.

In the definition of `mgeneral_conjugate_gradient`, you initialize the residual with

```python
r = [tf.identity(_b) for _b in b]
```

This is correct only if the initial guess is zero. Otherwise (with `A` the matrix to be inverted), it should be

```
r = b - A x_0
```

where `x_0` is the initial guess, see also here

Initializing the residual incorrectly can lead to different results; we found cases where this bug breaks CGD, while the fixed version performs well.
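To illustrate the fix, here is a minimal NumPy sketch of the conjugate gradient method (not the repository's TensorFlow code) with the residual initialized as `r = b - A @ x0`, so a nonzero warm start is handled correctly; the function name and signature are just for this example:

```python
import numpy as np

def conjugate_gradient(A, b, x0=None, tol=1e-10, max_iter=100):
    """Solve A x = b for symmetric positive-definite A.

    The residual is initialized as r = b - A @ x0, which reduces to
    the buggy r = b only when the initial guess x0 is zero.
    """
    x = np.zeros_like(b) if x0 is None else x0.astype(float)
    r = b - A @ x          # correct residual for a nonzero initial guess
    p = r.copy()
    rs_old = r @ r
    for _ in range(max_iter):
        if np.sqrt(rs_old) < tol:
            break
        Ap = A @ p
        alpha = rs_old / (p @ Ap)
        x = x + alpha * p
        r = r - alpha * Ap
        rs_new = r @ r
        p = r + (rs_new / rs_old) * p
        rs_old = rs_new
    return x

# With a nonzero warm start, initializing r = b would make the
# search directions correspond to the wrong residual.
A = np.array([[4.0, 1.0], [1.0, 3.0]])
b = np.array([1.0, 2.0])
x = conjugate_gradient(A, b, x0=np.array([2.0, 1.0]))
```
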

Contributor

martyn commented Jul 30, 2020

Sorry for missing this. Conjugate gradient is awesome, thank you for your work! We've rewritten hypergan in pytorch and are working to re-add conjugate gradient.
