
Unexpected shape issue in Hessian-Vector computation #8

Open

stalhabukhari opened this issue Feb 1, 2021 · 4 comments
@stalhabukhari

stalhabukhari commented Feb 1, 2021

Hi!

Thank you for making the source code of your work available. I tried to use the library for an application involving a 3D network architecture, and ran into the following issue:

```
********** Commencing Hessian Computation **********
Traceback (most recent call last):
  File "hessian_analysis.py", line 181, in <module>
    hessianObj.analyze(model_checkpoint_filepath)
  File "/media/ee/DATA/Repositories/PyHessian/hessian_analysis.py", line 70, in analyze
    top_eigenvalues, top_eigenvectors = hessian_comp.eigenvalues(top_n=self.top_n)
  File "/media/ee/DATA/Repositories/PyHessian/pyhessian/hessian.py", line 167, in eigenvalues
    Hv = hessian_vector_product(self.gradsH, self.params, v)
  File "/media/ee/DATA/Repositories/PyHessian/pyhessian/utils.py", line 88, in hessian_vector_product
    retain_graph=True)
  File "/home/ee/anaconda3/envs/torch13/lib/python3.6/site-packages/torch/autograd/__init__.py", line 197, in grad
    grad_outputs_ = _make_grads(outputs, grad_outputs_)
  File "/home/ee/anaconda3/envs/torch13/lib/python3.6/site-packages/torch/autograd/__init__.py", line 32, in _make_grads
    if not out.shape == grad.shape:
AttributeError: 'float' object has no attribute 'shape'
```

Interestingly, the issue does not occur at the first call to back-propagation via loss.backward(); rather, it occurs at the subsequent call to torch.autograd.grad().

I believe that the float object in question is the 0. manually inserted when param.grad is None in the following routine:

```python
def get_params_grad(model):
    """
    get model parameters and corresponding gradients
    """
    params = []
    grads = []
    for param in model.parameters():
        if not param.requires_grad:
            continue
        params.append(param)
        grads.append(0. if param.grad is None else param.grad + 0.)
    return params, grads
```

If I am right, it is even more puzzling that a plain Python float is able to pass PyTorch's data-type checks at all (at one point I had also mistakenly swapped the outputs and inputs arguments of torch.autograd.grad). Kindly advise on what I can do here.
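For what it's worth, the failure is easy to reproduce outside PyHessian. The following sketch (my own minimal example, not PyHessian code) passes a plain float where torch.autograd.grad expects a tensor in `outputs`, mimicking the `0.` placeholder; the exact exception type may vary across PyTorch versions, but it fails inside the gradient machinery rather than at the call site:

```python
import torch

x = torch.tensor([1.0, 2.0], requires_grad=True)
loss = (x ** 2).sum()

# Normal usage: `outputs` is a tensor, so this works and returns d(loss)/dx.
(grad,) = torch.autograd.grad(loss, x, create_graph=True)

# Sneaking a plain float into `outputs` (as the 0. stand-in for a missing
# gradient effectively does) fails inside PyTorch, which expects every
# output to be a tensor (e.g. _make_grads accesses out.shape).
try:
    torch.autograd.grad([0.0], [x], grad_outputs=[torch.ones_like(x)])
except (AttributeError, TypeError, RuntimeError) as exc:
    print(type(exc).__name__)
```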

P.S. hessian_analysis.py is a wrapper I wrote around the library for my use case. I verified the wrapper by running a 2-layer neural network on a regression task.

@liujingcs

Hi, I have faced the same issue. Have you solved this issue?

@stalhabukhari
Author

@liujingcs Ah! It has been a long while. I think I upgraded the PyTorch version (probably 1.8).

If nothing works, you may want to check out this work by the same group: https://github.com/noahgolmant/pytorch-hessian-eigenthings

@hubery1619

> Hi, I have faced the same issue. Have you solved this issue?

Hi, I wonder if you have solved this issue. Thanks so much.

@yxiao54

yxiao54 commented Jun 7, 2023

Hi guys, I met the same issue and just figured it out. In my case, some layers were defined in the model but did not participate in the forward or backward pass, so their parameters never received gradients. The issue was fixed after I deleted the unused layers.
Another way to resolve it is to modify the get_params_grad function in the utils of the pyhessian library: when param.grad is None, the appended entry should be a tensor of zeros matching the parameter's shape, instead of a float zero.
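A sketch of that patched helper (my own adaptation of pyhessian's get_params_grad, not the library's shipped code), using torch.zeros_like so every entry in grads is a tensor of the right shape:

```python
import torch

def get_params_grad(model):
    """Get model parameters and their corresponding gradients.

    When a parameter never received a gradient (param.grad is None),
    e.g. because its layer did not participate in the backward pass,
    substitute a zero tensor of the same shape instead of a float 0.,
    so downstream calls to torch.autograd.grad only ever see tensors.
    """
    params, grads = [], []
    for param in model.parameters():
        if not param.requires_grad:
            continue
        params.append(param)
        grads.append(torch.zeros_like(param) if param.grad is None
                     else param.grad + 0.)
    return params, grads
```

With this change, parameters of unused layers contribute zero blocks to the Hessian-vector product instead of crashing the shape check in _make_grads.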
