File "/root/miniconda3/envs/py3.10/lib/python3.10/site-packages/bitsandbytes/autograd/_functions.py", line 516, in forward
output = torch.nn.functional.linear(A, F.dequantize_4bit(B, quant_state).to(A.dtype).t(), bias)
RuntimeError: mat1 and mat2 shapes cannot be multiplied (512x4096 and 1x4194304)
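The shapes in the error above can be checked by hand. The following is a minimal sketch in plain Python; the helper `matmul_shape` is hypothetical and no bitsandbytes or PyTorch calls are involved. It shows that the 4194304 elements would fill a 4096x1024 matrix exactly, which suggests, without confirming, that the dequantized weight arrived flattened rather than in a 2-D shape.

```python
def matmul_shape(a, b):
    """Return the shape of a 2-D matrix product, or raise if the inner dims differ.

    Hypothetical helper for illustration only; it mimics the wording of the
    error message in the traceback but is not part of any library.
    """
    (m, k1), (k2, n) = a, b
    if k1 != k2:
        raise ValueError(
            f"mat1 and mat2 shapes cannot be multiplied ({m}x{k1} and {k2}x{n})"
        )
    return (m, n)


mat1 = (512, 4096)       # activation shape from the traceback
mat2 = (1, 4194304)      # weight shape from the traceback

# The shapes as reported are incompatible: 4096 != 1.
try:
    matmul_shape(mat1, mat2)
    message = None
except ValueError as err:
    message = str(err)

# Assumption: the weight should have 4096 rows to match mat1's inner dimension.
rows = mat1[1]
cols, remainder = divmod(mat2[1], rows)
reshaped = (rows, cols)

# With that 2-D shape the product is well defined.
result = matmul_shape(mat1, reshaped)
print(message)
print(reshaped, remainder, result)
```

This is only arithmetic on the reported shapes; it does not establish where in the DeepSpeed or bitsandbytes code path the 2-D shape is lost.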
with FSDP:
File "/root/miniconda3/envs/py3.10/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1527, in _call_impl
return forward_call(*args, **kwargs)
File "/root/miniconda3/envs/py3.10/lib/python3.10/site-packages/accelerate/hooks.py", line 166, in new_forward
output = module._old_forward(*args, **kwargs)
File "/root/miniconda3/envs/py3.10/lib/python3.10/site-packages/transformers/models/llama/modeling_llama.py", line 436, in forward
key_states = self.k_proj(hidden_states)
File "/root/miniconda3/envs/py3.10/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1518, in _wrapped_call_impl
File "/root/miniconda3/envs/py3.10/lib/python3.10/site-packages/peft/tuners/lora/bnb.py", line 476, in forward
output = self._apply_dora(x, lora_A, lora_B, scaling, active_adapter)
File "/root/miniconda3/envs/py3.10/lib/python3.10/site-packages/peft/tuners/lora/layer.py", line 226, in _apply_dora
lora_weight = lora_B.weight @ lora_A.weight
RuntimeError: inconsistent tensor size, expected tensor [820] and src [3277] to have the same number of elements, but got 820 and 3277 elements respectively
System Info
Who can help?
No response
Information
Tasks
examples folder
Reproduction
Expected behavior
As reported by @winglian, with DeepSpeed ZeRO-3 and with FSDP (tracebacks above).