Using torch.bfloat16 to prevent overflow instead of default fp16 in AMP #345

Open
wants to merge 1 commit into base: main
Conversation

rajeevgl01

Using torch.bfloat16 to prevent overflow. Float16 has three fewer exponent bits than bfloat16, which gives it a much smaller dynamic range and causes NaN loss and NaN gradient norms during AMP training. This seems to be a common issue when training the Swin Transformer.
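
A minimal sketch of the change, assuming a standard PyTorch AMP training step (the model, optimizer, and data below are placeholders, not taken from this repo): passing `dtype=torch.bfloat16` to `torch.autocast` keeps the FP32 exponent range, so a `GradScaler`-style loss scaler is not required.

```python
import torch
import torch.nn.functional as F

# Placeholder model/optimizer/data, only to illustrate the autocast change.
model = torch.nn.Linear(128, 10).cuda()
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
inputs = torch.randn(32, 128, device="cuda")
targets = torch.randint(0, 10, (32,), device="cuda")

# bfloat16 shares FP32's 8 exponent bits, so overflow (and hence NaN loss /
# NaN grad norms) is far less likely than with float16, and no loss scaler is needed.
with torch.autocast(device_type="cuda", dtype=torch.bfloat16):
    outputs = model(inputs)
    loss = F.cross_entropy(outputs, targets)

loss.backward()
optimizer.step()
optimizer.zero_grad()
```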

BFloat16 has the same number of exponent bits as FP32 but fewer mantissa (precision) bits. If we want higher precision while still saving GPU memory, TensorFloat-32 (TF32) can be used instead.

TF32 has fewer mantissa bits than FP32, but three more exponent bits than FP16. However, TF32 is only available on NVIDIA Ampere GPUs or newer.
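
If TF32 is preferred, it is enabled through backend flags rather than through autocast; a sketch, assuming a recent PyTorch build on an Ampere-or-newer GPU:

```python
import torch

# Allow TensorFloat-32 for matmuls and cuDNN convolutions.
# TF32 keeps FP32's 8 exponent bits but truncates the mantissa to 10 bits,
# trading some precision for throughput; requires Ampere (SM 8.0) or newer.
torch.backends.cuda.matmul.allow_tf32 = True
torch.backends.cudnn.allow_tf32 = True
```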
