Intel GPU: specify the tolerance for torchbench models (#125213)

We encountered some model accuracy failures as the tolerance is critical. In general, we align with CUDA practice. This PR intends to adjust the tolerance for Torchbench models for training mode on Intel GPU devices and aligns with CUDA. Pull Request resolved: #125213 Approved by: https://github.com/desertfire
pytorch · May 3, 2024 · 49dfbc5 · 49dfbc5
1 parent 6adb14f
commit 49dfbc5
Showing 1 changed file with 1 addition and 1 deletion.
diff --git a/benchmarks/dynamo/torchbench.py b/benchmarks/dynamo/torchbench.py
@@ -402,7 +402,7 @@ def get_tolerance_and_cosine_flag(self, is_training, current_device, name):
  if name in self._tolerance["higher_bf16"]:
  return 1e-2, cosine
 
- if is_training and current_device == "cuda":
+ if is_training and (current_device == "cuda" or current_device == "xpu"):
  tolerance = 1e-3
  if name in self._tolerance["cosine"]:
  cosine = True