Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is convolution node 486 in Faster R-CNN working fine? #620

Open
Apisteftos opened this issue Jul 14, 2023 · 0 comments
Open

Is convolution node 486 in Faster R-CNN working fine? #620

Apisteftos opened this issue Jul 14, 2023 · 0 comments
Labels

Comments

@Apisteftos
Copy link

Bug Report

Which model does this pertain to?

Model faster R-CNN Opset 12

Describe the bug

I am doing profiling with faster RCNN and calculating the Throughput in TOPs is 1321 TOPs which is really high over the limits of the NVIDIA A100 GPU. Can somebody explain me if the model works properly?

Reproduction instructions

System Information

OS Platform and Distribution (Linux Ubuntu 22.04):
ONNX version (1.14):
Backend/Runtime version (Onnexruntime 1.15):

Here my profiling data:

FP32
dur: 70
486_kernel_time
output_type_shape: ( 1, 256, 200, 392)
input_type_shape: (1, 256, 200, 392)
kernel_shape : (256, 256, 3, 3)
bias: 256
provider: CUDAExecutionProvider
op_name: Conv
Throughput: 1321 TOPs

Notes

A100 specs is: Peak FP32 TFLOPS (non-Tensor) = 19.5

@Apisteftos Apisteftos added the bug label Jul 14, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant