TensorRT10 with JetPack 6.0 Docs update #11779

Burhan-Q · 2024-05-08T19:21:19Z

Adds CLI example for exporting to TensorRT with INT8 quantization (thanks @lakshanthad for raising this)
Expands the Python example for exporting to TensorRT with INT8 quantization
Moves classification metrics to appropriate row
Adds TensorRT 10 and JetPack 6 benchmarks for Jetson Orin NX (@lakshanthad)

🛠️ PR Summary

Made with ❤️ by Ultralytics Actions

🌟 Summary

Enhancements to TensorRT Documentation and Performance Updates

📊 Key Changes

Documentation Update: Revised and expanded the TensorRT integration guide to include more detailed examples for Python and CLI.
Performance Benchmarks: Updated performance benchmarks for various model precisions (FP32, FP16, INT8) on the NVIDIA Jetson Orin NX 16GB, showcasing improvements in inference times.
Enhanced Export Options: Now explicitly includes steps to export models with dynamic axes and INT8 quantization, plus tips on maximizing batch sizes and memory allocation for better performance.
Intuitive Examples: Added clear examples for exporting a YOLO model to TensorRT format and running inference with the exported model.
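
As a rough illustration of the export options listed above, the sketch below collects the INT8 export settings in one place and renders them as both a `yolo export` command and a Python call. The specific values (`batch=8`, `workspace=4`, `coco.yaml`, `yolov8n.pt`) and the helper names `to_cli` and `export_and_predict` are assumptions made for this example, not text taken from the PR diff. The engine path is assumed to sit next to the weights file with an `.engine` suffix.

```python
from pathlib import Path
from typing import Any, Dict

# Illustrative export settings (assumed values, not copied from the PR diff).
INT8_EXPORT_ARGS: Dict[str, Any] = {
    "format": "engine",   # TensorRT engine output
    "dynamic": True,      # dynamic axes
    "batch": 8,           # largest batch size the engine should support
    "workspace": 4,       # workspace size in GiB used during conversion
    "int8": True,         # INT8 post-training quantization
    "data": "coco.yaml",  # dataset used for INT8 calibration
}


def to_cli(model_path: str, args: Dict[str, Any]) -> str:
    """Render the export settings as a `yolo export` command line (hypothetical helper)."""
    parts = [f"{key}={value}" for key, value in args.items()]
    return " ".join(["yolo", "export", f"model={model_path}", *parts])


def export_and_predict(model_path: str, source: str, args: Dict[str, Any]):
    """Export to TensorRT, then run inference with the exported engine (hypothetical helper)."""
    # Imported lazily: needs the ultralytics package plus a TensorRT-capable GPU.
    from ultralytics import YOLO

    YOLO(model_path).export(**args)
    # Assumption: the engine is written next to the weights with an .engine suffix.
    engine_path = Path(model_path).with_suffix(".engine")
    trt_model = YOLO(str(engine_path), task="detect")
    return trt_model.predict(source)


if __name__ == "__main__":
    print(to_cli("yolov8n.pt", INT8_EXPORT_ARGS))
```

Keeping the settings in a single dictionary means the CLI and Python forms cannot drift apart. Only `to_cli` runs without a GPU; `export_and_predict` is meant to be called on the target device.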

🎯 Purpose & Impact

Smoother TensorRT Integration: With detailed examples and clear documentation, users can more easily integrate Ultralytics models with NVIDIA's TensorRT for enhanced performance.
Improved Inference Speed: Updated benchmarks demonstrate the efficiency gains possible with the latest software versions, useful for those deploying models on NVIDIA hardware.
Flexibility in Model Deployment: The additional details on model export options give developers better tools for optimizing their models for specific hardware, leading to faster, more efficient AI applications.

These changes aim to streamline the process for developers looking to leverage TensorRT's powerful optimization capabilities with Ultralytics models, ultimately leading to faster AI-driven insights and applications. 🚀

codecov · 2024-05-08T19:23:13Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 70.59%. Comparing base (303579c) to head (dd3d6ab).

❗ Current head dd3d6ab differs from pull request most recent head 3ad4504

Please upload reports for the commit 3ad4504 to get more accurate results.

Additional details and impacted files

@@             Coverage Diff             @@
##             main   #11779       +/-   ##
===========================================
+ Coverage   37.29%   70.59%   +33.30%     
===========================================
  Files         122      122               
  Lines       15636    15636               
===========================================
+ Hits         5831    11039     +5208     
+ Misses       9805     4597     -5208
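
The figures in the diff above can be checked by hand: coverage is hits divided by lines, and misses are lines minus hits. The sketch below reproduces the reported percentages, assuming the report truncates to two decimals rather than rounding (an assumption inferred from the numbers, since 11039 / 15636 is roughly 70.5999%). The function name `coverage_pct` is hypothetical.

```python
def coverage_pct(hits: int, lines: int) -> str:
    """Coverage as a percentage string, truncated (not rounded) to two decimals."""
    hundredths = hits * 10000 // lines  # integer math avoids float rounding
    return f"{hundredths // 100}.{hundredths % 100:02d}"


LINES = 15636
BASE_HITS, HEAD_HITS = 5831, 11039

print(coverage_pct(BASE_HITS, LINES))      # base coverage
print(coverage_pct(HEAD_HITS, LINES))      # head coverage
print(LINES - BASE_HITS, LINES - HEAD_HITS)  # misses for base and head
print(HEAD_HITS - BASE_HITS)               # change in hits
```

With the numbers from the table this gives 37.29 and 70.59 for coverage, 9805 and 4597 for misses, and 5208 for the change in hits, matching the report.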

Flag	Coverage Δ
Benchmarks	`35.50% <ø> (?)`
GPU	`37.27% <ø> (-0.02%)`	⬇️
Tests	`66.74% <ø> (?)`

Flags with carried forward coverage won't be shown.


lakshanthad · 2024-05-16T07:14:29Z

@Burhan-Q JP6.0 with TRT10 benchmarks for Jetson Orin NX are updated!

Burhan-Q · 2024-05-16T22:13:18Z

@lakshanthad I was actually thinking it could be good to keep both results (add a tab for the new version). What do you think tho? Would it be worth showing results for an older + newer version? The difference isn't that big, so probably okay to just use the TensorRT10 numbers.

lakshanthad · 2024-05-16T23:41:54Z

@Burhan-Q I initially had the same idea as yours, but yes, the difference isn't that big. That is why I just replaced the numbers.

glenn-jocher · 2024-05-17T17:02:45Z

@Burhan-Q @lakshanthad awesome guys, great updates. PR merged!!

Burhan-Q added 2 commits May 8, 2024 14:59

add CLI int8 example, move metrics into appropriate rows

b4758ca

expand python int8 example and clean up cli example

a18f251

Burhan-Q added the documentation Improvements or additions to documentation label May 8, 2024

Burhan-Q and others added 3 commits May 11, 2024 10:59

Merge branch 'main' into docs_trtint8

69fd650

Merge branch 'main' into docs_trtint8

8b21d4f

Update Jetson table with JP6.0, TRT10

9cd5219

Merge branch 'main' into docs_trtint8

c764680

Merge branch 'main' into docs_trtint8

dd3d6ab

Burhan-Q marked this pull request as ready for review May 17, 2024 14:08

Burhan-Q requested a review from glenn-jocher May 17, 2024 14:08

Merge branch 'main' into docs_trtint8

3ad4504

glenn-jocher changed the title ~~TensorRT docs page additions and fix up~~ TensorRT10 with JetPack 6.0 Docs update May 17, 2024

glenn-jocher merged commit 10b3564 into main May 17, 2024
13 checks passed

glenn-jocher deleted the docs_trtint8 branch May 17, 2024 17:02
