Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TensorRT10 with JetPack 6.0 Docs update #11779

Merged
merged 8 commits into from May 17, 2024
Merged

TensorRT10 with JetPack 6.0 Docs update #11779

merged 8 commits into from May 17, 2024

Conversation

Burhan-Q
Copy link
Member

@Burhan-Q Burhan-Q commented May 8, 2024

  • Adds CLI example for exporting to TensorRT with INT8 quantization (thanks @lakshanthad for raising this)
  • Expands python example for exporting TensorRT with INT8 quantization
  • Moves classification metrics to appropriate row
  • Adds TensorRT 10 and JetPack 6 benchmarks for Jetson Orin NX (@lakshanthad)

πŸ› οΈ PR Summary

Made with ❀️ by Ultralytics Actions

🌟 Summary

Enhancements to TensorRT Documentation and Performance Updates

πŸ“Š Key Changes

  • Documentation Update: Revised and expanded the TensorRT integration guide to include more detailed examples for Python and CLI.
  • Performance Benchmarks: Updated performance benchmarks for various model precisions (FP32, FP16, INT8) on the NVIDIA Jetson Orin NX 16GB, showcasing improvements in inference times.
  • Enhanced Export Options: Now explicitly includes steps to export models with dynamic axes and INT8 quantization, plus tips on maximizing batch sizes and memory allocation for better performance.
  • Intuitive Examples: Added clear examples for exporting a YOLO model to TensorRT format and running inference with the exported model.

🎯 Purpose & Impact

  • Smoother TensorRT Integration: With detailed examples and clear documentation, users can more easily integrate Ultralytics models with NVIDIA's TensorRT for enhanced performance.
  • Improved Inference Speed: Updated benchmarks demonstrate the efficiency gains possible with the latest software versions, useful for those deploying models on NVIDIA hardware.
  • Flexibility in Model Deployment: The additional details on model export options give developers better tools for optimizing their models for specific hardware, leading to faster, more efficient AI applications.

These changes aim to streamline the process for developers looking to leverage TensorRT's powerful optimization capabilities with Ultralytics models, ultimately leading to faster AI-driven insights and applications. πŸš€

@Burhan-Q Burhan-Q added the documentation Improvements or additions to documentation label May 8, 2024
Copy link

codecov bot commented May 8, 2024

Codecov Report

All modified and coverable lines are covered by tests βœ…

Project coverage is 70.59%. Comparing base (303579c) to head (dd3d6ab).

❗ Current head dd3d6ab differs from pull request most recent head 3ad4504

Please upload reports for the commit 3ad4504 to get more accurate results.

Additional details and impacted files
@@             Coverage Diff             @@
##             main   #11779       +/-   ##
===========================================
+ Coverage   37.29%   70.59%   +33.30%     
===========================================
  Files         122      122               
  Lines       15636    15636               
===========================================
+ Hits         5831    11039     +5208     
+ Misses       9805     4597     -5208     
Flag Coverage Ξ”
Benchmarks 35.50% <ΓΈ> (?)
GPU 37.27% <ΓΈ> (-0.02%) ⬇️
Tests 66.74% <ΓΈ> (?)

Flags with carried forward coverage won't be shown. Click here to find out more.

β˜” View full report in Codecov by Sentry.
πŸ“’ Have feedback on the report? Share it here.

@lakshanthad
Copy link
Collaborator

@Burhan-Q JP6.0 with TRT10 benchmarks for Jetson Orin NX are updated!

@Burhan-Q
Copy link
Member Author

@lakshanthad I was actually thinking it could be good to keep both results (add a tab for the new version). What do you think tho? Would it be worth showing results for an older + newer version? The difference isn't that big, so probably okay to just use the TensorRT10 numbers.

@lakshanthad
Copy link
Collaborator

@Burhan-Q I initially had the same idea as yours but the difference isn't that big yes. That is why I just replaced the numbers.

@Burhan-Q Burhan-Q marked this pull request as ready for review May 17, 2024 14:08
@glenn-jocher glenn-jocher changed the title TensorRT docs page additions and fix up TensorRT10 with JetPack 6.0 Docs update May 17, 2024
@glenn-jocher glenn-jocher merged commit 10b3564 into main May 17, 2024
13 checks passed
@glenn-jocher glenn-jocher deleted the docs_trtint8 branch May 17, 2024 17:02
@glenn-jocher
Copy link
Member

@Burhan-Q @lakshanthad awesome guys, great updates. PR merged!!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants