📖 [Story] Optimize the launch overhead of TRT engine and pytorch kernels #3274

keehyuna · 2024-11-04T13:03:08Z

TL;DR

Runtime optimization in Torch-TensorRT is crucial for maximizing model performance in real-world applications.
This story tracks the effort to reduce launch overhead and improve runtime performance.

Goal(s)

Understand the overhead in the C++ and Python runtime modules and improve inference performance.
Ensure the optimizations have no, or minimal, impact on accuracy and resource usage.
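
As a starting point for the first goal, here is a minimal sketch of how host-side per-call overhead could be measured. It is an illustration only, not the project's benchmark: `measure_call_overhead` and `noop` are hypothetical names, and `noop` stands in for a compiled module's forward call. For a real GPU module, the device would also need to be synchronized (for example with `torch.cuda.synchronize()`) before each clock read, otherwise only the asynchronous launch cost is captured.

```python
import time
from statistics import median


def measure_call_overhead(fn, *args, warmup=10, iters=100):
    """Return the median wall-clock time per call of fn(*args), in microseconds."""
    # Warm-up calls so one-time setup costs do not skew the samples.
    for _ in range(warmup):
        fn(*args)

    samples = []
    for _ in range(iters):
        start = time.perf_counter()
        fn(*args)
        samples.append((time.perf_counter() - start) * 1e6)
    return median(samples)


def noop(x):
    # Hypothetical stand-in for a compiled module's forward call.
    return x


overhead_us = measure_call_overhead(noop, 0)
print(f"median per-call overhead: {overhead_us:.2f} us")
```

The median is used rather than the mean so that occasional scheduler hiccups do not dominate the result.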

Tasks

feat: Runtime output buffer optimization #3276

✨[Feature] Performance optimization of PyTorch + TRT subgraphs #3277


Additional context


keehyuna added the Story label Nov 4, 2024

keehyuna self-assigned this Nov 4, 2024
