📖 [Story] Optimize the launch overhead of TRT engine and pytorch kernels #3274

Open
2 tasks
keehyuna opened this issue Nov 4, 2024 · 0 comments
Labels
Story Issues proposing a new Story

keehyuna commented Nov 4, 2024

TL;DR

Runtime overhead in Torch-TensorRT directly limits how fast a model runs in real-world applications.
This story tracks the effort to reduce the launch overhead of TRT engines and PyTorch kernels at runtime.

Goal(s)

  • Understand the overhead in the C++ and Python runtime modules and improve inference performance
  • Ensure the optimizations have no, or only minimal, impact on accuracy and resource usage
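
As an illustration of the first goal, per-call overhead could be measured with a small timing harness like the one below. This is a minimal sketch, not part of Torch-TensorRT: the helper name `measure_call_latency` and its parameters are hypothetical, and the workload is a stand-in. For GPU inference, a barrier such as `torch.cuda.synchronize` would presumably be passed as `sync` so that asynchronously launched kernels are included in the measurement.

```python
import statistics
import time
from typing import Callable, Dict, Optional


def measure_call_latency(
    fn: Callable[[], object],
    warmup: int = 10,
    iters: int = 100,
    sync: Optional[Callable[[], None]] = None,
) -> Dict[str, float]:
    """Time repeated calls to fn and return per-call latency in milliseconds.

    Hypothetical helper for illustration only. `sync` is an optional barrier
    called after each invocation, before the clock is read, so that
    asynchronous work launched by fn is counted.
    """
    if iters < 1:
        raise ValueError("iters must be >= 1")

    # Warm-up calls are excluded so one-time setup costs do not skew results.
    for _ in range(warmup):
        fn()
    if sync is not None:
        sync()

    samples = []
    for _ in range(iters):
        start = time.perf_counter()
        fn()
        if sync is not None:
            sync()
        samples.append((time.perf_counter() - start) * 1e3)

    return {
        "mean_ms": statistics.fmean(samples),
        "median_ms": statistics.median(samples),
        "min_ms": min(samples),
    }


if __name__ == "__main__":
    # Stand-in CPU workload; replace with a call into the compiled module.
    print(measure_call_latency(lambda: sum(range(1000))))
```

Comparing the numbers for a very small model, where launch cost dominates compute, against a larger one gives a rough sense of how much time is spent in the runtime wrapper rather than in the kernels themselves.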

Tasks


  1. Labels: cla signed, component: api [Python], component: core, component: dynamo, component: runtime, component: tests, documentation
    keehyuna
  2. Labels: feature request
    keehyuna

Additional context

@keehyuna keehyuna added the Story Issues proposing a new Story label Nov 4, 2024
@keehyuna keehyuna self-assigned this Nov 4, 2024