avoid calling libkineto::api().client()->stop twice #1029

Closed
wants to merge 3 commits into from

Conversation

staugust

Normally, the torch main thread calls performRunLoopStep whenever profilerStep is called, in order to collect traces. If maxGpuBufferCount_ is set too small, profiling may stop early, and profilerThread_ then also calls performRunLoopStep to collect the trace. If the CPU trace buffer is too large, collectTrace may be called twice, so libkineto::api().client()->stop is called twice as well. The second call throws an uncaught ::c10::Error, and the torch process exits, leaving a core file.
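For illustration only, here is a minimal sketch of one way to make a stop call idempotent. It is not the code in this PR: `FakeClient`, `StopOnceGuard`, and `runDemo` are hypothetical names, and `FakeClient` merely stands in for the real client by throwing on a second `stop()`, the way the double call described above ends in an uncaught error.

```cpp
#include <atomic>
#include <cassert>
#include <stdexcept>

// Hypothetical stand-in for the profiler client (not the libkineto API):
// a second stop() throws, mimicking the failure described above.
class FakeClient {
 public:
  void stop() {
    if (stopped_.exchange(true)) {
      throw std::runtime_error("stop() called twice");
    }
    ++stopCalls_;
  }
  int stopCalls() const { return stopCalls_.load(); }

 private:
  std::atomic<bool> stopped_{false};
  std::atomic<int> stopCalls_{0};
};

// Hypothetical guard: the first caller to flip the flag performs the stop;
// every later caller returns false without touching the client.
// exchange() is atomic, so this also holds if two threads race here.
class StopOnceGuard {
 public:
  explicit StopOnceGuard(FakeClient& client) : client_(client) {}

  bool stopOnce() {
    if (stopRequested_.exchange(true)) {
      return false;  // someone already stopped the client
    }
    client_.stop();
    return true;
  }

 private:
  FakeClient& client_;
  std::atomic<bool> stopRequested_{false};
};

// Simulates trace collection being reached twice and returns how many
// times the underlying client was actually stopped.
int runDemo() {
  FakeClient client;
  StopOnceGuard guard(client);
  bool first = guard.stopOnce();   // performs the stop
  bool second = guard.stopOnce();  // skipped by the guard, no exception
  assert(first && !second);
  return client.stopCalls();
}
```

With the guard in place, the second collection path becomes a no-op instead of raising. Whether a flag like this or a restructuring of the call sites is the better fix depends on how the real profiler shares state between the two threads.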

@facebook-github-bot

@sraikund16 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@facebook-github-bot

@staugust has updated the pull request. You must reimport the pull request before landing.

@facebook-github-bot

@staugust has updated the pull request. You must reimport the pull request before landing.

@facebook-github-bot

@sraikund16 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@facebook-github-bot

@sraikund16 merged this pull request in d92080f.

3 participants