When using on-demand profiling via dynolog and kineto, we noticed that when a profiling request is configured with iterations, the last profiled iteration takes longer than the other profiled iterations. The training process is blocked at `optimizer.step()`, which calls `step` in kineto. Inside `performRunLoop`, the call to `libkineto::api().client()->stop()` accounts for most of that time.
By contrast, `processTraceInternal` is executed asynchronously in `performRunLoop`, so it does not block the training process.
Is there a plan to fix this performance issue so that on-demand profiling adds minimal overhead to the PyTorch training process? It would be great if a plan or proposal already exists. If not, I'd like to submit a proposal later.
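One way to confirm the slowdown described above is to time each training step and flag the ones that take far longer than the median. The following is a minimal sketch, not part of kineto or dynolog: `time_steps`, `slow_steps`, and the `factor` threshold are hypothetical names chosen for illustration, and `step_fn` stands in for whatever runs one training iteration (including `optimizer.step()`).

```python
import time
from statistics import median


def time_steps(step_fn, num_steps, clock=time.perf_counter):
    """Run step_fn(i) num_steps times and return each call's wall-clock duration.

    step_fn is a hypothetical callable wrapping one training iteration.
    """
    durations = []
    for i in range(num_steps):
        start = clock()
        step_fn(i)
        durations.append(clock() - start)
    return durations


def slow_steps(durations, factor=3.0):
    """Return indices of steps slower than factor times the median duration.

    The factor of 3.0 is an arbitrary illustrative threshold.
    """
    baseline = median(durations)
    return [i for i, d in enumerate(durations) if d > factor * baseline]
```

If the last profiled iteration is the one being blocked, its index should be the one reported by `slow_steps`.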
Summary:
This fixes issue #953. It makes `libkineto::api().client()->stop()` and `stopTraceInternal` run in `profilerThread_` so that the training process is not blocked.
Pull Request resolved: #966
Reviewed By: sanrise
Differential Revision: D64214259
Pulled By: sraikund16
fbshipit-source-id: a27398e266df8502579d49b2c7d65c1863788008
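The general idea behind the fix, moving a slow stop call off the training thread and onto a dedicated profiler thread, can be illustrated with a small Python sketch. This is only an analogy under stated assumptions, not kineto's C++ implementation: `ProfilerWorker`, `slow_stop`, and `step_with_offloaded_stop` are hypothetical names, and the `time.sleep` call merely simulates a stop call that takes a long time.

```python
import queue
import threading
import time


class ProfilerWorker:
    """Hypothetical stand-in for a dedicated profiler thread (not kineto's API)."""

    def __init__(self):
        self._tasks = queue.Queue()
        self._thread = threading.Thread(target=self._run, daemon=True)
        self._thread.start()

    def _run(self):
        # Execute queued tasks in order until the None sentinel arrives.
        while True:
            task = self._tasks.get()
            if task is None:
                break
            task()

    def submit(self, task):
        # Enqueue the task and return immediately; the caller is not blocked.
        self._tasks.put(task)

    def shutdown(self):
        # Ask the worker to exit after the tasks already queued, then wait for it.
        self._tasks.put(None)
        self._thread.join()


def slow_stop(done, delay=0.2):
    """Simulate an expensive stop call, then signal completion."""
    time.sleep(delay)
    done.set()


def step_with_offloaded_stop(worker, done):
    """Hand the slow stop to the worker and return how long the caller waited."""
    start = time.perf_counter()
    worker.submit(lambda: slow_stop(done))
    return time.perf_counter() - start
```

In this sketch the caller only pays for enqueueing the task, while the simulated stop completes later on the worker thread. The trade-off is that any code depending on the stop having finished must wait on the `done` event (or an equivalent signal) instead of assuming completion when the step returns.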