Skip to content

Commit

Permalink
[cambricon] fix previous model obtain the PID (#748)
Browse files Browse the repository at this point in the history
  • Loading branch information
cifar10 authored Sep 14, 2024
1 parent df9b99e commit 1b9e9e8
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion training/utils/start_task_helper.py
Original file line number Diff line number Diff line change
Expand Up @@ -58,7 +58,7 @@ def get_mlu_pid():
import subprocess
result = subprocess.Popen(['ps', 'aux'], stdout=subprocess.PIPE, text=True)
for line in result.stdout:
if 'MLU_VISIBLE_DEVICES' in line and 'grep' not in line:
if 'torchrun' in line and 'flagscale' in line and 'grep' not in line:
return line.split()[1]
return None

Expand Down

0 comments on commit 1b9e9e8

Please sign in to comment.