Preliminary implementation is done and online for LS6.
Limitations: 1) Raw data for individual GPUs is merged in the database on import, so only the total percentage is available.
2) A few nodes in gpu-a100-small and gpu-dev do not seem to have GPU recording enabled in the monitor, so no GPU data is recorded, e.g.: https://ls6-stats.tacc.utexas.edu/machine/job/1473810/
Possible workaround without changing the database structure: set "event" to "utilization_$gpunumber" instead of "utilization" when importing, then extract "$gpunumber" in views.py.
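A rough sketch of what that naming workaround might look like, in case it helps the discussion. The helper names `make_event_name` and `parse_gpu_number` are hypothetical and not part of the existing code; only the "utilization" event name and the "utilization_$gpunumber" pattern come from the comment above.

```python
import re

# Event name as described in this thread; the suffix carries the GPU number.
EVENT_PREFIX = "utilization"
_EVENT_RE = re.compile(r"^utilization_(\d+)$")


def make_event_name(gpu_number: int) -> str:
    """Hypothetical import-side helper: build the per-GPU event name."""
    return f"{EVENT_PREFIX}_{gpu_number}"


def parse_gpu_number(event: str):
    """Hypothetical views.py-side helper: recover the GPU number.

    Returns None for events that do not follow the per-GPU pattern,
    such as the old merged "utilization" event.
    """
    match = _EVENT_RE.match(event)
    return int(match.group(1)) if match else None
```

Returning None for the plain "utilization" event would let rows imported before the change keep rendering as a merged total.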
We need a counter on the new version that says how many GPUs were used for a job.
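One possible way to get such a counter, assuming events are named "utilization_$gpunumber" as suggested in the workaround above: count the distinct GPU numbers seen among a job's events. The function name `count_gpus` and the idea of passing in a plain list of event strings are assumptions for illustration, not how the current code is structured.

```python
def count_gpus(events):
    """Count distinct GPUs among event names like "utilization_3".

    `events` is assumed to be an iterable of event-name strings for one job.
    Events without a numeric GPU suffix are ignored.
    """
    gpu_ids = set()
    for event in events:
        prefix, _, suffix = event.rpartition("_")
        if prefix == "utilization" and suffix.isdecimal():
            gpu_ids.add(int(suffix))
    return len(gpu_ids)
```

Note that this only counts GPUs that actually reported data, so jobs on nodes without GPU recording enabled would show zero.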