Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add more detailed timing. #281

Merged
merged 3 commits into from
May 20, 2024
Merged

Add more detailed timing. #281

merged 3 commits into from
May 20, 2024

Conversation

GeorgiosSmyrnis
Copy link
Collaborator

This PR adds more detailed timing for our training run, so that performance can be monitored more closely.

@GeorgiosSmyrnis
Copy link
Collaborator Author

This now fixes an issue where the time to sync loss across ranks was not included in the batch time.

@GeorgiosSmyrnis GeorgiosSmyrnis merged commit 2a9e43a into main May 20, 2024
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants