Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ci: Upload phase 1 & phase 2 training logs for loss graphs #356

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

alimaredia
Copy link

No description provided.

@mergify mergify bot added CI/CD Affects CI/CD configuration ci-failure labels Nov 25, 2024
@alimaredia alimaredia force-pushed the fix-loss-graphs-ci branch 2 times, most recently from 8214570 to 8352fba Compare November 25, 2024 18:28
cat "${output_file}" >> "${GITHUB_STEP_SUMMARY}"
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It seems like this E2E file has an error - this line should be present after the output files are generated by running the python scripts/create-loss-graph.py

Suggested change
cat "${output_file}" >> "${GITHUB_STEP_SUMMARY}"

--pr-number "${{ github.event.pull_request.number }}" \
--head-sha "${{ github.event.pull_request.head.sha }}" \
--origin-repository "${{ github.repository }}"

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You'll want to actually move the cat "${output_file}" >> "{GITHUB_STEP_SUMMARY}" step to take place in the upload to phase steps.

Otherwise the current logic is paradoxical - we never output the summary on a successful upload, but we try to upload it on error.

Suggested change
cat "${output_file}" >> "${GITHUB_STEP_SUMMARY}"

echo "::warning::Failed to upload loss graph to S3. This won't block the workflow, but you may want to investigate."
echo "Loss graph upload failed" >> "${GITHUB_STEP_SUMMARY}"

cat "${output_file}" >> "${GITHUB_STEP_SUMMARY}"
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Move this one to the phase 1 upload step

Suggested change
cat "${output_file}" >> "${GITHUB_STEP_SUMMARY}"

@@ -295,10 +314,34 @@ jobs:
--head-sha "${{ github.event.pull_request.head.sha }}" \
--origin-repository "${{ github.repository }}"

Copy link
Member

@RobotSail RobotSail Nov 26, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You'll want to also echo the output file here:

Suggested change
cat "${output_file}" >> "${GITHUB_STEP_SUMMARY}"

Copy link
Member

@RobotSail RobotSail left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you for making the quick PR Ali! I've provided some inline suggestions to fix a few issues in the PR. Once those are made we can go ahead and merge this. Appreciate you taking this on!!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CI/CD Affects CI/CD configuration
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants