Zobelisk GitHub Actions container frequently goes down #1212

Open
5 tasks
cbrxyz opened this issue Jun 20, 2024 · 1 comment
Labels: ci, infra (Related to MIL server infrastructure), software

Comments

@cbrxyz
Member

cbrxyz commented Jun 20, 2024

What needs to change?

The GitHub Actions Docker container that runs our CI builds on Zobelisk repeatedly goes down, and every restart requires someone to provide a new authentication key. It would be great if the container could be more reliable, or at least if it were easier to relaunch the existing container.

How would this task be tested?

  1. This one is a little hard to test, since it usually takes a while for the GitHub Actions container to go down. Maybe you could make the container forcefully shut down after a shorter amount of time, and then ensure that the container comes alive again.
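One way to try the forced-shutdown idea above is a small script that stops the container and then polls until it reports as running again. This is only a sketch under assumptions: the container name `gha-runner` is hypothetical, and it assumes some external mechanism (a watchdog, a systemd unit, etc.) is responsible for bringing the container back. Docker's built-in restart policies are documented to ignore containers that were stopped manually, so this check would not exercise those on its own.

```python
import subprocess
import time
from typing import Callable, Sequence

CONTAINER = "gha-runner"  # hypothetical name; substitute the real container name


def run_docker(args: Sequence[str]) -> str:
    """Run a docker CLI command and return its stripped stdout."""
    result = subprocess.run(
        ["docker", *args], capture_output=True, text=True, check=True
    )
    return result.stdout.strip()


def is_running(name: str, docker: Callable[[Sequence[str]], str] = run_docker) -> bool:
    """Return True if `docker inspect` reports the container as running."""
    return docker(["inspect", "--format", "{{.State.Running}}", name]) == "true"


def recovers_after_stop(
    name: str,
    timeout_s: float = 120.0,
    poll_s: float = 5.0,
    docker: Callable[[Sequence[str]], str] = run_docker,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Stop the container, then poll until it is running again or we time out."""
    docker(["stop", name])
    waited = 0.0
    while waited <= timeout_s:
        if is_running(name, docker):
            return True
        sleep(poll_s)
        waited += poll_s
    return False
```

Calling `recovers_after_stop(CONTAINER)` on the host would return `True` only if something restarted the container within the timeout. The `docker` and `sleep` parameters are there so the logic can be checked without touching a real Docker daemon.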

Contacts

  • We need help from the mechanical team.
  • We need help from the electrical team.
  • We need help from Dr. Schwartz or other faculty.
  • We need help from a company or an organization.
  • We need help from another UF staff member or organization (e.g., facilities).
@cbrxyz added the ci, software, and infra (Related to MIL server infrastructure) labels on Jun 20, 2024
@lynettehemingway self-assigned this on Nov 17, 2024
@lynettehemingway

Update: I am looking into ways to improve the reliability of the GitHub Actions Docker container: automated key management, health checks and auto-restart mechanisms, and best practices for testing and stress-testing containers.
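As a rough illustration of the health-check and auto-restart idea, a watchdog could periodically inspect the container and issue `docker start` on the existing container when it is down, rather than creating a new one. This is a sketch, not the project's actual setup: the container name is hypothetical, and whether restarting the existing container avoids the need for a new authentication key is an assumption that would have to be verified on Zobelisk.

```python
import subprocess
import time
from typing import Callable, Sequence


def run_docker(args: Sequence[str]) -> str:
    """Run a docker CLI command and return its stripped stdout."""
    result = subprocess.run(
        ["docker", *args], capture_output=True, text=True, check=True
    )
    return result.stdout.strip()


def ensure_running(
    name: str, docker: Callable[[Sequence[str]], str] = run_docker
) -> bool:
    """Start the existing container if it is stopped.

    Returns True if a start was issued, False if it was already running.
    """
    state = docker(["inspect", "--format", "{{.State.Running}}", name])
    if state == "true":
        return False
    # Reuse the existing container instead of creating a fresh one.
    docker(["start", name])
    return True


def watch(
    name: str,
    checks: int,
    interval_s: float = 60.0,
    docker: Callable[[Sequence[str]], str] = run_docker,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Run a fixed number of health checks and return how many starts were issued."""
    restarts = 0
    for _ in range(checks):
        if ensure_running(name, docker):
            restarts += 1
        sleep(interval_s)
    return restarts
```

For example, `watch("gha-runner", checks=60)` would check once a minute for an hour (with `gha-runner` standing in for the real name). A restart policy set through Docker itself may be a simpler alternative if it turns out to cover the failure mode seen here.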
