Chaos test the cluster #21

Open
stan-dot opened this issue Jun 3, 2024 · 13 comments
Labels
enhancement New feature or request

Comments

@stan-dot
Collaborator

stan-dot commented Jun 3, 2024

explore cluster resilience testing

options considered:

@stan-dot stan-dot added the enhancement New feature or request label Jun 3, 2024
@stan-dot stan-dot self-assigned this Jun 3, 2024
@DiamondJoseph

Whatever decisions and lessons come out of this will also apply to the other beamline repositories

@stan-dot
Collaborator Author

note: Chaos Toolkit is better suited to AWS and isn't that well developed
https://chaostoolkit.org/reference/tutorials/containerising/

@stan-dot
Collaborator Author

@stan-dot stan-dot removed their assignment Jun 20, 2024
@stan-dot stan-dot self-assigned this Jul 25, 2024
@stan-dot
Collaborator Author

raised a chaos-mesh ticket with the cloud team about the namespace creation

@stan-dot
Collaborator Author

moved to looking into a more restricted tool - https://github.com/asobti/kube-monkey
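
For reference, a rough sketch of how a workload might be opted in to kube-monkey. The label keys and kill modes are the ones listed in the kube-monkey README; the identifier `example-app`, the helper names and the default values are hypothetical placeholders, not settings anyone has agreed on.

```python
import json

# Kill modes listed in the kube-monkey README.
KILL_MODES = {"fixed", "kill-all", "random-max-percent", "fixed-percent"}


def kube_monkey_labels(identifier, mtbf_days=2, kill_mode="fixed", kill_value=1):
    """Build the opt-in labels kube-monkey looks for on a workload.

    identifier, mtbf_days, kill_mode and kill_value are illustrative inputs.
    """
    if kill_mode not in KILL_MODES:
        raise ValueError(f"unknown kill mode: {kill_mode}")
    labels = {
        "kube-monkey/enabled": "enabled",
        "kube-monkey/identifier": identifier,
        "kube-monkey/mtbf": str(mtbf_days),  # mean time between failures, in days
        "kube-monkey/kill-mode": kill_mode,
    }
    # kill-all does not take a kill-value; the other modes do.
    if kill_mode != "kill-all":
        labels["kube-monkey/kill-value"] = str(kill_value)
    return labels


def opt_in_patch(identifier, **kwargs):
    """Build a merge patch that labels both the Deployment and its pod template."""
    labels = kube_monkey_labels(identifier, **kwargs)
    return {
        "metadata": {"labels": labels},
        "spec": {
            "template": {
                "metadata": {
                    "labels": {
                        "kube-monkey/enabled": labels["kube-monkey/enabled"],
                        "kube-monkey/identifier": labels["kube-monkey/identifier"],
                    }
                }
            }
        },
    }


if __name__ == "__main__":
    print(json.dumps(opt_in_patch("example-app"), indent=2))
```

The printed JSON could then be applied with something like `kubectl patch deployment example-app --type merge -p '<json>'`. Because only labelled workloads are targeted, the blast radius stays limited to whatever has been explicitly opted in.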

@stan-dot
Collaborator Author

@stan-dot
Collaborator Author

working on it

@stan-dot
Collaborator Author

stan-dot commented Oct 1, 2024

tracking the cloud aspect here https://jira.diamond.ac.uk/servicedesk/customer/portal/2/SCHD-6072

@stan-dot stan-dot transferred this issue from DiamondLightSource/ViSR Oct 24, 2024
@stan-dot
Collaborator Author

stan-dot commented Nov 5, 2024

might use Litmus: https://github.com/litmuschaos/litmus

@stan-dot
Collaborator Author

stan-dot commented Nov 5, 2024

@stan-dot
Collaborator Author

from the Litmus docs: "Workflow Controller: The Argo Workflow Controller responsible for the creation of Chaos Experiments using the Chaos Experiment CR."

so Litmus relies on Argo Workflows here, and argo isn't quite ready yet? might need to delay this
https://docs.litmuschaos.io/docs/architecture/chaos-execution-plane
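
For context, a minimal sketch of the kind of ChaosEngine resource Litmus uses to point a single `pod-delete` experiment at a labelled deployment. The field names follow the `litmuschaos.io/v1alpha1` ChaosEngine schema as far as I know it; the names `example-chaos`, `example-ns`, `app=example-app` and `pod-delete-sa` are hypothetical, and whether this can be used here without the Argo-based workflow layer is an assumption that would need checking.

```python
import json


def pod_delete_engine(name, namespace, app_label, service_account,
                      duration_s=30, interval_s=10):
    """Build a ChaosEngine manifest (as a dict) for the pod-delete experiment.

    All argument values are illustrative; nothing here talks to a cluster.
    """
    return {
        "apiVersion": "litmuschaos.io/v1alpha1",
        "kind": "ChaosEngine",
        "metadata": {"name": name, "namespace": namespace},
        "spec": {
            "engineState": "active",
            # Which application the experiment targets.
            "appinfo": {
                "appns": namespace,
                "applabel": app_label,
                "appkind": "deployment",
            },
            "chaosServiceAccount": service_account,
            "experiments": [
                {
                    "name": "pod-delete",
                    "spec": {
                        "components": {
                            "env": [
                                # Env values are passed as strings.
                                {"name": "TOTAL_CHAOS_DURATION", "value": str(duration_s)},
                                {"name": "CHAOS_INTERVAL", "value": str(interval_s)},
                                {"name": "FORCE", "value": "false"},
                            ]
                        }
                    },
                }
            ],
        },
    }


if __name__ == "__main__":
    engine = pod_delete_engine(
        "example-chaos", "example-ns", "app=example-app", "pod-delete-sa"
    )
    # kubectl accepts JSON as well as YAML, so this can be piped to `kubectl apply -f -`.
    print(json.dumps(engine, indent=2))
```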

@stan-dot
Collaborator Author

stan-dot commented Nov 13, 2024

  • might need to fork ChaosHub to deploy our own experiments
  • might need to add DNS and an ingress

https://docs.litmuschaos.io/docs/concepts/chaoshub

@stan-dot
Collaborator Author

this is not on the critical path, deprioritizing

@stan-dot stan-dot removed their assignment Nov 26, 2024
2 participants