Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Per user storage quotas or alternative improvements #3922

Closed
3 tasks done
consideRatio opened this issue Apr 9, 2024 · 2 comments
Closed
3 tasks done

Per user storage quotas or alternative improvements #3922

consideRatio opened this issue Apr 9, 2024 · 2 comments

Comments

@consideRatio
Copy link
Member

consideRatio commented Apr 9, 2024

Incident prompting additional improvement

Julius at LEAP recently ended up with 0% available storage in filestore, and reported:

In the longer run, I am really struggling with this. I have a few users that are just not keeping to their quota. It is a pretty bad situation that just one single user who doesn't know what they are doing (or just doesnt listen 👀) can bring the whole hub to a grinding halt.

I observe that we received the alert when dropping below 10% March 29th, but we didn't act until it was too late at April 9th when it dropped from 3.5% to 0%.

image

Available features of relevance

  1. Terraform managed alerts when going below 10%:
    • pagerduty notifications
    • slack notifications (via pagerduty notifications)
  2. Prometheus metrics collection and Grafana dashboards providing information about users home directories
    • This is available, and it can help community representatives know who consumes too much disk space

Not available features of relevance

This isn't an exhaustive list, just a quick writeup.

  1. Terraform managed alerts to community representatives
    We lack this currently and Julius at LEAP for example didn't get one. While they may have been setup manually in the past, they may have been overridden by terraform later?
    Tracked by Terraform configured alerting to community representatives about NFS storage #3923.
  2. NFS per user quotas
    To my knowledge, this isn't possible for us via the managed NFS services we use with AWS/GCP/Azure and we don't have a clear idea on how to go about resolving this.
    Currently represented by this issue.

Related

@consideRatio
Copy link
Member Author

@damianavila @haroldcampbell I've put this in the engineering backlog even though its not ready to be worked with technical concrete steps etc, not sure how else to make it not not fall between cracks.

@consideRatio
Copy link
Member Author

I figure this is tracked by #5010, closing.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
No open projects
Status: Needs Shaping / Refinement
Development

No branches or pull requests

1 participant