[2i2c-uk:lis] Enable profileList #3308

Closed

GeorgianaElena
Member

@GeorgianaElena GeorgianaElena commented Oct 24, 2023

Update: waiting for feedback in https://2i2c.freshdesk.com/a/tickets/1066 before merging.

Follow-up to #3302, also for https://2i2c.freshdesk.com/a/tickets/1066.

Matthew from LIS requested a 1GB memory guarantee because they are still experiencing crashes, even after the guarantee was increased to 512MB in #3302.

This PR enables profileList for the lis hub, adding a new option with the 1GB guarantee and reducing the memory limit of the initial 512MB option from 2GB to 1GB.

I think it would be useful for them to enable https://github.com/jupyter-server/jupyter-resource-usage so they can better understand their memory needs, especially since, judging from the Grafana screenshot below, only a few users actually required around 1GB or more.

This is also why this PR adds a new server size option instead of increasing the guarantee again.

Screenshot 2023-10-24 at 10 02 40

@github-actions

github-actions bot commented Oct 24, 2023

Merging this PR will trigger the following deployment actions.

Support and Staging deployments

| Cloud Provider | Cluster Name | Upgrade Support? | Reason for Support Redeploy | Upgrade Staging? | Reason for Staging Redeploy |
| --- | --- | --- | --- | --- | --- |
| gcp | 2i2c-uk | No | | Yes | Core infrastructure has been modified |
| aws | nasa-veda | No | | Yes | Core infrastructure has been modified |
| aws | victor | No | | Yes | Core infrastructure has been modified |
| gcp | leap | No | | Yes | Core infrastructure has been modified |
| kubeconfig | utoronto | No | | Yes | Core infrastructure has been modified |
| aws | gridsst | No | | Yes | Core infrastructure has been modified |
| gcp | catalystproject-latam | No | | Yes | Core infrastructure has been modified |
| aws | 2i2c-aws-us | No | | Yes | Core infrastructure has been modified |
| aws | jupyter-meets-the-earth | No | | Yes | Core infrastructure has been modified |
| gcp | qcl | No | | Yes | Core infrastructure has been modified |
| aws | nasa-cryo | No | | Yes | Core infrastructure has been modified |
| gcp | linked-earth | No | | Yes | Core infrastructure has been modified |
| gcp | cloudbank | No | | Yes | Core infrastructure has been modified |
| gcp | meom-ige | No | | Yes | Core infrastructure has been modified |
| aws | catalystproject-africa | No | | Yes | Core infrastructure has been modified |
| gcp | awi-ciroh | No | | Yes | Core infrastructure has been modified |
| aws | carbonplan | No | | Yes | Core infrastructure has been modified |
| aws | smithsonian | No | | Yes | Core infrastructure has been modified |
| aws | openscapes | No | | Yes | Core infrastructure has been modified |
| aws | nasa-ghg | No | | Yes | Core infrastructure has been modified |
| gcp | m2lines | No | | Yes | Core infrastructure has been modified |
| gcp | callysto | No | | Yes | Core infrastructure has been modified |
| gcp | pangeo-hubs | No | | Yes | Core infrastructure has been modified |
| gcp | 2i2c | No | | Yes | Core infrastructure has been modified |
| gcp | hhmi | No | | Yes | Core infrastructure has been modified |
| aws | ubc-eoas | No | | Yes | Core infrastructure has been modified |

Production deployments

| Cloud Provider | Cluster Name | Hub Name | Reason for Redeploy |
| --- | --- | --- | --- |
| gcp | 2i2c-uk | lis | Core infrastructure has been modified |
| aws | nasa-veda | prod | Core infrastructure has been modified |
| aws | victor | prod | Core infrastructure has been modified |
| gcp | leap | prod | Core infrastructure has been modified |
| kubeconfig | utoronto | prod | Core infrastructure has been modified |
| kubeconfig | utoronto | r-prod | Core infrastructure has been modified |
| aws | gridsst | prod | Core infrastructure has been modified |
| gcp | catalystproject-latam | unitefa-conicet | Core infrastructure has been modified |
| aws | 2i2c-aws-us | researchdelight | Core infrastructure has been modified |
| aws | 2i2c-aws-us | ncar-cisl | Core infrastructure has been modified |
| aws | 2i2c-aws-us | go-bgc | Core infrastructure has been modified |
| aws | 2i2c-aws-us | itcoocean | Core infrastructure has been modified |
| aws | 2i2c-aws-us | cosmicds | Core infrastructure has been modified |
| aws | jupyter-meets-the-earth | prod | Core infrastructure has been modified |
| gcp | qcl | prod | Core infrastructure has been modified |
| aws | nasa-cryo | prod | Core infrastructure has been modified |
| gcp | linked-earth | prod | Core infrastructure has been modified |
| gcp | cloudbank | bcc | Core infrastructure has been modified |
| gcp | cloudbank | ccsf | Core infrastructure has been modified |
| gcp | cloudbank | csm | Core infrastructure has been modified |
| gcp | cloudbank | dvc | Core infrastructure has been modified |
| gcp | cloudbank | elcamino | Core infrastructure has been modified |
| gcp | cloudbank | evc | Core infrastructure has been modified |
| gcp | cloudbank | glendale | Core infrastructure has been modified |
| gcp | cloudbank | howard | Core infrastructure has been modified |
| gcp | cloudbank | miracosta | Core infrastructure has been modified |
| gcp | cloudbank | skyline | Core infrastructure has been modified |
| gcp | cloudbank | demo | Core infrastructure has been modified |
| gcp | cloudbank | fresno | Core infrastructure has been modified |
| gcp | cloudbank | humboldt | Core infrastructure has been modified |
| gcp | cloudbank | laney | Core infrastructure has been modified |
| gcp | cloudbank | sbcc | Core infrastructure has been modified |
| gcp | cloudbank | sbcc-dev | Core infrastructure has been modified |
| gcp | cloudbank | lacc | Core infrastructure has been modified |
| gcp | cloudbank | lamission | Core infrastructure has been modified |
| gcp | cloudbank | mills | Core infrastructure has been modified |
| gcp | cloudbank | mission | Core infrastructure has been modified |
| gcp | cloudbank | norco | Core infrastructure has been modified |
| gcp | cloudbank | palomar | Core infrastructure has been modified |
| gcp | cloudbank | pasadena | Core infrastructure has been modified |
| gcp | cloudbank | sjcc | Core infrastructure has been modified |
| gcp | cloudbank | sacramento | Core infrastructure has been modified |
| gcp | cloudbank | srjc | Core infrastructure has been modified |
| gcp | cloudbank | saddleback | Core infrastructure has been modified |
| gcp | cloudbank | santiago | Core infrastructure has been modified |
| gcp | cloudbank | sjsu | Core infrastructure has been modified |
| gcp | cloudbank | tuskegee | Core infrastructure has been modified |
| gcp | cloudbank | wlac | Core infrastructure has been modified |
| gcp | cloudbank | csulb | Core infrastructure has been modified |
| gcp | cloudbank | csum | Core infrastructure has been modified |
| gcp | meom-ige | prod | Core infrastructure has been modified |
| aws | catalystproject-africa | nm-aist | Core infrastructure has been modified |
| gcp | awi-ciroh | prod | Core infrastructure has been modified |
| aws | carbonplan | prod | Core infrastructure has been modified |
| aws | smithsonian | prod | Core infrastructure has been modified |
| aws | openscapes | prod | Core infrastructure has been modified |
| aws | nasa-ghg | prod | Core infrastructure has been modified |
| gcp | m2lines | prod | Core infrastructure has been modified |
| gcp | callysto | prod | Core infrastructure has been modified |
| gcp | pangeo-hubs | prod | Core infrastructure has been modified |
| gcp | pangeo-hubs | coessing | Core infrastructure has been modified |
| gcp | 2i2c | imagebuilding-demo | Core infrastructure has been modified |
| gcp | 2i2c | demo | Core infrastructure has been modified |
| gcp | 2i2c | ohw | Core infrastructure has been modified |
| gcp | 2i2c | aup | Core infrastructure has been modified |
| gcp | 2i2c | temple | Core infrastructure has been modified |
| gcp | 2i2c | ucmerced | Core infrastructure has been modified |
| gcp | 2i2c | climatematch | Core infrastructure has been modified |
| gcp | 2i2c | neurohackademy | Core infrastructure has been modified |
| gcp | 2i2c | mtu | Core infrastructure has been modified |
| gcp | 2i2c | jackeddy | Core infrastructure has been modified |
| gcp | hhmi | prod | Core infrastructure has been modified |
| aws | ubc-eoas | prod | Core infrastructure has been modified |

Comment on lines 50 to 62
- display_name: "Small: ~512 MB RAM / ~0.5 CPU"
  slug: mem_512m
  default: true
  kubespawner_override:
    # increase as requested via https://2i2c.freshdesk.com/a/tickets/1066
    mem_guarantee: 512M
    mem_limit: 1G
- display_name: "Large: ~1 GB RAM / ~0.5 CPU"
  slug: mem_1g
  kubespawner_override:
    # increase as requested via https://2i2c.freshdesk.com/a/tickets/1066
    mem_guarantee: 1G
    mem_limit: 2G
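
As a rough aid for reviewing the two options above, here is a minimal Python sketch that mirrors them as a list of dicts and runs a few sanity checks: at most one default, unique slugs, and a guarantee no larger than the limit. The helpers `to_bytes` and `check_profiles` are hypothetical and not part of this repository, and the 1024-based suffix multipliers are an assumption made for the comparison only.

```python
# Assumption: K/M/G/T suffixes are treated as powers of 1024 for this check.
UNITS = {"K": 1024, "M": 1024**2, "G": 1024**3, "T": 1024**4}


def to_bytes(spec: str) -> int:
    """Convert a size string such as "512M" or "1G" to bytes."""
    return int(spec[:-1]) * UNITS[spec[-1]]


def check_profiles(profiles: list[dict]) -> list[str]:
    """Return a list of human-readable problems; an empty list means OK."""
    problems = []
    defaults = [p["slug"] for p in profiles if p.get("default")]
    if len(defaults) > 1:
        problems.append(f"more than one default option: {defaults}")
    slugs = [p["slug"] for p in profiles]
    if len(set(slugs)) != len(slugs):
        problems.append("slugs are not unique")
    for p in profiles:
        override = p["kubespawner_override"]
        if to_bytes(override["mem_guarantee"]) > to_bytes(override["mem_limit"]):
            problems.append(f"{p['slug']}: guarantee exceeds limit")
    return problems


# The two options from the diff above, written as Python dicts.
profile_list = [
    {
        "display_name": "Small: ~512 MB RAM / ~0.5 CPU",
        "slug": "mem_512m",
        "default": True,
        "kubespawner_override": {"mem_guarantee": "512M", "mem_limit": "1G"},
    },
    {
        "display_name": "Large: ~1 GB RAM / ~0.5 CPU",
        "slug": "mem_1g",
        "kubespawner_override": {"mem_guarantee": "1G", "mem_limit": "2G"},
    },
]

print(check_profiles(profile_list))
```

Both options in this PR keep the limit at twice the guarantee, which the byte conversion makes easy to confirm.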
Contributor

I'd like to avoid relative words like "small" and "large", because communities will then use these words with us, and we won't understand them without first reading up on their configuration.

I think it would make sense for them to use the existing 4 CPU / 32 GB nodes with node resource allocation script generated requests/limits where the limit is 2x the request. Is the script able to do that atm? Hmmm...

Member Author

I actually did a deploy with just the sizes as names and it felt very mathematical 😅
Screenshot 2023-10-24 at 11 01 57

> I think it would make sense for them to use the existing 4 CPU / 32 GB nodes with node resource allocation script generated requests/limits where the limit is 2x the request. Is the script able to do that atm? Hmmm..

I am not familiar with what the script can do yet 😬

Contributor

@consideRatio consideRatio Oct 24, 2023

Neither am I, and I failed to quickly whip up a suggested alternative, so let's go with anything for now and iterate!

The script currently only generates options where requests and limits are set to the same value, so nothing matches exactly what we want, which is limits higher than requests.
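
To make the gap concrete, here is a minimal Python sketch of what "limit is 2x the request" could look like when generating options. It is not the node resource allocation script; `format_mem`, `make_profile`, and `limit_factor` are hypothetical names, the display names are placeholders, and treating 1G as 1024M is an assumption. Only the `kubespawner_override` keys are taken from the diff under review.

```python
def format_mem(megabytes: int) -> str:
    """Render a size in the "512M" / "1G" style used in the diff.

    Assumption: 1G is treated as 1024M.
    """
    if megabytes % 1024 == 0:
        return f"{megabytes // 1024}G"
    return f"{megabytes}M"


def make_profile(guarantee_mb: int, limit_factor: int = 2, default: bool = False) -> dict:
    """Build one profile option whose limit is limit_factor times its guarantee."""
    guarantee = format_mem(guarantee_mb)
    profile = {
        "display_name": f"~{guarantee} RAM guaranteed",  # placeholder wording
        "slug": f"mem_{guarantee.lower()}",
        "kubespawner_override": {
            "mem_guarantee": guarantee,
            "mem_limit": format_mem(guarantee_mb * limit_factor),
        },
    }
    if default:
        profile["default"] = True
    return profile


# Reproduce the two memory settings discussed in this PR.
profiles = [make_profile(512, default=True), make_profile(1024)]
for p in profiles:
    print(p["slug"], p["kubespawner_override"])
```

Setting `limit_factor=1` would give the equal request and limit behaviour described above, so the factor is the only knob that differs.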

Contributor

@consideRatio consideRatio left a comment

This is approved as is, including the "Small" and "Large" mentions, and also with the currently deployed change. Let's iterate over time on this instead!

@GeorgianaElena
Member Author

@consideRatio, thank you! I was thinking just now, if we are moving to profileLists, why not add their initial memory guarantee as another, even smaller option (256MB)?

@consideRatio
Contributor

@GeorgianaElena I suggest we go for #3308 (comment) and ask them for feedback

@GeorgianaElena
Member Author

Update

Waiting for feedback before merging.

The profile list looks like:

Screenshot 2023-10-24 at 13 07 10

@GeorgianaElena
Member Author

I will close this PR to de-clutter and will reopen it if/when there is agreement about it.


2 participants