[2i2c-uk:lis] Enable profileList #3308

GeorgianaElena · 2023-10-24T07:06:02Z

Update: waiting for feedback in https://2i2c.freshdesk.com/a/tickets/1066 before merging.

Follow-up to #3302, also for https://2i2c.freshdesk.com/a/tickets/1066.

Matthew from LIS requested a memory guarantee of 1GB, as they are still experiencing crashes even after the increase to a 512MB guarantee in #3302.

This PR enables profileList for the lis hub, adding a new option with the 1GB guarantee and reducing the limit of the initial 512MB option from 2GB to 1GB.

I think it would be useful for them to enable https://github.com/jupyter-server/jupyter-resource-usage so they can better understand their memory needs, especially since, from the Grafana below, it looks like only a few users actually required around or more than 1GB.

This is also why I'm adding a new server size option instead of increasing the guarantee for everyone again.

github-actions · 2023-10-24T07:07:23Z

Merging this PR will trigger the following deployment actions.

Support and Staging deployments

Cloud Provider	Cluster Name	Upgrade Support?	Upgrade Staging?	Reason for Staging Redeploy
gcp	2i2c-uk	No	Yes	Core infrastructure has been modified
aws	nasa-veda	No	Yes	Core infrastructure has been modified
aws	victor	No	Yes	Core infrastructure has been modified
gcp	leap	No	Yes	Core infrastructure has been modified
kubeconfig	utoronto	No	Yes	Core infrastructure has been modified
aws	gridsst	No	Yes	Core infrastructure has been modified
gcp	catalystproject-latam	No	Yes	Core infrastructure has been modified
aws	2i2c-aws-us	No	Yes	Core infrastructure has been modified
aws	jupyter-meets-the-earth	No	Yes	Core infrastructure has been modified
gcp	qcl	No	Yes	Core infrastructure has been modified
aws	nasa-cryo	No	Yes	Core infrastructure has been modified
gcp	linked-earth	No	Yes	Core infrastructure has been modified
gcp	cloudbank	No	Yes	Core infrastructure has been modified
gcp	meom-ige	No	Yes	Core infrastructure has been modified
aws	catalystproject-africa	No	Yes	Core infrastructure has been modified
gcp	awi-ciroh	No	Yes	Core infrastructure has been modified
aws	carbonplan	No	Yes	Core infrastructure has been modified
aws	smithsonian	No	Yes	Core infrastructure has been modified
aws	openscapes	No	Yes	Core infrastructure has been modified
aws	nasa-ghg	No	Yes	Core infrastructure has been modified
gcp	m2lines	No	Yes	Core infrastructure has been modified
gcp	callysto	No	Yes	Core infrastructure has been modified
gcp	pangeo-hubs	No	Yes	Core infrastructure has been modified
gcp	2i2c	No	Yes	Core infrastructure has been modified
gcp	hhmi	No	Yes	Core infrastructure has been modified
aws	ubc-eoas	No	Yes	Core infrastructure has been modified

Production deployments

Cloud Provider	Cluster Name	Hub Name	Reason for Redeploy
gcp	2i2c-uk	lis	Core infrastructure has been modified
aws	nasa-veda	prod	Core infrastructure has been modified
aws	victor	prod	Core infrastructure has been modified
gcp	leap	prod	Core infrastructure has been modified
kubeconfig	utoronto	prod	Core infrastructure has been modified
kubeconfig	utoronto	r-prod	Core infrastructure has been modified
aws	gridsst	prod	Core infrastructure has been modified
gcp	catalystproject-latam	unitefa-conicet	Core infrastructure has been modified
aws	2i2c-aws-us	researchdelight	Core infrastructure has been modified
aws	2i2c-aws-us	ncar-cisl	Core infrastructure has been modified
aws	2i2c-aws-us	go-bgc	Core infrastructure has been modified
aws	2i2c-aws-us	itcoocean	Core infrastructure has been modified
aws	2i2c-aws-us	cosmicds	Core infrastructure has been modified
aws	jupyter-meets-the-earth	prod	Core infrastructure has been modified
gcp	qcl	prod	Core infrastructure has been modified
aws	nasa-cryo	prod	Core infrastructure has been modified
gcp	linked-earth	prod	Core infrastructure has been modified
gcp	cloudbank	bcc	Core infrastructure has been modified
gcp	cloudbank	ccsf	Core infrastructure has been modified
gcp	cloudbank	csm	Core infrastructure has been modified
gcp	cloudbank	dvc	Core infrastructure has been modified
gcp	cloudbank	elcamino	Core infrastructure has been modified
gcp	cloudbank	evc	Core infrastructure has been modified
gcp	cloudbank	glendale	Core infrastructure has been modified
gcp	cloudbank	howard	Core infrastructure has been modified
gcp	cloudbank	miracosta	Core infrastructure has been modified
gcp	cloudbank	skyline	Core infrastructure has been modified
gcp	cloudbank	demo	Core infrastructure has been modified
gcp	cloudbank	fresno	Core infrastructure has been modified
gcp	cloudbank	humboldt	Core infrastructure has been modified
gcp	cloudbank	laney	Core infrastructure has been modified
gcp	cloudbank	sbcc	Core infrastructure has been modified
gcp	cloudbank	sbcc-dev	Core infrastructure has been modified
gcp	cloudbank	lacc	Core infrastructure has been modified
gcp	cloudbank	lamission	Core infrastructure has been modified
gcp	cloudbank	mills	Core infrastructure has been modified
gcp	cloudbank	mission	Core infrastructure has been modified
gcp	cloudbank	norco	Core infrastructure has been modified
gcp	cloudbank	palomar	Core infrastructure has been modified
gcp	cloudbank	pasadena	Core infrastructure has been modified
gcp	cloudbank	sjcc	Core infrastructure has been modified
gcp	cloudbank	sacramento	Core infrastructure has been modified
gcp	cloudbank	srjc	Core infrastructure has been modified
gcp	cloudbank	saddleback	Core infrastructure has been modified
gcp	cloudbank	santiago	Core infrastructure has been modified
gcp	cloudbank	sjsu	Core infrastructure has been modified
gcp	cloudbank	tuskegee	Core infrastructure has been modified
gcp	cloudbank	wlac	Core infrastructure has been modified
gcp	cloudbank	csulb	Core infrastructure has been modified
gcp	cloudbank	csum	Core infrastructure has been modified
gcp	meom-ige	prod	Core infrastructure has been modified
aws	catalystproject-africa	nm-aist	Core infrastructure has been modified
gcp	awi-ciroh	prod	Core infrastructure has been modified
aws	carbonplan	prod	Core infrastructure has been modified
aws	smithsonian	prod	Core infrastructure has been modified
aws	openscapes	prod	Core infrastructure has been modified
aws	nasa-ghg	prod	Core infrastructure has been modified
gcp	m2lines	prod	Core infrastructure has been modified
gcp	callysto	prod	Core infrastructure has been modified
gcp	pangeo-hubs	prod	Core infrastructure has been modified
gcp	pangeo-hubs	coessing	Core infrastructure has been modified
gcp	2i2c	imagebuilding-demo	Core infrastructure has been modified
gcp	2i2c	demo	Core infrastructure has been modified
gcp	2i2c	ohw	Core infrastructure has been modified
gcp	2i2c	aup	Core infrastructure has been modified
gcp	2i2c	temple	Core infrastructure has been modified
gcp	2i2c	ucmerced	Core infrastructure has been modified
gcp	2i2c	climatematch	Core infrastructure has been modified
gcp	2i2c	neurohackademy	Core infrastructure has been modified
gcp	2i2c	mtu	Core infrastructure has been modified
gcp	2i2c	jackeddy	Core infrastructure has been modified
gcp	hhmi	prod	Core infrastructure has been modified
aws	ubc-eoas	prod	Core infrastructure has been modified

consideRatio · 2023-10-24T07:23:56Z

config/clusters/2i2c-uk/lis.values.yaml

+      - display_name: "Small: ~512 MB RAM / ~0.5 CPU"
+        slug: mem_512m
+        default: true
+        kubespawner_override:
+          # increase as requested via https://2i2c.freshdesk.com/a/tickets/1066
+          mem_guarantee: 512M
+          mem_limit: 1G
+      - display_name: "Large: ~1 GB RAM / ~0.5 CPU"
+        slug: mem_1g
+        kubespawner_override:
+          # increase as requested via https://2i2c.freshdesk.com/a/tickets/1066
+          mem_guarantee: 1G
+          mem_limit: 2G


I'd like to avoid use of relative words like "small" and "large" because it will make communities use these words that we won't understand without reading up on their configuration first.

I think it would make sense for them to use the existing 4 CPU / 32 GB nodes with node resource allocation script generated requests/limits where the limit is 2x the request. Is the script able to do that atm? Hmmm...

I actually did a deploy with just the sizes as names and it felt very mathematical 😅

> I think it would make sense for them to use the existing 4 CPU / 32 GB nodes with node resource allocation script generated requests/limits where the limit is 2x the request. Is the script able to do that atm? Hmmm..

I am not familiar with what the script can do yet 😬

Neither am I, and I failed to quickly whip up a suggested alternative, so let's go with anything for now and iterate!

The script currently only generates requests and limits set to the same value, so nothing it produces matches exactly what we want, which is limits higher than requests.
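For illustration only, here is a minimal sketch of what generating options with a limit of 2x the guarantee could look like. This is not the node resource allocation script discussed above; `format_mb`, `make_profile`, and the display name wording are hypothetical, and only the `mem_guarantee` / `mem_limit` override keys come from the diff in this PR. Sizes are written in `M` throughout for simplicity.

```python
def format_mb(megabytes: int) -> str:
    """Render a memory size for display, using GB when it divides evenly."""
    if megabytes % 1024 == 0:
        return f"{megabytes // 1024} GB"
    return f"{megabytes} MB"


def make_profile(guarantee_mb: int, limit_factor: int = 2, default: bool = False) -> dict:
    """Build one profileList entry (hypothetical helper, not the 2i2c script).

    The memory limit is derived as limit_factor times the guarantee.
    """
    limit_mb = guarantee_mb * limit_factor
    profile = {
        # Name the option by its sizes rather than "Small" / "Large"
        "display_name": f"{format_mb(guarantee_mb)} RAM guaranteed, up to {format_mb(limit_mb)}",
        "slug": f"mem_{guarantee_mb}m",
        "kubespawner_override": {
            "mem_guarantee": f"{guarantee_mb}M",
            "mem_limit": f"{limit_mb}M",
        },
    }
    if default:
        profile["default"] = True
    return profile


# The two guarantees proposed in this PR: 512 MB (default) and 1 GB
profiles = [make_profile(512, default=True), make_profile(1024)]

for p in profiles:
    print(p["slug"], p["kubespawner_override"])
```

Deriving the limit from the guarantee keeps the ratio consistent across options, so adding another size later would only require one more `make_profile` call.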

consideRatio

This is approved as is, including the "Small" and "Large" mentions, and also with the currently deployed change. Let's iterate on this over time instead!

GeorgianaElena · 2023-10-24T08:15:20Z

@consideRatio, thank you! I was thinking just now, if we are moving to profileLists, why not add their initial memory guarantee as another, even smaller option (256MB)?

config/clusters/2i2c-uk/lis.values.yaml

consideRatio · 2023-10-24T08:52:03Z

@GeorgianaElena I suggest we go for #3308 (comment) and ask them for feedback

Co-authored-by: Erik Sundell <[email protected]>

GeorgianaElena · 2023-10-24T11:05:41Z

Update

Waiting for feedback before merging.

The profile list looks like:

GeorgianaElena · 2023-11-17T17:50:11Z

I will close this PR to de-clutter and will reopen it if/when there is agreement on this one.

GeorgianaElena added 2 commits October 24, 2023 09:54

Enable profile lists for lis

0ee0acf

Fix allowed_idp for 2i2c-uk staging hub

5974233

GeorgianaElena requested a review from a team as a code owner October 24, 2023 07:06

github-actions bot assigned GeorgianaElena Oct 24, 2023

consideRatio reviewed Oct 24, 2023


consideRatio approved these changes Oct 24, 2023


consideRatio reviewed Oct 24, 2023


GeorgianaElena and others added 2 commits October 24, 2023 12:53

Use a profile list with guarantee eq limits and allow more options

5822962

Co-authored-by: Erik Sundell <[email protected]>

Fix profile list code syntax

e829752

consideRatio approved these changes Oct 24, 2023


GeorgianaElena closed this Nov 17, 2023
