Document procedure and migrate existing AWS EKS based hubs from k8s 1.21+ to 1.24+ #2057
Comments
Re: GKE, I specifically put us in 'unspecified' because if we were in a release channel, not just the control plane but all the nodes would restart at an arbitrary time that GCP chooses. In practice, since we haven't been upgrading them ourselves, they've just been upgraded at a much slower pace. Figuring out a way to minimize disruption here would be great. Using GKE release channels could recreate entire large dask clusters or notebook nodes at inopportune times, so I think keeping the upgrade process manual, but actually doing the upgrades, is the way to go.
@yuvipanda maybe we could communicate to users that maintenance will be done automatically in pre-agreed time windows, which we may need anyhow if we do the upgrades ourselves. Maybe Terraform would have trouble if we let GKE do it instead of us, though? I'm not sure. But let's table this deliberation for another issue and keep this one focused on upgrading EKS clusters manually. I think it would be good to upgrade the GKE clusters manually at least once as well, to gain experience with doing it by hand, so I've opened #2157 about it. A rough sketch of what a manual GKE upgrade could look like is below.
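For reference, a manual GKE upgrade along those lines could look roughly like this. This is a minimal sketch only; the cluster name, zone, node pool name, and target version are placeholders for illustration, not taken from this issue:

```bash
# Upgrade the control plane first (GKE requires it to be at or ahead of the nodes).
gcloud container clusters upgrade my-gke-cluster \
  --zone us-central1-b \
  --master \
  --cluster-version 1.24

# Then upgrade each node pool to the control plane's version, one pool at a time
# to limit disruption to running user servers and dask workers.
gcloud container clusters upgrade my-gke-cluster \
  --zone us-central1-b \
  --node-pool core-pool
```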
I removed myself from being assigned to this. There is now documentation on how to do the upgrade, so I think it would be good if I'm not the one doing all the AWS upgrades.
It's done!!! All EKS k8s clusters are now on v1.24 or v1.25!
This is the current status of the AWS EKS clusters:
2i2c-aws-us: k8s 1.25, highmem nodes, node sharing profile list, ssh-keys #2343
carbonplan: update k8s from 1.19 to 1.24 is made, now update eksctl cluster config template #2085
gridsst: k8s 1.22 to 1.25, core node from m5 to r5, dask nodes from 4 different m5 to one r5.4xlarge #2373
discussed for upgrade in https://2i2c.freshdesk.com/a/tickets/543
nasa-cryo: k8s 1.22 to 1.25, node sharing setup #2374
nasa-veda: upgrade to k8s 1.25, highmem nodes, profile list with node sharing #2340
openscapes: update EKS cluster config templates from k8s 1.21 to 1.24 #2139
Created at 1.24
victor: k8s 1.22 to 1.25, core node from m5 to r5, dask nodes from 4 different m5 to one r5.4xlarge #2375
Action points
docs: add an aws k8s cluster upgrade guide #2142
Related
Original outdated issue
Such migration will require some additional steps related to #2054 and #2056.
In practice, I think it involves updating the .jsonnet eksctl cluster configuration files we have in the eksctl folder to match how they would look if we regenerated them from their jinja2 template (template.jsonnet).
It also involves manually adding the EKS addon via an eksctl command, and recreating the node pools, etc. This will cause disruption. I've outlined the steps I took when I updated the JMTE hub in the eksctl cluster config file that is part of the JMTE PR branch. These may not be the exact steps we ought to take, but they can help guide the steps we should take. A rough sketch of the kind of commands involved follows.
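As an illustration of the kind of commands involved (a sketch only; the cluster name and config file are placeholders, and the authoritative procedure is the guide from #2142):

```bash
# Regenerate the eksctl cluster config from the .jsonnet template, then upgrade
# the control plane one minor version at a time (EKS can't skip minor versions).
jsonnet carbonplan.jsonnet > carbonplan.eksctl.yaml
eksctl upgrade cluster --config-file=carbonplan.eksctl.yaml --approve

# Recreate the node groups against the new version, then delete the old ones
# that are no longer listed in the config. This is the disruptive part.
eksctl create nodegroup --config-file=carbonplan.eksctl.yaml
eksctl delete nodegroup --config-file=carbonplan.eksctl.yaml --only-missing --approve
```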
Update: for the ebs driver stuff, I think we need to add the ebs driver addon rather than adding iam stuff to the nodeGroups. If I'm wrong, we may need to revert 8fe4009 from #2056.
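A hedged sketch of what adding the EBS CSI driver as an EKS-managed addon could look like; the cluster name and IAM role ARN below are placeholders, and the exact flags we settle on belong in the upgrade guide from #2142:

```bash
# Install the EBS CSI driver as an EKS addon instead of attaching IAM policies
# to the nodeGroups themselves.
eksctl create addon \
  --cluster carbonplan \
  --name aws-ebs-csi-driver \
  --service-account-role-arn arn:aws:iam::123456789012:role/ebs-csi-driver-role \
  --force
```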