Improve VPC CNI memory by reducing number of things it is caching #2887

GnatorX · 2024-04-18T20:47:38Z

What would you like to be added:
Narrow down what VPC CNI is caching to reduce memory utilization in large clusters. Currently we see pretty high memory utilization and seems to scale with nodes.

Why is this needed:
The current behavior is pretty problematic when cluster size gets large (5000+ nodes) causing us to increase memory request for the CNI even though it isn't necessary for the CNI to use all that memory.

I believe simply adding a new ByObject Filter on Node object is enough to reduce cache utilization. Since https://github.com/aws/amazon-vpc-cni-k8s/blob/master/pkg/k8sapi/k8sutils.go#L183 only gets the node object the CNI is running on.

orsenthil · 2024-04-19T03:01:48Z

Thanks for the report and the Pull Request. Have you done any measurements with and without this change? Could you share the differences?

GnatorX · 2024-04-19T03:34:10Z

Not yet. Will update once I have tested this

GnatorX · 2024-04-19T17:24:35Z

@orsenthil I am wondering if it make sense to even cache nodes. K8s caches which usesList + watches on startup are extremely expensive calls. The CNI only cares about the node it is running on and calls with node name is index from k8s side which is relatively fast. Rather than filtering, why not just use non-cached calls get that information?

The availability difference isn't that high, watches vs a call.

GnatorX · 2024-04-29T18:08:25Z

I took a pprof of the issue.

It seems like the issue is with the stream watcher is consuming memory during cluster size increase. It seems to require quite a bit of memory in order to process all nodes and store it in the memory. Even though the memory consumption isn't very high, its still unnecessary to store all node information in cache.

I need to re-test this with my change however I do believe the real solution is to avoid performing list watch against all nodes and only watch for node events specific to the CNI.

orsenthil · 2024-05-01T20:24:25Z

K8s caches which usesList + watches on startup are extremely expensive calls

Even though the memory consumption isn't very high, its still unnecessary to store all node information in cache.

I do believe the real solution is to avoid performing list watch against all nodes and only watch for node events specific to the CNI.

It is pretty standard for k8s client calls to use the cached client. It will be good to measure difference in the memory usage and the performance of the various operations in the large clusters before we decide to not use the cache.

With your changes, if you see any different in both memory and performance, please share an update here.

GnatorX · 2024-05-01T20:31:58Z

It is pretty standard for k8s client calls to use the cached client. It will be good to measure difference in the memory usage and the performance of the various operations in the large clusters before we decide to not use the cache.

Agreed.

When I tested my changes, it didn't yield significant difference in memory utilization. I believe, as shown in the pprof, the memory usage is because of the stream watcher attempting unmarshal incoming data. I think rather than using a informer cache and raw watch against the node itself may be more efficient(?).

I can close to issue for now since I likely don't have time to look into writing a direct watcher instead and I think the memory spike isn't large enough to be a concern.

orsenthil · 2024-06-26T20:09:04Z

I can close to issue for now since I likely don't have time to look into writing a direct watcher instead and I think the memory spike isn't large enough to be a concern.

This sounds reasonable, if we have better proof with improvements, we can bring this change in.

github-actions · 2024-06-26T20:09:21Z

This issue is now closed. Comments on closed issues are hard for our team to see.
If you need more assistance, please either tag a team member or open a new issue that references this one.

GnatorX · 2024-10-10T18:09:49Z

@orsenthil can we reopen this. We have more data now

GnatorX · 2024-10-10T18:17:13Z

When we added the filter #2888 we were able to drop our memory utilization on a 3000 nodes cluster.

orsenthil · 2024-10-22T00:46:30Z

When we added the filter #2888 we were able to drop our memory utilization on a 3000 nodes cluster.

Could you explain this a bit more, how did adding cache filter on node reduce the VPC CNI memory utilization? Is it due to stream watcher you attributed to here - #2887 (comment)

GnatorX · 2024-10-23T17:31:36Z

Sorry i realize now that I commented on the PR and not the issue. Feel free to close this against the PR since it is merged now

GnatorX added enhancement feature request labels Apr 18, 2024

GnatorX mentioned this issue Apr 18, 2024

Add byobject filter on nodes #2888

Merged

orsenthil closed this as completed Jun 26, 2024

orsenthil reopened this Oct 10, 2024

orsenthil added this to the v1.18.6 milestone Oct 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve VPC CNI memory by reducing number of things it is caching #2887

Improve VPC CNI memory by reducing number of things it is caching #2887

GnatorX commented Apr 18, 2024 •

edited

Loading

orsenthil commented Apr 19, 2024

GnatorX commented Apr 19, 2024

GnatorX commented Apr 19, 2024 •

edited

Loading

GnatorX commented Apr 29, 2024

orsenthil commented May 1, 2024

GnatorX commented May 1, 2024 •

edited

Loading

orsenthil commented Jun 26, 2024

github-actions bot commented Jun 26, 2024

GnatorX commented Oct 10, 2024

GnatorX commented Oct 10, 2024

orsenthil commented Oct 22, 2024

GnatorX commented Oct 23, 2024

Improve VPC CNI memory by reducing number of things it is caching #2887

Improve VPC CNI memory by reducing number of things it is caching #2887

Comments

GnatorX commented Apr 18, 2024 • edited Loading

orsenthil commented Apr 19, 2024

GnatorX commented Apr 19, 2024

GnatorX commented Apr 19, 2024 • edited Loading

GnatorX commented Apr 29, 2024

orsenthil commented May 1, 2024

GnatorX commented May 1, 2024 • edited Loading

orsenthil commented Jun 26, 2024

github-actions bot commented Jun 26, 2024

GnatorX commented Oct 10, 2024

GnatorX commented Oct 10, 2024

orsenthil commented Oct 22, 2024

GnatorX commented Oct 23, 2024

GnatorX commented Apr 18, 2024 •

edited

Loading

GnatorX commented Apr 19, 2024 •

edited

Loading

GnatorX commented May 1, 2024 •

edited

Loading