
Use unified nginx helm chart #636

Open · wants to merge 1 commit into base: main
Conversation

@lianhao (Collaborator) commented Dec 11, 2024

Description

  • Add helm chart support for OPEA nginx

  • Use unified nginx chart in E2E.

Issues

Fixes #635.

Type of change

List the type of change, as below. Delete options that are not relevant.

  • New feature (non-breaking change which adds new functionality)

Dependencies

List any newly introduced 3rd-party dependencies, if they exist.

Tests

Describe the tests that you ran to verify your changes.

@lianhao lianhao requested a review from Ruoyu-y December 11, 2024 08:42
@lianhao lianhao requested a review from yongfengdu as a code owner December 11, 2024 08:42
@lianhao lianhao force-pushed the unified_nginx branch 2 times, most recently from 030fe5b to 4d71f08 Compare December 11, 2024 08:48
Review threads (outdated, resolved) on:

  • helm-charts/common/nginx/values.yaml
  • helm-charts/common/nginx/templates/deployment.yaml
  • helm-charts/common/nginx/README.md
@lianhao (Collaborator, Author) commented Dec 12, 2024

The faqgen failure is due to the known issue of opea-project/GenAIComps#969

@lianhao lianhao force-pushed the unified_nginx branch 2 times, most recently from 002d680 to 6139cdb Compare December 12, 2024 02:04
@lianhao (Collaborator, Author) commented Dec 20, 2024

We need to wait until langserve 0.3.1 is available on PyPI to unblock the faqgen CI failure.

- Add helm chart support for OPEA nginx

- Use unified nginx chart in E2E

Signed-off-by: Lianhao Lu <[email protected]>
@lianhao (Collaborator, Author) commented Dec 20, 2024

Gaudi CI environment issue:
huggingface_hub.utils._errors.HfHubHTTPError: 429 Client Error: Too Many Requests for url: https://huggingface.co/api/whoami-v2

Comment on lines +82 to +83
targetCPUUtilizationPercentage: 80
# targetMemoryUtilizationPercentage: 80
Contributor:

IMHO these variable names could be a bit shorter.

Suggested change
targetCPUUtilizationPercentage: 80
# targetMemoryUtilizationPercentage: 80
targetCPUPercentage: 80
# targetMemoryPercentage: 80
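
For context, values like these are typically consumed by a chart's HPA template. A minimal sketch of the rendered HorizontalPodAutoscaler this might produce, assuming a conventional `autoscaling` block and illustrative names (release name, replica bounds are placeholders, not taken from this chart):

```yaml
# Hypothetical rendered HPA for the nginx chart when a CPU
# utilization target is set in values.yaml; names are illustrative.
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: nginx
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: nginx
  minReplicas: 1          # placeholder bound
  maxReplicas: 5          # placeholder bound
  metrics:
    - type: Resource
      resource:
        name: cpu
        target:
          type: Utilization
          averageUtilization: 80  # from the values.yaml target above
```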

Comment on lines +66 to +74
# We usually recommend not to specify default resources and to leave this as a conscious
# choice for the user. If you do want to specify resources, uncomment the following
# lines, adjust them as necessary, and remove the curly braces after 'resources:'.
# limits:
#   cpu: 100m
#   memory: 128Mi
# requests:
#   cpu: 100m
#   memory: 128Mi
Contributor @eero-t commented Dec 20, 2024

I've never used CPU/memory metrics for scaling (only application-specific custom metrics), but according to the docs: https://kubernetes.io/docs/tasks/run-application/horizontal-pod-autoscale/#how-does-a-horizontalpodautoscaler-work

CPU/mem resource requests need to be specified for HPA to work on relative CPU/mem targets:

For per-pod resource metrics (like CPU), the controller fetches the metrics from the resource metrics API for each Pod targeted by the HorizontalPodAutoscaler. Then, if a target utilization value is set, the controller calculates the utilization value as a percentage of the equivalent resource request on the containers in each Pod.
...
Please note that if some of the Pod's containers do not have the relevant resource request set, CPU utilization for the Pod will not be defined and the autoscaler will not take any action for that metric.

Contributor @eero-t commented Dec 20, 2024


So I think that:

  • The requests part should be enabled, after verifying that the current values match "normal" resource usage for the specified nginx version
  • The comment for the requests values should state that "normal" nginx resource usage must be checked, and the values updated accordingly, before enabling autoscaling
  • The comment for limits needs to state that they should be enabled only after checking how much nginx needs when stressed, using those values plus some headroom for growth (increased buffers, possible leakage, etc.)
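
Put together, the suggestion above could look roughly like this in the chart's values.yaml (the numbers are placeholders carried over from the existing defaults, not measured nginx usage):

```yaml
# Requests are required for HPA relative CPU/memory utilization
# targets to work. Check "normal" resource usage for the pinned
# nginx version and update these values before enabling autoscaling.
requests:
  cpu: 100m       # placeholder; replace with measured baseline usage
  memory: 128Mi   # placeholder; replace with measured baseline usage
# Enable limits only after measuring how much nginx needs under
# stress, and add some headroom for growth (increased buffers,
# possible leakage, etc.).
# limits:
#   cpu: 200m
#   memory: 256Mi
```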

Collaborator (Author) @lianhao:


Can we do this in another PR, along with issue #643?

Labels: none yet
Projects: none yet
Development

Successfully merging this pull request may close these issues.

[Feature] Use unified nginx chart
3 participants