[ADR]: 🎚️ Limiting resource requests and limits on the Dagster webserver and daemon #7
Closed
2 tasks done
Labels
accepted
Suggestion has been accepted
ADR
This issue is labeled as an ADR
enhancement
New feature or request
✍️ Context
To reduce spending on the dagster deployment on GKE, we should limit the resources allocated for the long-running services.
These services are:
The resource limits can be set on the dagster helm chart values.yml.
Current resource requests
Looking at the pod deployments, we see that the following resource requests and limits are set for the webserver, daemon, and code location:
Currently, all three Dagster services have unused resources
CPU (ranked in terms of unused resources)
Memory (ranked in terms of unused resources)
Requested versus used resource requests (from GKE workload overview, 24 hour window)
Daemon
Webserver
Code location
🤝 Decision
Set the resource constraints as follows:
Daemon
Webserver
Code location
These values can be set in "dagster-infra/app.tf" as follows:
These values can be set in "dagster-dags/values.yaml.j2" as follows:
💥 Impact
Shouldn't impact users as long as we properly monitor resources. We don't currently do this. We have a ticket to pick this up, see #5
☝️ Consequences
Harder:
📝 Checklist (after ADR has been accepted)
The text was updated successfully, but these errors were encountered: