You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Introduce a new metric specifically designed to reflect the readiness status of serverless function controller:
AC:
it should indicate whether the function controller's main reconciliation loop is ready to serve requests or not (if the queue is served)
the frequency of metric update should be independent from kubernetes probing frequency configuration (i.e separate go rutine with own ticker).
frequent probing should not have negative effect on function-controller performance; probe should add an event for function controller who serves it with a fast exit. (we have it already. health probing is entering reconciliation loop)
No user misconfigurations (i.e invalid function CR or function code) should have an effect on the metric (and disrupt the SLO budget)
the metric should be observable in the time frame (via promql) so that observer can model alerting rules based on aggregated time series.
The above criteria are for the basic availability indication.
Think of additional availability indicator for serverless that could be used to inspect weather every requested function CR was "attempted to be built" and those which were successfully built were "attempted to be deployed"
Reasons
Ensure SLO is observable for serverless.
Enable administrators to set up alerting and monitoring based on function controller readiness.
Attachments
The text was updated successfully, but these errors were encountered:
kwiatekus
changed the title
Expose Function Controller Readiness Metric for Prometheus-based Monitoring
Expose function controller readiness metric for prometheus-based monitoring
Apr 9, 2024
Description
Introduce a new metric specifically designed to reflect the readiness status of serverless function controller:
AC:
The above criteria are for the basic availability indication.
Think of additional availability indicator for serverless that could be used to inspect weather every requested function CR was "attempted to be built" and those which were successfully built were "attempted to be deployed"
Reasons
Ensure SLO is observable for serverless.
Enable administrators to set up alerting and monitoring based on function controller readiness.
Attachments
The text was updated successfully, but these errors were encountered: