In contrast to the upstream kube-prometheus-stack chart, the Kyma monitoring stack offers only limited configuration options, and modifications might be reset at the next upgrade cycle. This tutorial outlines how to set up your own kube-prometheus-stack, including a basic setup of Prometheus, Grafana, and Alertmanager.
CAUTION:
- This tutorial describes a basic setup that you should not use in production. Typically, a production setup needs further configuration, like optimizing the amount of data to scrape and the required resource footprint of the installation. To achieve qualities like high availability, scalability, or durable long-term storage, you need a more advanced setup.
- This example uses the latest Grafana version, which is under AGPL-3.0 and might not be free of charge for commercial usage.
Prerequisites:
- Kyma as the target deployment environment
- The Istio module is enabled
- kubectl > 1.22.x
- Helm 3.x
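You can quickly check the installed versions of both tools:
```bash
kubectl version --client
helm version
```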
Installation:
- Export your Namespace as a variable. Replace the `{namespace}` placeholder in the following command and run it:
  ```bash
  export K8S_NAMESPACE="{namespace}"
  ```
- If you haven't created the Namespace yet, now is the time to do so:
  ```bash
  kubectl create namespace $K8S_NAMESPACE
  ```
  Note: This Namespace must have no Istio sidecar injection enabled; that is, there must be no `istio-injection` label present on the Namespace. The Helm chart deploys jobs that will not succeed when Istio sidecar injection is enabled.
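  To verify that the label is absent, list the labels of the Namespace:
  ```bash
  kubectl get namespace $K8S_NAMESPACE --show-labels
  ```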
- Export the Helm release name that you want to use. It can be any name, but be aware that all resources in the cluster will be prefixed with that name. Run the following command:
  ```bash
  export HELM_PROM_RELEASE="prometheus"
  ```
- Update your Helm installation with the required Helm repository:
  ```bash
  helm repo add prometheus-community https://prometheus-community.github.io/helm-charts
  helm repo update
  ```
- Run the Helm upgrade command, which installs the chart if it's not present yet. At the end of the command, change the Grafana admin password to a value of your choice:
  ```bash
  helm upgrade --install -n ${K8S_NAMESPACE} ${HELM_PROM_RELEASE} prometheus-community/kube-prometheus-stack -f https://raw.githubusercontent.com/kyma-project/examples/main/prometheus/values.yaml --set grafana.adminPassword=myPwd
  ```
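  Afterwards, you can confirm that the release was deployed:
  ```bash
  helm list -n ${K8S_NAMESPACE}
  ```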
- You can use the values.yaml provided with this tutorial, which contains customized settings deviating from the default settings, or create your own one. The provided `values.yaml` covers the following adjustments (see the sketch after this list):
  - Client certificate injection to support scraping of workloads secured with Istio strict mTLS
  - Active scraping of workloads annotated with `prometheus.io/scrape`
  - Basic configuration of data persistence with retention
  - Basic resource limits for the involved components
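  For orientation, the persistence and resource adjustments typically take the following shape in the chart values. This is an illustrative sketch with made-up numbers, not the contents of the linked `values.yaml`:
  ```yaml
  prometheus:
    prometheusSpec:
      retention: 30d             # how long to keep the scraped data
      retentionSize: "25GB"      # cap the on-disk size of the TSDB
      resources:
        limits:
          memory: 2Gi            # basic resource limits for Prometheus
      storageSpec:
        volumeClaimTemplate:     # persist data on a PersistentVolume
          spec:
            resources:
              requests:
                storage: 30Gi
  ```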
- To configure Prometheus for scraping the Istio-specific metrics from any istio-proxy running in the cluster, deploy a PodMonitor, which scrapes any Pod that exposes a port with a name matching `.*-envoy-prom`:
  ```bash
  kubectl -n ${K8S_NAMESPACE} apply -f https://raw.githubusercontent.com/kyma-project/examples/main/prometheus/istio/podmonitor-istio-proxy.yaml
  ```
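  The core of such a PodMonitor is a `keep` relabeling on the container port name. The following is an illustrative sketch (the resource name is hypothetical; the linked manifest is authoritative):
  ```yaml
  apiVersion: monitoring.coreos.com/v1
  kind: PodMonitor
  metadata:
    name: istio-proxies          # hypothetical name
  spec:
    namespaceSelector:
      any: true                  # look at Pods in all Namespaces
    selector: {}                 # match every Pod
    podMetricsEndpoints:
    - path: /stats/prometheus    # stats endpoint of the istio-proxy
      relabelings:
      - action: keep             # keep only ports named like *-envoy-prom
        sourceLabels: [__meta_kubernetes_pod_container_port_name]
        regex: '.*-envoy-prom'
  ```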
- Deploy a ServiceMonitor definition for the central metrics of the `istiod` deployment:
  ```bash
  kubectl -n ${K8S_NAMESPACE} apply -f https://raw.githubusercontent.com/kyma-project/examples/main/prometheus/istio/servicemonitor-istiod.yaml
  ```
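  Such a ServiceMonitor selects the istiod Service by label and scrapes its monitoring port. A hedged sketch, assuming the default `istio: pilot` label and `http-monitoring` port of an Istio installation in `istio-system`:
  ```yaml
  apiVersion: monitoring.coreos.com/v1
  kind: ServiceMonitor
  metadata:
    name: istiod                  # hypothetical name
  spec:
    namespaceSelector:
      matchNames: [istio-system]  # where istiod runs
    selector:
      matchLabels:
        istio: pilot              # default label of the istiod Service
    endpoints:
    - port: http-monitoring       # port exposing the istiod metrics
  ```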
- Get the latest versions of the Istio-specific dashboards. Grafana is configured to load dashboards dynamically from ConfigMaps in the cluster, so Istio-specific dashboards can be applied as well. Either follow the Istio quick start instructions, or take the prepared ones by running the following commands:
  ```bash
  kubectl -n ${K8S_NAMESPACE} apply -f https://raw.githubusercontent.com/kyma-project/examples/main/prometheus/istio/configmap-istio-grafana-dashboards.yaml
  kubectl -n ${K8S_NAMESPACE} apply -f https://raw.githubusercontent.com/kyma-project/examples/main/prometheus/istio/configmap-istio-services-grafana-dashboards.yaml
  ```
NOTE: This setup collects all Istio metrics on Pod level, which can lead to cardinality issues. Because the metrics are needed on service level only, setups with a larger amount of workloads should use a federation-based approach as described in the Istio documentation.
Verify the installation:
- You should see several Pods coming up in the Namespace, especially Prometheus and Alertmanager. Assure that all Pods reach the "Running" state.
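  To check, list the Pods in the Namespace:
  ```bash
  kubectl -n ${K8S_NAMESPACE} get pods
  ```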
- Browse the Prometheus dashboard and verify that all "Status > Targets" are healthy. The following command exposes the dashboard on `http://localhost:9090`:
  ```bash
  kubectl -n ${K8S_NAMESPACE} port-forward $(kubectl -n ${K8S_NAMESPACE} get service -l app=kube-prometheus-stack-prometheus -oname) 9090
  ```
- Browse the Grafana dashboard and verify that the dashboards are showing data. The user `admin` is preconfigured in the Helm chart; the password was provided in your `helm install` command. The following command exposes the dashboard on `http://localhost:3000`:
  ```bash
  kubectl -n ${K8S_NAMESPACE} port-forward svc/${HELM_PROM_RELEASE}-grafana 3000:80
  ```
Deploy a custom workload and scrape its metrics:
Follow the tutorial monitoring-custom-metrics, but use the steps above to verify that the metrics are collected.
Instead of defining a ServiceMonitor per workload to set up custom metric scraping, you can use a simplified way based on annotations. The used values.yaml defines an `additionalScrapeConfig`, which scrapes all Pods and Services that have the following annotations:
```yaml
prometheus.io/scrape: "true"   # mandatory to enable automatic scraping
prometheus.io/scheme: https    # optional; default is "http" if no Istio sidecar is used. When using a sidecar (the Pod has the label `security.istio.io/tlsMode=istio`), the default is "https". Use "https" to scrape workloads using Istio client certificates.
prometheus.io/port: "1234"     # optional; configure the port under which the metrics are exposed
prometheus.io/path: /myMetrics # optional; configure the path under which the metrics are exposed
```
You can try it out by removing the ServiceMonitor from the previous example and instead providing the annotations to the Service manifest, as shown in the sketch below.
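For illustration, a Service annotated this way might look as follows; the workload name, port, and path are hypothetical:
```yaml
apiVersion: v1
kind: Service
metadata:
  name: sample-metrics            # hypothetical workload name
  annotations:
    prometheus.io/scrape: "true"  # enable automatic scraping
    prometheus.io/port: "8080"    # port exposing the metrics
    prometheus.io/path: /metrics  # path exposing the metrics
spec:
  selector:
    app: sample-metrics
  ports:
  - name: http
    port: 8080
```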
Set up Alertmanager:
- You can connect the Alertmanager to your notification channel (for instance, Slack or VictorOps) by providing an Alertmanager configuration to the `alertmanager.config` value. The alertmanager-values.yaml example provides a configuration that sends notifications for alerts with high severity to a Slack channel. To deploy it, download the file, adapt `<channel-name>`, `<api-url>`, and `<cluster-domain>` to your environment, and run the Helm upgrade command to deploy the configuration:
  ```bash
  helm upgrade --install -n ${K8S_NAMESPACE} ${HELM_PROM_RELEASE} prometheus-community/kube-prometheus-stack -f https://raw.githubusercontent.com/kyma-project/examples/main/prometheus/values.yaml -f ./alertmanager-values.yaml --set grafana.adminPassword=myPwd
  ```
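  The general shape of such a configuration is a route matching on severity plus a Slack receiver. The following is an illustrative sketch; see the linked alertmanager-values.yaml for the actual example, and keep the placeholders until you adapt them:
  ```yaml
  alertmanager:
    config:
      route:
        receiver: "null"              # default: ignore everything not routed explicitly
        routes:
        - receiver: slack
          matchers:
          - severity = "critical"     # forward only high-severity alerts
      receivers:
      - name: "null"
      - name: slack
        slack_configs:
        - channel: "#<channel-name>"  # placeholder from the example file
          api_url: "<api-url>"        # placeholder: Slack webhook URL
          send_resolved: true         # also notify when an alert resolves
  ```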
- Follow the tutorial monitoring-alert-rules to set up an alerting rule on Prometheus.
- Follow the tutorial monitoring-grafana-dashboard to learn how to visualize your metrics in a Grafana dashboard.
Cleanup:
To remove the installation from the cluster, call Helm:
```bash
helm delete -n ${K8S_NAMESPACE} ${HELM_PROM_RELEASE}
```
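Note that Helm does not remove the custom resource definitions installed by the chart. If you want to get rid of them as well, delete them manually; the following list assumes the standard kube-prometheus-stack CRDs and that no other installation still uses them:
```bash
kubectl delete crd alertmanagerconfigs.monitoring.coreos.com
kubectl delete crd alertmanagers.monitoring.coreos.com
kubectl delete crd podmonitors.monitoring.coreos.com
kubectl delete crd probes.monitoring.coreos.com
kubectl delete crd prometheuses.monitoring.coreos.com
kubectl delete crd prometheusrules.monitoring.coreos.com
kubectl delete crd servicemonitors.monitoring.coreos.com
kubectl delete crd thanosrulers.monitoring.coreos.com
```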