Application Logging on Red Hat OpenShift Container Platform (RHOCP) with Elasticsearch, Fluentd, and Kibana
The following guide has been tested with Red Hat OpenShift Container Platform (RHOCP) 4.7 and 4.8.
Pod processes running in Kubernetes frequently produce logs. To effectively manage this log data and ensure no loss of log data occurs when a pod terminates, a log aggregation tool should be deployed on the Kubernetes cluster. Log aggregation tools help users persist, search, and visualize the log data that is gathered from the pods across the cluster. Log aggregation tools in the market today include: EFK, LogDNA, Splunk, Datadog, IBM Operations Analytics, etc. When considering log aggregation tools, enterprises make choices that are inclusive of their journey to cloud, both new cloud-native applications running in Kubernetes and their existing traditional IT choices.
One choice for application logging with log aggregation, based on open source, is EFK (Elasticsearch, Fluentd, and Kibana). This guide describes the process of deploying EFK using the Elasticsearch Operator and the Red Hat Openshift Logging Operator. Use this preconfigured EFK stack to aggregate all container logs. After a successful installation, the EFK pods should reside inside the openshift-logging namespace of the cluster.
To install the Openshift Logging component, follow the OpenShift guide Installing Openshift Logging. Ensure you set up valid storage for Elasticsearch via Persistent Volumes. When the example Openshift Logging instance YAML from the guide is deployed, the Elasticsearch pods that are created automatically search for Persistent Volumes to bind to; if there are none available for binding, the Elasticsearch pods are stuck in a pending state. Using in-memory storage is also possible by removing the storage
definition from the Openshift Logging instance YAML, but this is not suitable for production.
After the installation completes without any error, you can see the following pods that are running in the openshift-logging namespace. The exact number of pods running for each of the EFK components can vary depending on the configuration specified in the ClusterLogging Custom Resource (CR).
[root@rhel7-ocp ~]# oc get pods -n openshift-logging
NAME READY STATUS RESTARTS AGE
cluster-logging-operator-874597bcb-qlmlf 1/1 Running 0 150m
curator-1578684600-2lgqp 0/1 Completed 0 4m46s
elasticsearch-cdm-4qrvthgd-1-5444897599-7rqx8 2/2 Running 0 9m6s
elasticsearch-cdm-4qrvthgd-2-865c6b6d85-69b4r 2/2 Running 0 8m3s
fluentd-rmdbn 1/1 Running 0 9m5s
fluentd-vtk48 1/1 Running 0 9m5s
kibana-756fcdb7f-rw8k8 2/2 Running 0 9m6s
The Openshift Logging instance also exposes a route for external access to the Kibana console.
[root@rhel7-okd ~]# oc get routes -n openshift-logging
NAME HOST/PORT PATH SERVICES PORT TERMINATION WILDCARD
kibana kibana-openshift-logging.apps.host.kabanero.com kibana <all> reencrypt/Redirect None
In OpenShift, the OpenShift Logging Fluentd collectors capture the application container logs and set each log into a message field of a Fluentd JSON document. If the logs are being output in JSON format, then they will be nested inside the the Fluentd JSON document’s message field. In order to properly utilize the JSON log data inside a Kibana dashboard, the individual fields inside the nested JSON logs must be parsed out.
Starting from RHOCP 4.7, the ability to parse these nested JSON application container logs has been provided with the additional deployment of a Cluster Log Forwarder instance. Once deployed, the configured Cluster Log Forwarder instance will copy the nested JSON log into a separate structured
field inside the Fluentd JSON document, where the individual fields from the JSON container log can be accessed as structured.<field_name>
.
In order to avoid the possibility of conflicting JSON fields, where different products or applications may use the same JSON field names to represent different data types, the Cluster Log Forwarder instance requires JSON container logs from different application/product types to be separated into unique indices. In the following instructions, the ClusterLogForwarder Custom Resource (CR) will create these unique indices using a label attached to the service of your application.
First, add the label logFormat: liberty
to your OpenLibertyApplication Custom Resource. The Cluster Log Forwarder instance will use this label later to create a unique index for the container’s application logs.
kind: Service
apiVersion: v1
metadata:
name: <your-liberty-app>
labels:
logFormat: liberty
....
Restart your application deployment to include the updated label in your application’s service and pod.
Create the following YAML file cluster-logging-forwarder.yaml
to configure a Cluster Log Forwarder instance that will parse your JSON container logs.
apiVersion: logging.openshift.io/v1
kind: ClusterLogForwarder
metadata:
namespace: openshift-logging
name: instance
spec:
inputs:
- name: liberty-logs
application:
namespaces:
- liberty-app #modify this value to be your own app namespace
outputDefaults:
elasticsearch:
structuredTypeKey: kubernetes.labels.logFormat
structuredTypeName: nologformat
outputs:
pipelines:
- name: parse-liberty-json
inputRefs:
- liberty-logs
outputRefs:
- default
parse: json
The YAML above sets up a new ClusterLogForwarder pipeline parse-liberty-json
. This pipeline takes in an input of liberty_logs
, all the container logs from the liberty-app
namespace, and outputs them to the OpenShift cluster’s default Elasticsearch log store. The parse: json
definition enables the JSON log parsing.
The configured outputDefaults.elasticsearch.structuredTypeKey
parameter will build a unique index for the container logs by prepending app-
to the value of the logFormat
label in the container. Previously, the label logFormat: liberty
was added to the service of your OpenLibertyApplication, so the logs forwarded to the Elasticsearch default log store can be found under an index pattern of app-liberty-*
. If no logFormat
label exists in your application container, the outputDefaults.elasticsearch.structuredTypeName
parameter provides a fallback index name.
Deploy the Cluster Log Forwarder instance with the following command:
[root@rhel7-ocp ~]# oc create -f cluster-logging-forwarder.yaml
For more information on parsing JSON logs in RHOCP, see the the OpenShift guide Enabling JSON logging.
In cases where the application server provides the option, output application logs in JSON format. This lets you fully take advantage of Kibana’s dashboard functions. Kibana is then able to process the data from each individual field of the JSON object to create customized visualizations for that field.
See the Kibana dashboard page by using the routes URL https://kibana-openshift-logging.apps.host.kabanero.com. Log in using your Kubernetes user and password. The browser will redirect you to Kibana’s Create index pattern page under Management. Create a new index pattern app-liberty-* (that was defined above) to select all the Elasticsearch indices used for your Liberty application logs. Navigate to the Discover page to view the application logs generated by the deployed application.
Expand an individual log entry to see the structured.*
formatted individual fields, parsed and copied out of the nested JSON log entry.
Kibana dashboards created for Open Liberty logging events can be found here. To import a dashboard and its associated objects, navigate back to the Management page and click Saved Objects. Click Import and select the dashboard file. When prompted, click the Yes, overwrite all option.
Head back to the Dashboard page and enjoy navigating logs on the newly imported dashboard.
If changes must be made for the installed EFK stack, edit the ClusterLogging Custom Resource (CR) of the deployed Openshift Logging instance. If the EFK stack is no longer needed, remove the Openshift Logging instance from the Red Hat Openshift Logging Operator Details page.