
[Request]: Update 'service.name' docs with additional guidance #4102

Closed · 1 task done · Tracked by #189501 ...
roshan-elastic opened this issue Jul 31, 2024 · 14 comments

roshan-elastic (Contributor) commented Jul 31, 2024

Relates to:

Description

We need to add some additional content to the docs that explains how to declare a service in your logs:

https://www.elastic.co/docs/current/serverless/observability/add-logs-service-name

Changes required

  1. Explain that log.level needs to be present in order to display log metrics for a service in the new experience
  2. Document a potentially common use case which a user could come across

Background
To filter out unhelpful APM logs unrelated to logging (e.g. APM transaction errors), we are forcing the 'log rate' and 'log error %' metrics to require log.level in order to work.

Services Inventory - New Experience (screenshot)

Services View - New Experience (screenshot)

Specifically:

Log Rate
Rate of logs per minute observed for a given service.name.

Formula Calculation:
count(kql='log.level: *') / [PERIOD_IN_MINUTES]

Log Error %
% of logs where an error is detected for a given service.name.

Formula Calculation:
count(kql='log.level: "error" OR log.level: "ERROR"') / count(kql='log.level: *')
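
For example (an illustrative document shape with a hypothetical service name, not taken from the product), a log entry like this would count toward both metrics because log.level is present at the root:

{
  "@timestamp": "2024-07-31T08:26:45Z",
  "service.name": "opbeans-node",
  "log.level": "error",
  "message": "proxying API request to http://opbeans:3000"
}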

log.level isn't always provided automatically by our various ingestion methods (e.g. Beats, Elastic Agent), so we need to provide some guidance to explain this and suggest how to add it.

Additionally, there is a likely common use case where the log.level is nested within a message and is therefore not used in our metrics. We would like to provide some guidance around this.

1. Explain that log.level needs to be present in order to display log metrics for a service in the new experience

Note: We will be updating the UI to point to this documentation:

(screenshot)

We should provide some guidance on how to declare log.level in your logs via Elastic Agent. For example, here is some documentation on how to do this for standalone:

https://www.elastic.co/guide/en/fleet/current/elastic-agent-standalone-logging-config.html

Perhaps there is more comprehensive guidance? I'll add our engineers as contacts in case they can help.

2. Document a potentially common use case which a user could come across

One potentially common use case is where users specify a service.name in container or Kubernetes logs (although it's not exclusive to this use case). We should write a bit about this to give them an example of how to work around it:

Example cluster (screenshot)

In this case, the log relating to the service is nested within the message:

message:{"@timestamp":"2024-07-31T08:26:45Z","log.level":"info","message":"proxying API request to http://opbeans:3000"}

There are methods to decode this, but users will need to be careful that they don't override the existing log.level when they do.

We should make them aware of this use case and give them some guidance on how to pull out the encoded log.level and surface it in the main document (being careful not to overwrite the existing log.level).
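
To make this concrete, here is a rough sketch of one possible ingest pipeline (hypothetical and untested; the processor sequence and the "parsed" temporary field are my own, not an official recommendation). It parses the JSON in message into a temporary field, expands the flat "log.level" key, and only copies the nested value to the root if no log.level is already set:

{
  "description": "Sketch: surface a nested log.level without overwriting an existing one",
  "processors": [
    {
      "json": {
        "field": "message",
        "target_field": "parsed",
        "ignore_failure": true
      }
    },
    {
      "dot_expander": {
        "field": "log.level",
        "path": "parsed",
        "ignore_failure": true
      }
    },
    {
      "set": {
        "if": "ctx.parsed?.log?.level != null && ctx.log?.level == null",
        "field": "log.level",
        "copy_from": "parsed.log.level"
      }
    },
    {
      "remove": {
        "field": "parsed",
        "ignore_missing": true
      }
    }
  ]
}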

Resources

Quick demo video:

services.inventory.-.new.experience.-.demo.mp4

Which documentation set does this change impact?

Stateful and Serverless

Feature differences

It's going to be available on both.

What release is this request related to?

8.16

Collaboration model

The documentation team

Point of contact.

Main contact: @roshan-elastic

Stakeholders: @cauemarcondes @kpatticha

roshan-elastic (Contributor, Author)

Hey @mdbirnstiehl this is the update I mentioned.

@bmorelli25 wondering if this update could be prioritised? It relates to a problem that we can't build around in the product, so we want to provide user guidance in the docs (and link to it from the product).

We'll be linking to the docs from the product (by next Tuesday) but as it's a short-link, we can point them to the general docs without it being a blocker.

bmorelli25 (Member)

We should be able to prioritize this. @mdbirnstiehl is booked up for the near future, but I'll try to find someone else on the team to take this on.

bmorelli25 (Member)

Note to writer: This document also needs to be ported to stateful

dedemorton (Contributor)

@cauemarcondes @kpatticha Can you provide a decode_json_fields processor config example that shows how to pull out the encoded log.level and surface it in the main document (being careful not to overwrite the existing log.level)? Maybe a config that would work with the following example:

message:{"@timestamp":"2024-07-31T08:26:45Z","log.level":"info","message":"proxying API request to http://opbeans:3000"}

dedemorton (Contributor)

@mdbirnstiehl Do you think this new content belongs in the topic about adding the service name or a new topic called something like "Add a log level to logs"? I'm leaning towards a separate topic because it seems like we are mixing things that logically don't belong together. If I add this info to the existing topic, I'll need to completely restructure it. I also wonder if folks who aren't using the new experience might still want to know how to add (or decode) the log level. WDYT?

cauemarcondes (Contributor)

> Can you provide a decode_json_fields processor config example that shows how to pull out the encoded log.level and surface it in the main document (being careful not to overwrite the existing log.level)? Maybe a config that would work with the following example:

Hi @dedemorton, I think it's best to ask the @elastic/obs-ux-logs-team team for an official guide on how to add the JSON processor for both the Auto-detect logs and metrics and Stream log files getting started guides. @flash1293 This is about what we talked about a couple of weeks ago when you helped me parse the log messages.

tonyghiani

> Can you provide a decode_json_fields processor config example that shows how to pull out the encoded log.level and surface it in the main document (being careful not to overwrite the existing log.level)?

@dedemorton you can use the pre-installed logs@json-pipeline pipeline to parse JSON logs. It is installed by ES by default and takes care of all the steps for parsing a JSON-like message.

Please note that the strategy the pipeline follows after parsing is add_to_root_conflict_strategy: merge, which means existing parsed fields will be overwritten.

Here is how you can use it:

{
  "my-pipeline": {
    "processors": [
      {
        "pipeline": {
          "name": "logs@json-pipeline",
          "ignore_missing_pipeline": true
        }
      }
    ]
  }
}
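
Before wiring this up, you could sanity-check it with the simulate API (a hedged example; it assumes the wrapper pipeline above has actually been created as my-pipeline):

POST _ingest/pipeline/my-pipeline/_simulate
{
  "docs": [
    {
      "_source": {
        "message": "{\"@timestamp\":\"2024-07-31T08:26:45Z\",\"log.level\":\"info\",\"message\":\"proxying API request to http://opbeans:3000\"}"
      }
    }
  ]
}

If parsing succeeds, the response should show log.level promoted to the root of the document.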

And this is the whole definition of logs@json-pipeline:

{
  "logs@json-pipeline": {
    "processors": [
      {
        "rename": {
          "if": "ctx.message instanceof String && ctx.message.startsWith('{') && ctx.message.endsWith('}')",
          "field": "message",
          "target_field": "_tmp_json_message",
          "ignore_missing": true
        }
      },
      {
        "json": {
          "if": "ctx._tmp_json_message != null",
          "field": "_tmp_json_message",
          "add_to_root": true,
          "add_to_root_conflict_strategy": "merge",
          "allow_duplicate_keys": true,
          "on_failure": [
            {
              "rename": {
                "field": "_tmp_json_message",
                "target_field": "message",
                "ignore_missing": true
              }
            }
          ]
        }
      },
      {
        "dot_expander": {
          "if": "ctx._tmp_json_message != null",
          "field": "*",
          "override": true
        }
      },
      {
        "remove": {
          "field": "_tmp_json_message",
          "ignore_missing": true
        }
      }
    ],
    "_meta": {
      "description": "automatic parsing of JSON log messages",
      "managed": true
    },
    "version": 12,
    "deprecated": false
  }
}
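
As a side note on how this pipeline works: the dot_expander step expands flat dotted keys (such as the "log.level" key that JSON parsing adds to the root) into nested objects. Roughly (a hand-written illustration, not actual pipeline output):

{ "log.level": "info" }

becomes

{ "log": { "level": "info" } }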

mdbirnstiehl (Contributor)

> @mdbirnstiehl Do you think this new content belongs in the topic about adding the service name or a new topic called something like "Add a log level to logs"? I'm leaning towards a separate topic because it seems like we are mixing things that logically don't belong together. If I add this info to the existing topic, I'll need to completely restructure it. I also wonder if folks who aren't using the new experience might still want to know how to add (or decode) the log level. WDYT?

Yeah, I agree that it makes more sense to me as a separate topic.

dedemorton (Contributor)

Spoke with @mdbirnstiehl. He is going to take over this issue because he has time now. Plus, as our logs guy™, he knows more about this subject. Thanks Mike!

mdbirnstiehl (Contributor)

Hi @roshan-elastic, I'm not sure I completely understand the scenario of having to declare a log level for logs that don't contain log levels at all.

I understand the case where there is a log.level present but it's not parsed; there we can use the logs@json-pipeline described by @tonyghiani to parse the logs.

With the standalone agent link, wouldn't that just apply to the Agent's logs and not to logs from events that are getting indexed? Are we wanting to create arbitrary log levels for logs that don't contain log levels? I'm not sure if that would create meaningful data or graphs.

roshan-elastic (Contributor, Author)

Hey @mdbirnstiehl,

Good timing :)

We're actually going to change:

Current

Log Rate: currently requires logs to have log.level in order to be included
Log Error %: currently requires logs to have log.level in order to be included

Changing to

Log Rate: count() / [PERIOD_IN_MINUTES]
Log Error Rate: count(kql='log.level: "error" OR log.level: "ERROR"') / [PERIOD_IN_MINUTES]

@iblancof might be the best contact for this but we'll be making these changes as part of this epic:

For example, here

mdbirnstiehl (Contributor)

Hi @roshan-elastic and @iblancof,

Would it make sense to add a note or section in the new experience page about the Log Rate and Log Error Rate formulas and the need to parse the log.level for the Log Error Rate? We could then point to some examples, like the page on extracting log.level from unstructured or semi-structured log data, or extracting a log level from k8s logs?

I think it might fit better on the new experiences page rather than the service.name page. WDYT?

roshan-elastic (Contributor, Author)

Hey @mdbirnstiehl - thanks for reminding me about this.

We're actually about to change how we handle the log rate and log error %:

Log Rate

Now:
count(kql='log.level: *') / [PERIOD_IN_MINUTES]

Changing to:
count() / [PERIOD_IN_MINUTES]

Log Error %

Now:
count(kql='log.level: "error" OR log.level: "ERROR"') / [PERIOD_IN_MINUTES]

Changing to (renamed Log Error Rate):
count(kql='log.level: "error" OR log.level: "ERROR" OR error.log.level : "error"') / [PERIOD_IN_MINUTES]

This means that the log charts in the service views should be much less likely to be empty:

(screenshot)

> We could then point to some examples, like the page on extracting log.level from unstructured or semi-structured log data, or extracting a log level from k8s logs?

Makes sense!

> I think it might fit better on the new experiences page rather than the service.name page. WDYT?

Yeah, I think it makes sense to show this on the new experience page, but perhaps it makes sense to link the 'how' from the service logs doc so that all of the information about how to declare your services lives in one place?

In short:

  • Call out that the UI won't work properly without declaring the service ==> experience page (and link through to the service logs page)
  • How to declare data against your services ==> service logs page

WDYT?

mdbirnstiehl (Contributor)

@roshan-elastic sounds good, I'll start making the updates!

mdbirnstiehl closed this as not planned on Oct 3, 2024