Guidance on product/project name inside attribute/metric name #608

lmolkova · 2023-12-13T17:36:23Z

We should provide guidance on whether the product/project name should be fully qualified (or not). We should keep the same pattern across the different semconvs.
Examples of inconsistencies:

db.cosmosdb (Azure), db.dynamodb (AWS), db.couchdb (under the Apache umbrella), db.spanner (GCP), db2 (IBM), hanadb (SAP HANA), etc., and the corresponding values in the db.system enum
we also have aws_sqs, gcp_pubsub, azure_servicebus in the messaging.system enum

We should provide guidance on how to represent multiple words:

lowercase product name (couchdb) (#Codegen: proper casing for multiword attribute/metric names #599)
multiple namespaces vs single namespace (e.g. messaging.aws_sqs.destination.custom_attr vs messaging.aws.sqs.destination.custom_attr)
acronyms/abbreviations guidance (Add guidance on abbreviation usage in semantic attribute names #602)

We should require consistency across signals:

Redis can be used as messaging system and should have exactly the same representation in db.system and messaging.system
If Azure Service Bus is instrumented on the service side to report telemetry to end users, it should use the same value in resource attributes (Define cloud.platform and/or rename it #609)
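The cross-signal requirement above could be checked mechanically. Below is a minimal sketch, assuming the enum members are available as plain dictionaries keyed by product; the function name and the sample data (including the `redis_streams` value) are made up for illustration and are not taken from the actual semconv registry.

```python
def inconsistent_products(enums: dict) -> dict:
    """Return products that are spelled differently in different enums.

    `enums` maps an enum name (e.g. "db.system") to a dict of
    product display name -> enum value. Hypothetical input shape.
    """
    values_by_product = {}
    for members in enums.values():
        for product, value in members.items():
            values_by_product.setdefault(product, set()).add(value)
    # A product is inconsistent when more than one distinct value was seen.
    return {p: v for p, v in values_by_product.items() if len(v) > 1}


# Illustrative data only: the mismatch is invented to show the check firing.
sample = {
    "db.system": {"Redis": "redis", "Cassandra": "cassandra"},
    "messaging.system": {"Redis": "redis_streams"},
}
mismatches = inconsistent_products(sample)
```

Here `mismatches` contains only the Redis entry, since Cassandra appears in a single enum and therefore cannot disagree with itself.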

Misc discrepancies:

mssqlcompact (Microsoft SQL Server Compact) should probably become mssql_compact

[Update]
Other attributes that have the same problem:

cloud.platform

alanwest · 2024-02-09T00:51:24Z

With regard to representing multiple words, can a product's branding provide a guide? For example, use dynamodb, couchdb, and cosmos_db because they are respectively branded DynamoDB, CouchDB, and Cosmos DB.
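One way to read the branding suggestion is as a simple rule: lowercase the brand and turn only the spaces between words into underscores. A minimal sketch under that assumption (the helper name is hypothetical, not part of any tooling):

```python
def name_from_branding(brand: str) -> str:
    """Derive a snake_case name from a brand name.

    Only whitespace counts as a word boundary; internal capitals
    (as in DynamoDB or CouchDB) do not split the word.
    """
    return "_".join(brand.lower().split())


examples = {b: name_from_branding(b) for b in ("DynamoDB", "CouchDB", "Cosmos DB")}
```

This reproduces the three names from the comment; brands with punctuation or accented characters would need extra handling that is not covered here.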

trask · 2024-02-09T15:47:32Z

It would be nice if whatever the enum is, e.g. cosmosdb, that is also the namespace, e.g. db.cosmosdb.*, for product-specific attributes
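A small sketch of that idea: the enum value is reused verbatim as the namespace segment, so a product-specific attribute can be built from, and checked against, the same string. The helper names are hypothetical, and `custom_attr` is the placeholder suffix used earlier in this issue.

```python
def product_attribute(domain: str, system: str, suffix: str) -> str:
    """Build a product-specific attribute name from the enum value."""
    return f"{domain}.{system}.{suffix}"


def uses_system_namespace(attribute: str, domain: str, system: str) -> bool:
    """Check that an attribute lives under the namespace matching the enum value."""
    return attribute.startswith(f"{domain}.{system}.")


attr = product_attribute("db", "cosmosdb", "custom_attr")
```

With this rule, `db.cosmosdb.custom_attr` passes the check for the enum value `cosmosdb`, while an attribute under `db.cosmos_db.*` would not.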

trask · 2024-02-09T15:49:50Z

do we think we need mssqlcompact? that doesn't seem like something we'd necessarily know from the client side, and on the server side it could potentially be a resource attribute describing the "edition"

KalleOlaviNiemitalo · 2024-02-09T16:35:38Z

SQL Server Compact runs in-process rather than as a network service, so yes, the application should know it's using that.
It's no longer supported by Microsoft, but open-telemetry/opentelemetry-specification#3105 shows it's still used.

lmolkova · 2024-06-20T03:56:34Z

Based on the messaging SIG discussion on 6/13, none of the controversial systems are part of the initial stability scope (Kafka + RabbitMQ), so removing the stability blocker label.

lmolkova · 2025-01-10T23:45:53Z

Been discussing it in the scope of #1734. There are competing consistency goals when it comes to project names. Let's explore them:

1. Stay consistent with external project/product/system/etc name whenever possible

The guidance would be:

Use a registered trademark (wordmark) or another 'official' name
When it's ambiguous (e.g. caché), it needs to be disambiguated, for example, prefixed with a company name (intersystems_cache)

Non-controversial examples:

mongodb
postgresql
cassandra
ibm.mq
oracle.db (oracledb, oracle.database and other possible variations)

Controversial examples

informix - it's a product that was acquired by IBM but had a history before that, has a unique name, and is registered as a trademark. The controversy is that it coexists with ibm.mq
sap.hana (trademark on SAP HANA) and maxdb (trademark on MaxDB)
cloud_spanner and gcp.pubsub - the former is a trademark, the latter is ambiguous and has to be qualified
s3 - it's a trademark. Controversy is that we use aws.s3 today and have a root aws namespace for cross-AWS attributes.

2. Stay consistent within semantic conventions

The guidance would be:

Product name should be qualified with a company/division name with the following exceptions:

company and product name are the same (or similar).
OSS/community-driven projects that don't belong to a company

Non-controversial examples:

mongodb - company name is the same as product name
elasticsearch - elastic is already part of elasticsearch, let's use common sense
cassandra - apache project
ibm.mq

Controversial examples

ibm.informix - informix was a product before IBM acquired it

Obviously wrong examples:

oracle.mysql - TIL that MySQL belongs to Oracle.
broadcom.spring - Spring deserves a root namespace.

I personally prioritize consistency within semantic conventions over strict consistency with a trademark. Given that products get acquired, renamed, and evolve, we won't be able to stay fully consistent.

I'd prefer option 1.7 (bullets are ordered with descending priority):

Avoid ambiguity - always qualify ambiguous names (gcp.pubsub, ibm.mq, oracle.db - never pubsub, mq, oracle)
Use well-recognized and unique projects names as is (spring, mysql, mssql, postgresql) regardless of company affiliations
Qualify product name with the company/division/etc in other cases. Qualify cloud services with the cloud provider name.
When defining system name/attribute for a product, check if there are system names/attributes for other products from this company. Follow the same pattern.

There are plenty of edge cases and we'd need to use our judgement on a case-by-case basis. E.g. is informix a well-recognized product? Then it should fall under p2 and be informix, otherwise it falls under p3 and becomes ibm.informix.
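The priority order in "option 1.7" can be sketched as a small decision function. This is only an illustration: the `ambiguous` and `well_recognized` flags stand for the human judgement calls described above, the function name is made up, and the fourth bullet (follow the pattern already used for the same company) is not modeled.

```python
from typing import Optional


def system_name(product: str, owner: Optional[str], *,
                ambiguous: bool, well_recognized: bool) -> str:
    """Apply the first three bullets in descending priority."""
    if ambiguous:
        # p1: ambiguous names are always qualified.
        if owner is None:
            raise ValueError(f"ambiguous name {product!r} needs a qualifier")
        return f"{owner}.{product}"
    if well_recognized:
        # p2: well-recognized, unique names are used as is.
        return product
    if owner is not None:
        # p3: otherwise qualify with the company/division/cloud provider.
        return f"{owner}.{product}"
    # No owner to qualify with (e.g. a community project).
    return product


pubsub = system_name("pubsub", "gcp", ambiguous=True, well_recognized=False)
mysql = system_name("mysql", "oracle", ambiguous=False, well_recognized=True)
informix_p2 = system_name("informix", "ibm", ambiguous=False, well_recognized=True)
informix_p3 = system_name("informix", "ibm", ambiguous=False, well_recognized=False)
```

The informix case shows why the flags matter: the same product comes out as `informix` or `ibm.informix` depending solely on whether it is judged well-recognized.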

trask · 2025-01-11T02:12:34Z

is Xyz a well-recognized product

this is a tough thing to decide, do you think it's possible to only use "avoid ambiguity", so that as long as a name is not ambiguous (maybe relying on ownership of a domain under a common TLD or a high Google search ranking?), we'd allow xyz.* to be a top-level namespace

not sure how this would work with hive, geode, derby (apache projects) though...

lmolkova · 2025-01-11T02:47:10Z

I'm thinking about azure.
Leaving az vs azure aside, I see value in having az.cosmosdb, az.servicebus, az.blob vs cosmosdb, servicebus, az.blob:

when typing query, users don't need to remember where to use az prefix
everything under az would be governed by azure if we had decentralized semconv
az.* things use az attributes (including common ones like az.namespace)

From this perspective, I don't see a difference between IBM DB2 that can be hosted anywhere and Azure CosmosDB that's a cloud service and can run only on Azure. So if we use az for the latter, why don't we use ibm for the former?

trask · 2025-01-12T03:59:02Z

I think we can justify mysql not being oracle.mysql given it's a TLD: https://mysql.com

trask · 2025-01-13T16:35:22Z

I think we can justify mysql not being oracle.mysql given it's a TLD: https://mysql.com

Similar justification for spring as top-level namespace due to it being hosted at https://spring.io

github-actions bot assigned joaopgrassi Dec 13, 2023

lmolkova mentioned this issue Dec 13, 2023

Define additional Azure messaging attributes #572

Merged

3 tasks

pyohannes added this to Spec: Messaging Semantics Feb 1, 2024

github-project-automation bot moved this to V1 - Stable Semantics in Spec: Messaging Semantics Feb 1, 2024

pyohannes added the messaging-stability-blocker label Feb 1, 2024

trask added this to Database Client Semantic Conventions Feb 7, 2024

pyohannes unassigned joaopgrassi Feb 8, 2024

lmolkova mentioned this issue Mar 27, 2024

LLM Semantic Conventions: Initial PR #825

Merged

3 tasks

trask moved this to Post Stability in Database Client Semantic Conventions Apr 24, 2024

lmolkova mentioned this issue May 9, 2024

Database: review db.system list #1023

Closed

lmolkova moved this from V1 - Stable Semantics to Post-stability in Spec: Messaging Semantics Jun 20, 2024

lmolkova removed the messaging-stability-blocker label Jun 20, 2024

This was referenced Dec 18, 2024

[chore] Update vertex_ai to vertex.ai #1684

Closed

Change azure_ and az. to azure. across all conventions #1698

Draft

lmolkova mentioned this issue Jan 10, 2025

Consistent naming: db.system to db.system.name, namespace constants, remove db from system-specific names #1734

Open

3 tasks

lmolkova mentioned this issue Jan 17, 2025

Add system-specific naming guidance #1708

Open

3 tasks
