Provide a way for Logs SDK to sample based on span context #3207

aabmass · 2025-01-24T17:39:40Z

The OpenAI instrumentation currently only records events if the span is recording:

opentelemetry-python-contrib/instrumentation-genai/opentelemetry-instrumentation-openai-v2/src/opentelemetry/instrumentation/openai_v2/patch.py

Lines 59 to 63 in 2756c1e

    
           if span.is_recording(): 
        
               for message in kwargs.get("messages", []): 
        
                   event_logger.emit( 
        
                       message_to_event(message, capture_content) 
        
                   )

This makes sense, since event recording can be quite expensive and we can piggyback on the sampling decision. However it feels like the Logging API should be more integrated and instrumentation shouldn't need to do these checks. Most logging libraries also have an API to check if recording like Logger.isEnabledFor() and/or support lazy evaluation of log fields. So two questions are

Should GenAI events be recorded only when a span is recording?
Should the instrumentation be in charge of this or should the Logging SDK provide its own API to check for recording or lazily evaluate log entry fields.

The text was updated successfully, but these errors were encountered:

drewby · 2025-01-27T08:34:11Z

We are currently making an implicit assumption that events are recorded as part of a span, but the specification does not explicitly require this—though all the examples at the end show events within spans. It might be worth updating the specification to clarify this expectation either way.

xrmx · 2025-01-27T09:58:58Z

My thinking is that traces and logs are two different signals and so they should be ~independent.

aabmass · 2025-01-27T16:30:34Z

I agree it's cleaner to keep them separate when writing instrumentation. Maybe the logging SDK can provide a means to tie span sampling to event sampling. We have some precedent here in Exemplars where the default ExemplarFilter is TraceBased.

aabmass · 2025-01-31T17:57:31Z

Discussed in SIG and we agreed Logs/events SDK should have way to configure this behavior. Changing to a feature request and I think we may need a spec discussion

aabmass added gen-ai Related to generative AI instrumentation labels Jan 24, 2025

github-project-automation bot added this to GenAI Semantic Conventions and Instrumentation libraries Jan 24, 2025

github-project-automation bot moved this to New issues in GenAI Semantic Conventions and Instrumentation libraries Jan 24, 2025

aabmass mentioned this issue Jan 24, 2025

VertexAI emit user, system, and assistant events #3203

Merged

10 tasks

aabmass mentioned this issue Jan 28, 2025

OpenAI instrumentation should capture events regardless of span recording #3217

Closed

lmolkova moved this from New issues to Todo in GenAI Semantic Conventions and Instrumentation libraries Jan 30, 2025

aabmass changed the title ~~Should GenAI events be recorded only when a span is recording?~~ Provide a way for Logs SDK to sample based on span context Jan 31, 2025

aabmass added the feature-request label Jan 31, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Provide a way for Logs SDK to sample based on span context #3207

Provide a way for Logs SDK to sample based on span context #3207

aabmass commented Jan 24, 2025

drewby commented Jan 27, 2025

xrmx commented Jan 27, 2025

aabmass commented Jan 27, 2025

aabmass commented Jan 31, 2025

Provide a way for Logs SDK to sample based on span context #3207

Provide a way for Logs SDK to sample based on span context #3207

Comments

aabmass commented Jan 24, 2025

drewby commented Jan 27, 2025

xrmx commented Jan 27, 2025

aabmass commented Jan 27, 2025

aabmass commented Jan 31, 2025