This PR talks about "chunks". What is the definition of "chunk"? Is it only assistant content? Does it include reasoning when the model distinguishes reasoning from response text? Does it include other notifications from the service, such as a function call request (or part of one)? Etc. My assumption is that a "chunk" is any packet of data from the LLM, so that each update produced by a streaming implementation, regardless of what that update contains, counts as a "chunk".
Originally posted by @stephentoub in #3377 (comment)
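The broad definition assumed above can be sketched as follows. This is a minimal illustration, not any real API: the type and field names (`StreamingUpdate`, `kind`, `payload`) and the update kinds are hypothetical, chosen only to show that under this reading every streamed update is a chunk regardless of what it carries.

```python
from dataclasses import dataclass
from typing import Iterable

# Hypothetical update kinds; these names are assumptions for illustration,
# not taken from any real streaming API.
TEXT, REASONING, TOOL_CALL = "text", "reasoning", "tool_call"

@dataclass
class StreamingUpdate:
    kind: str     # what this update carries (text, reasoning, tool call, ...)
    payload: str  # the content fragment itself

def count_chunks(stream: Iterable[StreamingUpdate]) -> int:
    # Under the broad definition, every update is a chunk,
    # no matter what kind of content it contains.
    return sum(1 for _ in stream)

updates = [
    StreamingUpdate(REASONING, "thinking..."),
    StreamingUpdate(TEXT, "Hello"),
    StreamingUpdate(TOOL_CALL, 'get_weather({"city": "Paris"})'),
    StreamingUpdate(TEXT, ", world"),
]
print(count_chunks(updates))  # 4: reasoning and tool-call updates count too
```

Under a narrower definition (assistant text only), the same stream would yield 2 chunks, which is why pinning down the term matters.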