Skip to content

feat: Add sync streaming support for Anthropic instrumentation#4155

Open
vasantteja wants to merge 45 commits intoopen-telemetry:mainfrom
vasantteja:anthropic-sync-streaming
Open

feat: Add sync streaming support for Anthropic instrumentation#4155
vasantteja wants to merge 45 commits intoopen-telemetry:mainfrom
vasantteja:anthropic-sync-streaming

Conversation

@vasantteja
Copy link
Contributor

@vasantteja vasantteja commented Feb 1, 2026

Description

This PR adds sync streaming support for the Anthropic instrumentation. It enables telemetry capture for:

  1. Messages.create(stream=True) - Streaming responses via the create method with stream parameter
  2. Messages.stream() - The dedicated streaming method that returns a MessageStreamManager

Key changes:

  • Added StreamWrapper class to wrap Stream[RawMessageStreamEvent] and extract telemetry from streaming chunks
  • Added MessageStreamManagerWrapper to wrap MessageStreamManager context manager
  • Added MessageWrapper for non-streaming response telemetry extraction
  • Renamed MessageCreateParams to MessageRequestParams to reflect broader API coverage
  • Modified messages_create to use manual lifecycle management (start_llm/stop_llm) instead of context manager to support both streaming and non-streaming

Fixes #3949 partially.

Type of change

Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • This change requires a documentation update

How Has This Been Tested?

Added comprehensive tests for sync streaming functionality:

  • test_sync_messages_create_streaming - Tests streaming with context manager
  • test_sync_messages_create_streaming_iteration - Tests direct iteration without context manager
  • test_sync_messages_create_streaming_connection_error - Tests error handling for streaming
  • test_sync_messages_stream_basic - Tests Messages.stream() method
  • test_sync_messages_stream_with_params - Tests stream with additional parameters (temperature, top_p, top_k)
  • test_sync_messages_stream_token_usage - Tests token usage capture in streaming
  • test_sync_messages_stream_connection_error - Tests error handling for stream method

All tests use VCR cassettes for reproducible HTTP interaction replay.

Does This PR Require a Core Repo Change?

  • Yes. - Link to PR:
  • No.

Checklist:

See contributing.md for styleguide, changelog guidelines, and more.

  • Followed the style guidelines of this project
  • Changelogs have been updated
  • Unit tests have been added
  • Documentation has been updated

- Add support for Messages.create(stream=True) with StreamWrapper
- Add support for Messages.stream() with MessageStreamManagerWrapper
- Add MessageWrapper for non-streaming response telemetry
- Rename MessageCreateParams to MessageRequestParams
- Add comprehensive tests for sync streaming functionality
- Add type: ignore[arg-type] for Union type narrowing in messages_create
- Add type: ignore[return-value] for wrapper return types
- Add type: ignore[return-value] for __exit__ returning None
@vasantteja vasantteja force-pushed the anthropic-sync-streaming branch from 99a2596 to 504d0df Compare February 1, 2026 16:57
@vasantteja vasantteja removed their assignment Feb 5, 2026
@lmolkova
Copy link
Member

lmolkova commented Feb 8, 2026

tagging @anirudha who was interested to review the PR :)

@anirudha
Copy link

anirudha commented Feb 8, 2026

Thanks. Taking a look today

…r handling

- Introduce constants for provider name and cache token attributes.
- Normalize stop reasons and aggregate cache token fields in MessageWrapper and StreamWrapper.
- Enhance tests to validate input token aggregation and stop reason normalization.
- Update cassettes for new request and response structures in streaming scenarios.
@vasantteja vasantteja removed their assignment Feb 9, 2026
…d consistency

- Simplify constant definitions and normalize function calls in utils.py.
- Enhance test cases by removing unnecessary line breaks and improving formatting.
- Ensure consistent usage of type hints and comments in test functions.
@vasantteja vasantteja removed their assignment Feb 9, 2026
- Update the pylint directive to disable too-many-arguments warning for better clarity.
- Maintain consistency in function signature and improve code readability.
Copy link

@anirudha anirudha left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Tests all pass locally. Nice work overall — the wrapper separation is clean. One bug to fix (double finalize), rest are suggestions.

Note: conftest.py isn't in this diff so I can't leave a line comment, but scrub_response_headers is a no-op and all new cassettes leak anthropic-organization-id: 455ea6be-bd92-4199-83ec-0c6b39c5c169. Worth scrubbing that or adding it to filter_headers.

Also, the PR description says Fixes #3949 but async streaming isn't covered. Totally fine to scope this to sync only, but Fixes will auto-close the issue on merge. Maybe Partially addresses #3949 instead?

…tion

- Update test cases to validate streaming behavior with various parameters, including token usage and stop reasons.
- Introduce new cassettes for different scenarios, ensuring comprehensive coverage of streaming interactions.
- Refactor existing tests for clarity and consistency in structure and assertions.
…ocals in test_stream_wrapper_finalize_idempotent function
…Ds, timestamps, and token usage across various test cases. Refine content capture logic and ensure consistency in message formats, including adjustments to event data and headers for improved clarity and accuracy.
Copy link
Member

@MikeGoldsmith MikeGoldsmith left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🚀

…oved type safety. Replace 'Any' with 'object' in several function signatures and class attributes. Introduce logging for error handling in MessagesStreamWrapper to enhance instrumentation reliability.
… clarity and safety. Update function signatures to use specific types instead of 'object', including changes to parameters in extract_params, get_input_messages, and get_system_instruction. Refactor messages_create to ensure correct type handling for streaming and non-streaming responses. Additionally, streamline message handling in MessagesStreamWrapper for better performance and reliability.
…function signatures in `messages_extractors.py` and `wrappers.py` to include specific types, improving clarity and reliability. Introduce handling for `None` values in `get_input_messages` and `get_system_instruction`. Refactor `MessagesStreamWrapper` to better manage usage updates and ensure correct type handling for streaming responses. Add new test cases for aggregating cache tokens and handling streaming errors.
…. Simplify assertion statements by removing unnecessary parentheses, enhancing code clarity in cache token tests.
@tammy-baylis-swi tammy-baylis-swi moved this from Approved PRs that need fixes to Approved PRs in @xrmx's Python PR digest Feb 26, 2026
return False
if mode == ContentCapturingMode.EVENT_ONLY and not should_emit_event():
return False
return True
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@DylanRussell do we have this util already?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@aabmass We don't have this in util. This is specifically used for understanding whether we should we do the work of converting messages/content blocks into structured parts. I couldn't see any function which is having same functionality as this one in util.

@tammy-baylis-swi tammy-baylis-swi moved this from Approved PRs to Reviewed PRs that need fixes in @xrmx's Python PR digest Feb 26, 2026
@tammy-baylis-swi tammy-baylis-swi moved this from Reviewed PRs that need fixes to Approved PRs that need fixes in @xrmx's Python PR digest Feb 26, 2026
…equirements.oldest.txt for compatibility improvements.
…s for improved clarity and type safety. Update extract_usage_tokens function to return UsageTokens instead of a tuple, and adjust related invocations in MessageWrapper and MessagesStreamWrapper accordingly.
Copy link
Member

@Cirilla-zmh Cirilla-zmh left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, thank you!

@vasantteja vasantteja removed their assignment Feb 27, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Status: Approved PRs that need fixes

Development

Successfully merging this pull request may close these issues.

Add OpenTelemetry instrumentation for the Anthropic Claude Python SDK