Add `logfire.instrument_claude_agent_sdk()` by alexmojaki · Pull Request #1799 · pydantic/logfire

alexmojaki · 2026-03-24T19:09:23Z

Summary

Adds logfire.instrument_claude_agent_sdk() for native OpenTelemetry instrumentation of the Claude Agent SDK
Monkey-patches ClaudeSDKClient to create conversation, turn, and tool spans
Injects tracing hooks (PreToolUse, PostToolUse, PostToolUseFailure) for tool call tracing
Records usage metrics, cost, session metadata on the root conversation span
Uses threading.local() for context propagation (workaround for anyio breaking contextvars)

Test plan

45 tests in tests/otel_integrations/test_claude_agent_sdk.py covering all code paths
100% coverage on logfire/_internal/integrations/claude_agent_sdk.py
test_logfire_api.py updated to include the new integration
CI green (pyright, ruff, pytest all passing)

🤖 Generated with Claude Code

Summary by cubic

Adds native logfire.instrument_claude_agent_sdk() tracing for the Claude Agent SDK using OTel GenAI semconv. Creates invoke_agent, chat {model}, and execute_tool {tool_name} spans with provider, messages (incl. reasoning), per‑turn usage/cost, conversation ID, and error details.

New Features
- Instruments claude_agent_sdk.ClaudeSDKClient via SDK hooks (scope suffix claude_agent_sdk); returns a context manager to revert; API exposed in logfire and logfire_api (stub uses nullcontext()).
- Span model: root invoke_agent; sibling chat {model} per assistant turn; sibling execute_tool {tool_name} for tool calls.
- Records gen_ai.provider.name, gen_ai.input.messages, gen_ai.output.messages, gen_ai.system_instructions, reasoning via ReasoningPart (ThinkingBlock), per‑turn gen_ai.usage.* (incl. cache), gen_ai.response.model, gen_ai.conversation.id, operation.cost, and error.type; sets error level on failures.
- Captures prompt from .query(); only ClaudeSDKClient is instrumented; instances created after instrumentation get full tool tracing. Docs updated to use logfire.instrument_claude_agent_sdk().
Bug Fixes
- Idempotent instrumentation and clean uninstrument; guard duplicate hook injection (incl. shared/default options).
- Correct token/cache counts; reset prompt per call; set error level on failed turns/tool calls; cleanup orphaned tool spans.
- Stability/typing: import claude_agent_sdk at module import; use HookMatcher; isinstance checks for SDK block types; replace getattr with direct attribute access on typed SDK blocks; tighten types; remove dead code; add coverage pragmas; fix sys.modules leak.
- Tests: cassette‑based integration via a fake claude CLI with --record-claude-cassettes; add dev dependency claude-agent-sdk; CI/resource warning fixes.

^{Written for commit 09a9ce9. Summary will update on new commits.}

…remove unused collected list

…_logfire_prompt assignments

…esults, hook edge cases

…th no span

…text items

Reset _logfire_prompt at the start of patched_query() so it doesn't carry over from a previous call when prompt is None. Also remove unused scaffolding: _logfire_start_time, _logfire_streamed_input, _next_start_time, and mark_next_start() — all stored but never read.

docs/integrations/llms/claude-agent-sdk.md

logfire-api/logfire_api/__init__.py

logfire/_internal/integrations/claude_agent_sdk.py

tests/otel_integrations/test_claude_agent_sdk.py

cloudflare-workers-and-pages · 2026-03-25T11:22:01Z

Deploying logfire-docs with Cloudflare Pages

Latest commit:	`b096c58`
Status:	✅ Deploy successful!
Preview URL:	https://b46dbff3.logfire-docs.pages.dev
Branch Preview URL:	https://instrument-claude-agent-sdk.logfire-docs.pages.dev

View logs

original_init sets self.options to ClaudeAgentOptions() when no options are passed. The kwargs extraction would miss this, skipping hook injection.

tests/otel_integrations/test_claude_agent_sdk.py

…pec.md

logfire/_internal/integrations/claude_agent_sdk.py

…tion Tool responses from the SDK are already structured data. Removing the str() conversion lets them serialize as proper JSON objects in both the execute_tool span attribute and conversation history input messages. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…hook Consistency with post_tool_use_hook and post_tool_use_failure_hook which already access input_data inside the error handler. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Prevents stale references (including conversation history) from persisting in thread-local storage after receive_response completes. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Chat spans now include gen_ai.system_instructions so each span is self-contained. Span name is updated to 'chat {model}' via update_name when the model is known. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Cross-async-context detach always fails with OTel error logs, and keeping context attached doesn't help (tool code runs in yet another async context). Now: attach root context, _start() the span to capture the parent link, immediately detach. No context left attached means no cross-context detach errors. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

The model comes from AssistantMessage (not ResultMessage), so we store it on the turn tracker and set it on the root span when the result arrives. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Replaces scattered thread-local attributes and the global _active_tool_spans dict with a single _ConversationState object. This fixes cross-thread interference where two conversations would share the same _active_tool_spans dict. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…fire into instrument-claude-agent-sdk

alexmojaki added 19 commits March 24, 2026 17:16

add claude agent sdk integration with tests

51af258

expose instrument_claude_agent_sdk on public API

1e23082

update claude agent sdk docs to use native logfire integration

782a2db

fix _clear_active_tool_spans to destructure (span, token) tuples and …

0d93e55

…remove unused collected list

add instrument_claude_agent_sdk to logfire-api test

16ff849

guard instrument_claude_agent_sdk test on package availability

e937488

fix pyright: use importlib.import_module for claude_agent_sdk guard

3b1324a

fix cache token counts: use to_int instead of to_float

af36aa8

guard against duplicate hook injection on shared options

b6cfe41

fix sys.modules leak: save/restore previous module, remove redundant …

b6b4226

…_logfire_prompt assignments

add pragma: no cover for defensive and CI-untestable paths

b5b6cf3

add coverage tests for edge cases and utility branches

2908c5f

add tests for remaining branch partials: no-content turns, no-usage r…

0d15f39

…esults, hook edge cases

add tests for final branch partials: loop exits, TurnTracker.close wi…

f167e39

…th no span

add pragma: no branch for loop-exit branch partials

a4c344d

eliminate branch partials: use setdefault, add test for non-dict non-…

ce6897a

…text items

add test for unknown message type to cover elif-exit branch

515e66d

add pragma: no cover to logfire-api claude_agent_sdk stub

4514b23

This comment was marked as resolved.

Sign in to view