feat: cross-provider reasoning trace roundtrip by darinkishore · Pull Request #1396 · 0xPlaygrounds/rig

darinkishore · 2026-02-13T19:08:27Z

Stacked on #1395 — review/merge that first. The diff here includes both PRs; the incremental diff is only the provider-specific changes and integration tests.

Summary

Builds on #1395 to implement full reasoning trace preservation across all providers, ensuring reasoning content survives multi-turn conversations including tool call loops.

OpenAI Responses API: Threads msg_ IDs through streaming/non-streaming paths; reverts strip-reasoning workaround; emits MessageId from OutputItemDone; includes reasoning_encrypted_content automatically when reasoning is configured
Anthropic: Multi-block Reasoning → Thinking/RedactedThinking conversion with signature preservation; streaming RedactedThinking support via get_or_insert_with
Gemini: thought_signature roundtrip preservation; emits full Reasoning block (not just delta) when signature is present in streaming
xAI: Structured Reasoning ↔ Chat Completions reasoning_content conversion with encrypted/redacted mapping; fixes #[serde(rename)] → #[serde(rename_all)]
OpenRouter: Groups reasoning_details by ID preserving ordering; maps Encrypted/Redacted content types through Chat Completions format
Streaming agent loop bug fix: ReasoningDelta events now accumulate in a separate text buffer instead of being converted to unsigned Reasoning blocks — fixes Anthropic 400 "signature required" errors in multi-turn tool call loops

Closes #1147, closes #1146

Integration Tests

Four test suites covering both streaming and non-streaming code paths with per-provider coverage:

Streaming Tests

reasoning_roundtrip.rs (streaming) — Text-only 2-turn streaming conversation:

Turn 1: model.stream() → collect reasoning blocks/deltas + text
Turn 2: follow-up with reasoning in chat history → provider accepts

reasoning_tool_roundtrip.rs (streaming) — Reasoning + tool call via agent.stream_chat().multi_turn():

Agent receives weather prompt → model thinks → calls get_weather tool
Agent loop preserves [Reasoning, ToolCall] in chat history
Agent sends tool result → model responds with text referencing tool output
11 universal assertions per provider (no errors, tool invoked, reasoning before tool call, etc.)
Provider-specific assertions (OpenAI: Encrypted content, Anthropic: signatures)

Non-Streaming Tests

reasoning_roundtrip.rs (non-streaming) — Text-only 2-turn non-streaming conversation:

Turn 1: model.completion() → verify CompletionResponse.choice contains Reasoning blocks
Turn 2: follow-up with reasoning in chat history → provider accepts
Directly inspects reasoning content types in the structured response

reasoning_tool_roundtrip.rs (non-streaming) — Reasoning + tool call via agent.chat():

Non-streaming agent loop internally calls model.completion(), preserves reasoning via resp.choice.clone()
Verifies: no errors (400 = dropped reasoning), tool invoked, text references tool output
4 universal assertions per provider

Results

Provider	Streaming Text	Streaming Tool	Non-Streaming Text	Non-Streaming Tool
OpenAI	PASS	PASS	PASS	PASS
Anthropic	PASS	PASS	PASS	PASS
Gemini	PASS	PASS	PASS	PASS
OpenRouter	PASS	PASS	PASS	PASS
xAI	—	—	—	—

Test plan

cargo fmt clean
cargo clippy -p rig-core --tests — 0 warnings
cargo test -p rig-core --lib — 271 tests pass
Streaming reasoning_roundtrip integration tests pass (4/4 providers)
Streaming reasoning_tool_roundtrip integration tests pass (4/4 providers)
Non-streaming reasoning_roundtrip integration tests pass (4/4 providers)
Non-streaming reasoning_tool_roundtrip integration tests pass (4/4 providers)
xAI tests pending API key validation with grok-4-0725

Implement lossless reasoning trace preservation for all providers that require it, enabling multi-turn conversations with reasoning models (tool call loops, follow-up questions). Provider changes: - OpenAI Responses API: encrypted_content roundtrip, msg_ ID threading - Anthropic: redacted reasoning blocks, multi-block signature preservation - Gemini: thinking-part thought_signature roundtrip - xAI: structured reasoning conversion and streaming alignment - OpenRouter: reasoning_details response parsing and request emission Includes integration tests for all 5 providers (reasoning_roundtrip.rs). Implements 0xPlaygrounds#1147, 0xPlaygrounds#1146, 0xPlaygrounds#684

joshua-mo-143 · 2026-02-17T01:14:26Z

Looks like just needs cargo fmt then lgtm

darinkishore mentioned this pull request Feb 13, 2026

feat: Add support for more reasoning types #1147

Closed

1 task

darinkishore force-pushed the pr/reasoning-provider-roundtrip branch from 75c9270 to 5a93c3f Compare February 13, 2026 22:28

joshua-mo-143 mentioned this pull request Feb 16, 2026

refactor: typed reasoning content model #1395

Merged

4 tasks

darinkishore force-pushed the pr/reasoning-provider-roundtrip branch from 5a93c3f to 561e102 Compare February 17, 2026 00:44

chore: cargo fmt

891ed29

joshua-mo-143 added this pull request to the merge queue Feb 17, 2026

Merged via the queue into 0xPlaygrounds:main with commit 0977167 Feb 17, 2026
6 checks passed

github-actions Bot mentioned this pull request Feb 16, 2026

chore: release #1353

Merged

This was referenced Feb 25, 2026

bug(openai): StreamingDelta missing reasoning_content — reasoning silently dropped for OpenAI-compatible providers #1440

Closed

fix(openai): add reasoning_content to StreamingDelta for OpenAI-compatible providers #1441

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: cross-provider reasoning trace roundtrip#1396

feat: cross-provider reasoning trace roundtrip#1396
joshua-mo-143 merged 2 commits into0xPlaygrounds:mainfrom
darinkishore:pr/reasoning-provider-roundtrip

darinkishore commented Feb 13, 2026 •

edited

Loading

Uh oh!

joshua-mo-143 commented Feb 17, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

darinkishore commented Feb 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Integration Tests

Streaming Tests

Non-Streaming Tests

Results

Test plan

Uh oh!

joshua-mo-143 commented Feb 17, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

darinkishore commented Feb 13, 2026 •

edited

Loading