feat: add LLM trace diagnostics to session exports by Astro-Han · Pull Request #457 · Astro-Han/pawwork

Astro-Han · 2026-05-05T14:52:11Z

Summary

Adds lightweight LLM trace summaries for assistant model runs and includes them in local session exports.

Why

Session exports currently preserve final messages, parts, and token metadata, but they do not show where an output anomaly occurred: at the request boundary, in the AI SDK normalized stream, in the PawWork processor, or in the persisted message. This PR adds count-only diagnostics so issue #454 can be diagnosed from the export without raw prompts, raw model output, headers, API keys, or provider chunks.
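A minimal sketch of what a count-only trace could look like, to make the idea concrete. Every name here (`TraceSummary`, `buildTrace`, `locateAnomaly`, the field names, and the event type strings) is a hypothetical illustration and not the schema added in this PR; the point is only that counts per boundary are enough to see where output went missing.

```typescript
// Hypothetical shapes for illustration only; not PawWork's actual trace schema.
type StreamEvent = { type: string; text?: string }
type StoredPart = { type: string }

interface TraceSummary {
  request: { messageCount: number; toolCount: number }
  streamEventCounts: Record<string, number>
  storedPartCounts: Record<string, number>
}

// Count items by their `type` field; payloads such as `text` are never copied.
function countByType(items: ReadonlyArray<{ type: string }>): Record<string, number> {
  const counts: Record<string, number> = {}
  for (const item of items) {
    counts[item.type] = (counts[item.type] ?? 0) + 1
  }
  return counts
}

function buildTrace(
  messageCount: number,
  toolCount: number,
  events: ReadonlyArray<StreamEvent>,
  parts: ReadonlyArray<StoredPart>,
): TraceSummary {
  return {
    request: { messageCount, toolCount },
    streamEventCounts: countByType(events),
    storedPartCounts: countByType(parts),
  }
}

function total(counts: Record<string, number>): number {
  return Object.values(counts).reduce((sum, n) => sum + n, 0)
}

// Rough heuristic (an assumption, not the PR's logic): report the first
// boundary at which nothing was observed.
function locateAnomaly(trace: TraceSummary): "request" | "stream" | "processor-or-storage" | "none" {
  if (trace.request.messageCount === 0) return "request"
  if (total(trace.streamEventCounts) === 0) return "stream"
  if (total(trace.storedPartCounts) === 0) return "processor-or-storage"
  return "none"
}
```

Under these assumptions, a run that streamed events but stored no parts would be reported as `"processor-or-storage"`, which is the kind of question the export could not answer before.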

Related Issue

Closes #454
Closes #214

Human Review Status

Pending. A human should make the final merge decision after reviewing the final diff and verification evidence.

Review Focus

Please focus on the trace contract and privacy boundary: request summary allowlist, normalized stream event counts, final stored part counts, and whether assistant-message retention is the right persistence layer.
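For reviewers weighing the privacy boundary, the following sketch shows one way a request summary allowlist can work. The allowlisted keys, the request shape, and the `summarizeRequest` name are assumptions made for illustration; the actual allowlist lives in the PR's llm-trace files and may differ.

```typescript
// Hypothetical allowlist: only these scalar fields are copied into the summary.
const ALLOWED_KEYS = ["model", "temperature", "maxOutputTokens"] as const

type Scalar = string | number | boolean

// Build a summary from an arbitrary request object. Anything not explicitly
// allowlisted (prompts, messages, headers, API keys, tool bodies) is dropped;
// collections are reduced to counts.
function summarizeRequest(request: Record<string, unknown>): Record<string, Scalar> {
  const summary: Record<string, Scalar> = {}
  for (const key of ALLOWED_KEYS) {
    const value = request[key]
    if (typeof value === "string" || typeof value === "number" || typeof value === "boolean") {
      summary[key] = value
    }
  }
  const messages = request["messages"]
  summary["messageCount"] = Array.isArray(messages) ? messages.length : 0
  const tools = request["tools"]
  summary["toolCount"] =
    tools !== null && typeof tools === "object" ? Object.keys(tools).length : 0
  return summary
}
```

The design choice worth noting is that an allowlist fails closed: a new request field is omitted from the trace until someone adds it deliberately, whereas a denylist would leak it by default.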

Risk Notes

Additive assistant message diagnostics widen the v2 SDK schema and persist one count-only trace per assistant model run. The trace intentionally omits prompts, messages, headers, API keys, raw provider chunks, raw output text, and tool bodies. Generated SDK files changed after bun --cwd packages/sdk/js build. No desktop, packaging, updater, signing, path, shell, or permission behavior is changed.

How To Verify

LLM trace/export tests: 35 passed, 0 failed
Processor integration tests: 15 passed, 0 failed
opencode typecheck: passed
SDK build: passed, v2 SDK generated files updated
SDK typecheck: passed
Diff check: no whitespace errors

Commands run:

bun --cwd packages/opencode test test/session/llm-trace.test.ts test/session/export.test.ts --timeout 30000
bun --cwd packages/opencode test test/session/processor-effect.test.ts --timeout 30000
bun --cwd packages/opencode typecheck
bun --cwd packages/sdk/js build
bun --cwd packages/sdk/js typecheck
git diff --check

Screenshots or Recordings

Not applicable. No visible UI changes.

Checklist

Requested maintainer labeling for type, scope, and priority.

coderabbitai · 2026-05-05T14:52:20Z

Warning

Rate limit exceeded

@Astro-Han has exceeded the limit for the number of commits that can be reviewed per hour. Please wait 29 minutes and 5 seconds before requesting another review.

To keep reviews running without waiting, you can enable usage-based add-on for your organization. This allows additional reviews beyond the hourly cap. Account admins can enable it under billing.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.


ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro Plus

Run ID: 05ae57ee-b773-4a06-8bba-10abf008c6dd

📥 Commits

Reviewing files that changed from the base of the PR and between b1656c3 and 8d2bbb2.

⛔ Files ignored due to path filters (2)

packages/sdk/js/src/v2/gen/sdk.gen.ts is excluded by !**/gen/**
packages/sdk/js/src/v2/gen/types.gen.ts is excluded by !**/gen/**

📒 Files selected for processing (11)

packages/opencode/src/session/export.ts
packages/opencode/src/session/llm-trace/index.ts
packages/opencode/src/session/llm-trace/recorder.ts
packages/opencode/src/session/llm-trace/types.ts
packages/opencode/src/session/llm.ts
packages/opencode/src/session/message-v2.ts
packages/opencode/src/session/processor.ts
packages/opencode/test/session/export.test.ts
packages/opencode/test/session/llm-trace.test.ts
packages/opencode/test/session/llm.test.ts
packages/opencode/test/session/processor-effect.test.ts


gemini-code-assist

Code Review

This pull request implements a comprehensive LLM tracing system that records request metadata, stream events, and token usage, integrating these traces into session processing and exports. The SDK is also updated with new turn-change endpoints and enhanced question-handling capabilities. Feedback was provided to refine the isEmptyCompletion check in the recorder to include all stored part types for better accuracy.
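The `isEmptyCompletion` feedback can be illustrated with a small sketch. The recorder code is not shown on this page, so the function bodies, the `StoredPartCounts` type, and the part type names below are assumptions; the sketch only contrasts a check that looks at text parts with one that considers every stored part type.

```typescript
// Hypothetical counts of persisted parts, keyed by part type.
type StoredPartCounts = Record<string, number>

// Narrow variant (assumed for contrast): treats a run as empty when no text
// part was stored, even if other part types, such as tool parts, exist.
function isEmptyCompletionTextOnly(counts: StoredPartCounts): boolean {
  return (counts["text"] ?? 0) === 0
}

// Broader variant in the spirit of the review feedback: a run is empty only
// when no part of any type was stored.
function isEmptyCompletion(counts: StoredPartCounts): boolean {
  return Object.values(counts).every((n) => n === 0)
}
```

With these definitions, a run that stored one tool part and no text is flagged as empty by the narrow variant but not by the broader one.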

Astro-Han added 2 commits May 5, 2026 22:51

feat: add llm trace summary helper

bd5651c

feat: export assistant llm trace diagnostics

81cc8e1

Astro-Han added the enhancement (New feature or request), harness (Model harness, prompts, tool descriptions, and session mechanics), and P2 (Medium priority) labels May 5, 2026

gemini-code-assist Bot reviewed May 5, 2026

Comment thread packages/opencode/src/session/llm-trace/recorder.ts

Astro-Han added 2 commits May 5, 2026 23:05

fix: preserve noop active tool compatibility

ab90282

test: cover llm trace error flags

8d2bbb2

Astro-Han merged commit 36e2f24 into dev May 5, 2026
20 checks passed

Astro-Han deleted the codex/i454-llm-trace-export branch May 5, 2026 15:29

Astro-Han merged 4 commits into dev from codex/i454-llm-trace-export
