✨ feat(agent-runtime): persist `ERROR_CODE_SPECS` classification on operation errors by arvinxx · Pull Request #15273 · lobehub/lobehub

arvinxx · 2026-05-27T17:41:00Z

💻 Change Type

✨ feat
✅ test

🔗 Related Issue

Follow-up to #15262, which introduced ERROR_CODE_SPECS but only consumed it internally (HTTP status mapping + retry decisions). This PR makes the classification a first-class field on every persisted operation error.

🔀 Description of Change

When an agent operation fails, the catch block in AgentRuntimeService.executeStep normalizes the thrown value into a ChatMessageError and writes it to three sinks:

agent_operations.error JSONB (via agentOperationModel.recordCompletion)
S3 trace snapshot (via OperationTraceRecorder.finalize)
Agent-gateway WS push (via GatewayStreamNotifier.publishAgentRuntimeEnd, which forwards finalState.error verbatim)

Today the persisted shape is just { message, type, body }. Downstream consumers (DC observability dashboards, agent-gateway error triage, support tooling) that want to know whether an error is user-attributable, what category it belongs to, and whether it counts as a failure have to re-derive that themselves — typically by maintaining a parallel mirror of ERROR_CODE_SPECS.

This PR does the lookup once, at the chokepoint:

packages/types/src/message/common/base.ts — ChatMessageError gains optional attribution / category / severity / httpStatus / retryable / countAsFailure / numericId fields plus exported ChatMessageErrorAttribution and ChatMessageErrorSeverity literal types. Zod schema mirrored.
src/server/modules/AgentRuntime/formatErrorForState.ts (new) — extracted from AgentRuntimeService.ts and wraps the existing normalization with an enrichWithSpec step that merges getErrorCodeSpec(formatted.type) onto the result. Codes outside the spec table (e.g. raw InternalServerError) pass through unchanged.
src/server/services/agentRuntime/AgentRuntimeService.ts — imports the extracted helper; the error finalize call to traceRecorder.finalize forwards the classification fields so the S3 snapshot carries them too.
src/server/services/agentRuntime/OperationTraceRecorder.ts — FinalizeParams.error widened to accept the classification fields.

The agent-gateway notifier needs no changes — it forwards finalState.error as-is, so the new fields ride along automatically.

🧪 How to Test

Tested locally
Added/updated tests

```bash
bun run type-check # passes
bunx vitest run src/server/modules/AgentRuntime/formatErrorForState.test.ts # 7 new cases
bunx vitest run src/server/services/agentRuntime/AgentRuntimeService.test.ts # 60 existing pass
bunx vitest run src/server/modules/AgentRuntime/tests/formatErrorEventData.test.ts \
src/server/modules/AgentRuntime/tests/llmErrorClassification.test.ts # 41 pass
```

New test cases cover:

Normalization for all three input shapes (ChatCompletionErrorPayload / standard Error / unknown).
Spec enrichment attaches the expected fields for InsufficientQuota (user / quota / non-retryable / not-a-failure) and RateLimitExceeded (provider / capacity / retryable).
The QuotaLimitReached → RateLimitExceeded alias is resolved through to the canonical spec.
Codes outside the spec table leave every classification field undefined.

📸 Screenshots / Videos

N/A (no UI changes)

📝 Additional Information

Backward-compatible. All new fields on ChatMessageError are optional; the Zod schema additions are also optional. Old persisted error rows in agent_operations.error simply lack the new keys.

Forward path. Once this lands, DC's operation tracing can render attribution / severity / category badges directly from agent_operations.error without maintaining a spec mirror. Agent-gateway already imports isUserSideError from its own mirror; this PR doesn't touch that path.

🤖 Generated with Claude Code

…ration errors Look up the runtime error's spec in `ERROR_CODE_SPECS` at the single catch chokepoint and merge `attribution` / `category` / `severity` / `httpStatus` / `retryable` / `countAsFailure` / `numericId` onto the normalized `ChatMessageError`. The enriched object flows through to all three downstream sinks — `agent_operations.error` JSONB, S3 trace snapshot, and the agent-gateway WS push — without each consumer having to re-run pattern matching. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

vercel · 2026-05-27T17:41:06Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
lobehub	Ready	Preview, Comment	May 27, 2026 6:06pm

sourcery-ai

We've reviewed this pull request using the Sourcery rules engine

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: e2615f07f5

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

codecov · 2026-05-27T17:46:39Z

Codecov Report

❌ Patch coverage is 97.43590% with 2 lines in your changes missing coverage. Please review.
✅ Project coverage is 70.96%. Comparing base (c4b1475) to head (56118e0).
⚠️ Report is 1 commits behind head on canary.

Additional details and impacted files

@@           Coverage Diff            @@
##           canary   #15273    +/-   ##
========================================
  Coverage   70.95%   70.96%            
========================================
  Files        3160     3161     +1     
  Lines      317287   317334    +47     
  Branches    34491    33581   -910     
========================================
+ Hits       225127   225181    +54     
+ Misses      91991    91984     -7     
  Partials      169      169

Flag	Coverage Δ
app	`61.72% <97.18%> (+0.01%)`	⬆️
database	`92.22% <ø> (ø)`
packages/agent-runtime	`80.48% <ø> (ø)`
packages/builtin-tool-lobe-agent	`19.87% <ø> (ø)`
packages/context-engine	`84.13% <ø> (ø)`
packages/conversation-flow	`91.28% <ø> (ø)`
packages/file-loaders	`87.89% <ø> (ø)`
packages/memory-user-memory	`74.99% <ø> (ø)`
packages/model-bank	`99.99% <ø> (ø)`
packages/model-runtime	`84.55% <ø> (ø)`
packages/prompts	`72.67% <ø> (ø)`
packages/python-interpreter	`92.90% <ø> (ø)`
packages/ssrf-safe-fetch	`0.00% <ø> (ø)`
packages/types	`35.64% <100.00%> (+0.25%)`	⬆️
packages/utils	`88.47% <ø> (ø)`
packages/web-crawler	`88.08% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

Components	Coverage Δ
Store	`67.71% <ø> (ø)`
Services	`54.64% <ø> (ø)`
Server	`72.26% <97.18%> (+0.02%)`	⬆️
Libs	`56.97% <ø> (ø)`
Utils	`85.94% <ø> (ø)`

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

Model-runtime failures caught inside `runtime.step()` resolve normally with `newState.status = 'error'` instead of throwing, so the prior commit's outer `executeStep` catch never sees common provider errors like `InvalidProviderAPIKey` / `InsufficientQuota`. Those were reaching `agent_operations.error` JSONB and the success-path trace snapshot raw — without `attribution` / `category` / `severity` / … Run `formatErrorForState` on `stepResult.newState.error` immediately after `runtime.step()` returns, before the state is saved to Redis, hooks are dispatched, or the trace is finalized. Made the helper idempotent (recognizes already-normalized `ChatMessageError` shape) so a second pass through the outer catch can't collapse it back to `AgentRuntimeError`. Success-path `traceRecorder.finalize` now forwards the classification fields too. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

arvinxx · 2026-05-27T18:00:50Z

Good catch — fixed in 56118e0. The inner runtime.step() catch resolves with newState.status = 'error' instead of throwing, so the outer executeStep catch never sees InvalidProviderAPIKey / InsufficientQuota etc. Those were reaching agent_operations.error JSONB and the success-path trace snapshot raw.

Now:

stepResult.newState.error runs through formatErrorForState immediately after runtime.step() returns, before saveStepResult / publishStreamEvent / dispatchHooks → persistCompletion / traceRecorder.finalize.
formatErrorForState is now idempotent — it recognizes an already-normalized ChatMessageError (type present, no errorType) and re-enriches in place. Without this, the outer catch running over state.error a second time would collapse it to AgentRuntimeError.
The success-path traceRecorder.finalize call (around line 960) now forwards attribution / category / severity / httpStatus / retryable / countAsFailure / numericId from newState.error, matching the outer-catch finalize.

Tests: 69 pass (61 existing AgentRuntimeService + 8 in formatErrorForState.test.ts including the new idempotency + partial-input cases). bun run type-check clean.

@localfile

# 🚀 LobeHub Release (20260528) **Release Date:** May 28, 2026 **Since v2.2.0:** 220 merged PRs · 15 contributors > This cycle brings heterogeneous "platform agents" you can dispatch to local or remote devices, a rebuilt onboarding flow, document-centric chat, and a unified model-runtime error model — with new DeepSeek V4 and Gemini 3.5 Flash support along the way. --- ## ✨ Highlights - **More Hetero Agents (OpenClaw / Hermes)** — Create heterogeneous agents and dispatch them to local or remote devices through the device gateway, with an execution-target switcher in the composer and persistent CLI sessions. (#15065, #15179, #15022) - **iMessage on Desktop** — New iMessage setup and bridge on desktop, plus bot attachments across every platform. (#15228, #15227, #15029) - **Skills in the Composer** — Drag skill chips into chat, trigger installed skills from the slash menu mid-line, and surface project-level skills in the homogeneous agent runtime. (#15095, #15061, #15110) - **New Models** — DeepSeek V4 Flash/Pro and Gemini 3.5 Flash across providers, with thinking params for structured output and chat cost estimates. (#15031, #15001, #15051, #14876) - **Agent Runtime Observability** — OpenTelemetry GenAI semantic conventions plus per-call generation tracing. (#15123, #15124) --- ## 🤖 Agents & Heterogeneous Runtime - **Platform agent creation** — OpenClaw/Hermes creation UI, device guard, and remote dispatch backend. (#15065) - **Execution-target switcher** — Pick local vs remote execution directly in the composer; device-selection UX with actionable guidance. (#15179, #15111) - **CLI hetero dispatch** — OpenClaw/Hermes dispatch with persistent sessions and a notify protocol. (#15022) - **Gateway snapshot as source of truth** — Consume the gateway `uiMessages` snapshot at step boundaries to keep chat state consistent. (#15153, #15152) - **Client sub-agent as a normal tool call** — Simplifies the sub-agent execution path. (#15281) - **Hermes agent chain** — Implements the Hermes agent chain logic. (#15189) - **Device registry** — TRPC endpoints to register, list, update, and remove devices. (#15299) - **Desktop device routing** — Route gateway agent runs through `lh hetero exec`; restore `userId` in gateway dispatch and gate local-system by execution target. (#15132, #15232) - **Agent signals** — Anchor agent-signal receipts to messages and isolate memory-agent messages into a child thread. (#14969, #14921) --- ## 🚀 Onboarding - **Simplified first screen** — Defer topic creation to first send. (#15090) - **Market Agent Picker** — Added as a classic onboarding step, with template prefetch. (#14980, #15041) - **Welcome guidance** — Show agent welcome guidance on first run. (#15098) - **Mobile** — Adapt agent onboarding UI and restore Classic-step padding on mobile. (#15019, #15032) - **Discovery** — Streamline discovery to a single profession question. (#14987) - **Analytics** — Track onboarding step events and create-agent modal source. (#15133, #15028) --- ## 📄 Documents, Pages & Knowledge - **Thread chat in preview** — Embed thread chat in the document preview portal. (#15216) - **Non-markdown rendering** — Render non-markdown docs as a read-only highlight. (#15272) - **Multi-select** — Multi-select delete in the document tree. (#15125) - **Page-agent streaming** — Preview `initPage` streaming arguments. (#15039) - **Per-agent topics** — Per-agent topic management page. (#15207) - **Server-side category** — Derive document category server-side and drop frontend predicates. (#15076) --- ## 🧩 Skills & Tools - **Drag skill chips** — Drag skills into chat input and register agent-document skills. (#15095) - **Slash menu** — Installed skills appear in the slash menu with a mid-line trigger. (#15061) - **Project skills** — Recognize project-level skills in the homogeneous agent runtime and surface them regardless of active device. (#15110, #15177) - **VFS archiving** — Archive oversized tool results to VFS instead of truncating. (#15074) - **@localfile mentions** — Drag folders into chat input as `@localFile` mentions on desktop. (#15071) --- ## 🧠 Model Runtime & Providers - **Error spec registry** — Unify error codes into a spec + pattern registry, split `ProviderBizError` into finer codes, classify Cloud-only codes via a tier digit, and add `DatabasePersistError`. (#15262, #15286, #15278, #15279) - **New models** — DeepSeek V4 Flash/Pro (opencode-go) and Gemini 3.5 Flash; DeepSeek V4 Pro on SiliconCloud. (#15031, #15001, #15017, #15267) - **Structured output** — Thinking params for structured output, Bedrock structured generation, and DeepSeek `generateObject` tool choice. (#15051, #15174, #15054) - **Cost** — Chat cost estimate support; preserve usage cost in custom streams. (#14876, #15218) --- ## 💬 Chat & User Experience - **Follow-up chips** — Extend follow-up chip suggestions to general chat with scene-specific model config. (#15101, #14797) - **Input drafts** — Persist unsent input drafts across tab switches and prevent repeated draft restore. (#14992, #15024) - **Command menu** — Order topic/message search by recency and promote inline type filters. (#15094, #14986) - **Zoom HUD** — Show a zoom-level HUD on Cmd +/− and Cmd 0. (#15294) - **Copy** — Unescape markdown escapes when copying user messages. (#15253) --- ## 🖥️ Desktop - **App Nap fix** — Prevent App Nap from dropping the gateway WebSocket during display sleep. (#14994) - **File preview** — Preview `.cjs`/`.mjs`/no-extension files instead of binary fallback and expand `~` when opening local files. (#15168, #15284) - **Cross-platform settings** — Open settings via main-window navigation on Windows/Linux and restore the route after an update restart. (#15036, #14922) - **Token refresh** — Prevent frequent logout from token-refresh retries. (#14928) --- ## 📊 Observability - **OTel GenAI** — Instrument Agent Runtime with OpenTelemetry GenAI semantic conventions. (#15123) - **Generation tracing** — Per-call `llm_generation_tracing` with a pre-allocated tracingId and recordFeedback router. (#15124, #15146) - **Error classification** — Persist `ERROR_CODE_SPECS` classification on operation errors. (#15273) --- ## 🗃️ Database Migrations - **Batch migrations** — Topic usage stats, push tokens, `tasks.editor_data`, and document shares. (#15280) - **Tracing & eval tables** — Add `llm_generation_tracing` and agent eval experiment tables. (#15126) > Self-hosted operators should run the database migration (`pnpm db:migrate`, or restart with auto-migrate enabled) after upgrading. The changes are additive and backwards-compatible. --- ## 🔒 Security & Reliability - **Security:** Remove the `getPlaintextCred` tool to prevent plaintext credential exposure. (#14998) - **Security:** Prompt account selection for Google OAuth and add `prompt=consent` to the OIDC authorization URL to fix missing refresh tokens. (#15234, #15010) - **Reliability:** Preserve streamed content across a mid-stream cancel. (#15173) - **Reliability:** Bound the Redis command timeout and configure the Anthropic client timeout. (#15091, #15042) - **Reliability:** Prevent infinite recursion in the assistant chain. (#15288) --- ## 👥 Contributors Huge thanks to **15 contributors** who shipped **220 merged PRs** this cycle. @AnotiaWang · @sxjeru · @algojogacor · @hardy-one · @arvinxx · @Innei · @tjx666 · @lijian · @AmAzing129 · @rdmclin2 · @neko · @cy948 · @CanisMinor · @sudongyuer · @rivertwilight Plus @lobehubbot and renovate[bot] for maintenance. --- **Full Changelog**: v2.2.0...release/weekly-20260528

dosubot Bot added size:L This PR changes 100-499 lines, ignoring generated files. feature:agent Assistant/Agent configuration and behavior labels May 27, 2026

sourcery-ai Bot reviewed May 27, 2026

View reviewed changes

arvinxx changed the title ~~✨ feat(agent-runtime): persist ERROR_CODE_SPECS classification on operation errors~~ ✨ feat(agent-runtime): persist ERROR_CODE_SPECS classification on operation errors May 27, 2026

chatgpt-codex-connector Bot reviewed May 27, 2026

View reviewed changes

Comment thread src/server/services/agentRuntime/AgentRuntimeService.ts

vercel Bot deployed to Preview May 27, 2026 17:50 View deployment

vercel Bot deployed to Preview May 27, 2026 18:06 View deployment

arvinxx merged commit 8c0e66b into canary May 27, 2026
35 checks passed

arvinxx deleted the arvinxu/feat/operation-error-classification branch May 27, 2026 18:25

This was referenced May 28, 2026

✨ feat(model-runtime): classify Cloud-only error codes via numericId tier digit #15278

Merged

✨ feat(model-runtime): add DatabasePersistError code for failed DB queries #15279

Merged

🚀 release: 20260528 #15302

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

✨ feat(agent-runtime): persist `ERROR_CODE_SPECS` classification on operation errors#15273

✨ feat(agent-runtime): persist `ERROR_CODE_SPECS` classification on operation errors#15273
arvinxx merged 2 commits into
canaryfrom
arvinxu/feat/operation-error-classification

arvinxx commented May 27, 2026

Uh oh!

vercel Bot commented May 27, 2026 •

edited

Loading

Uh oh!

sourcery-ai Bot left a comment

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

Uh oh!

codecov Bot commented May 27, 2026 •

edited

Loading

Uh oh!

arvinxx commented May 27, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

arvinxx commented May 27, 2026

💻 Change Type

🔗 Related Issue

🔀 Description of Change

🧪 How to Test

📸 Screenshots / Videos

📝 Additional Information

Uh oh!

vercel Bot commented May 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sourcery-ai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

codecov Bot commented May 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

arvinxx commented May 27, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

vercel Bot commented May 27, 2026 •

edited

Loading

codecov Bot commented May 27, 2026 •

edited

Loading