✨ feat(model-runtime): classify Cloud-only error codes via numericId tier digit #15278
…tier digit

The three Cloud-only `ChatErrorType` codes (`FreePlanLimit`, `InsufficientBudgetForModel`, `LobeHubModelDeprecated`) were emitted by the managed gateway but had no spec, so they showed up unclassified on the operation dashboards.

Rather than add a 10th `ErrorCategory` (the single-digit category prefix 1-9 is exhausted, and a 10th would break the 4-digit numericId scheme + its validation tests), encode the OSS-vs-Cloud distinction in the **second digit** of `numericId`: `0` = open-source runtime, `9` = Cloud-only. Every existing code already has tier digit 0, so this is purely additive: the category leading-digit invariant, 4-digit range, and `E####` regex all hold unchanged.

- `taxonomy.ts`: document the tier digit, add `CLOUD_TIER_DIGIT = 9`.
- `specs.ts`: widen the spec key/`code` type to `SpecErrorCode` (`ILobeAgentRuntimeErrorType | CloudErrorCode`); add the three entries under their semantic categories with tier-9 ids: `FreePlanLimit` E2901 & `InsufficientBudgetForModel` E2902 (quota), `LobeHubModelDeprecated` E4901 (request). All `attribution: user`, `countAsFailure: false`.
- `match.test.ts`: assert every spec's tier digit is 0 or 9, and the three Cloud codes resolve under the cloud tier.

Locale keys (`response.<code>`) for all three already exist. The agent-gateway mirror is updated separately.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
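The digit layout described above can be illustrated with a small sketch. Only `CLOUD_TIER_DIGIT = 9` and the `<category><tier><seq>` layout come from this PR; the helper names (`splitNumericId`, `isCloudOnly`, `formatErrorId`) are hypothetical and are not claimed to exist in `taxonomy.ts`.

```typescript
// Sketch only: helper names are invented for illustration.
const CLOUD_TIER_DIGIT = 9; // value taken from the PR description

interface NumericIdParts {
  category: number; // leading digit, 1-9
  tier: number; // second digit: 0 = open-source runtime, 9 = Cloud-only
  seq: number; // last two digits, sequence within category + tier
}

// Split a 4-digit numericId into <category><tier><seq>.
function splitNumericId(numericId: number): NumericIdParts {
  if (!Number.isInteger(numericId) || numericId < 1000 || numericId > 9999) {
    throw new RangeError(`numericId must be a 4-digit integer, got ${numericId}`);
  }
  return {
    category: Math.floor(numericId / 1000),
    tier: Math.floor(numericId / 100) % 10,
    seq: numericId % 100,
  };
}

// True when the second digit marks the code as Cloud-only.
function isCloudOnly(numericId: number): boolean {
  return splitNumericId(numericId).tier === CLOUD_TIER_DIGIT;
}

// Render the id in the E#### form mentioned in the PR.
function formatErrorId(numericId: number): string {
  return `E${numericId}`;
}
```

With this decomposition, `2901` reads as category 2, tier 9, sequence 1, so adding a tier never changes which category a code belongs to.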
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: 85f99d1eee
[ChatErrorType.FreePlanLimit]: {
  code: ChatErrorType.FreePlanLimit,
  numericId: 2901,
Keep Cloud errors on legacy locale path
When Cloud returns FreePlanLimit, InsufficientBudgetForModel, or LobeHubModelDeprecated, adding these ChatErrorType strings to ERROR_CODE_SPECS makes getRuntimeErrorMessage route them to modelRuntime:${code} because it checks getErrorCodeSpec(code). These messages currently only exist as error.response.* keys, not in src/locales/default/modelRuntime.ts or generated locale files, so affected Cloud users will see a missing/raw translation key instead of the existing billing/model-deprecated text. Either keep these Cloud-only codes out of that routing path or add the corresponding modelRuntime locale entries.
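One way to read the first option in this comment (keeping Cloud-only codes off the new routing path) is sketched below. Everything here is an assumption made for illustration: the toy `specs` table, the `ExampleOssError` code and its id, the `resolveLocaleKey` helper, and the exact key strings are hypothetical and do not reproduce the repository's `getRuntimeErrorMessage`.

```typescript
// Sketch only: names and key formats are assumptions, not the real API.
type Spec = { numericId: number };

const CLOUD_TIER_DIGIT = 9; // value taken from the PR description

// Toy registry: one made-up open-source code and one Cloud-only code from the PR.
const specs: Record<string, Spec> = {
  ExampleOssError: { numericId: 1001 }, // hypothetical code and id
  FreePlanLimit: { numericId: 2901 },
};

// Second digit of the 4-digit numericId.
const tierOf = (numericId: number): number => Math.floor(numericId / 100) % 10;

// Pick a locale key for an error code. Codes without a spec, and codes whose
// spec is in the Cloud tier, stay on the legacy `response.*` path; all other
// codes with a spec use the `modelRuntime` namespace.
function resolveLocaleKey(code: string): string {
  const spec = specs[code];
  if (!spec || tierOf(spec.numericId) === CLOUD_TIER_DIGIT) {
    return `error:response.${code}`; // assumed legacy key format
  }
  return `modelRuntime:${code}`;
}
```

The alternative named in the comment, adding `modelRuntime` locale entries for the three codes, would need no routing change at all.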
Codecov Report
✅ All modified and coverable lines are covered by tests.
@@ Coverage Diff @@
## canary #15278 +/- ##
=======================================
Coverage 71.02% 71.03%
=======================================
Files 3172 3172
Lines 317388 317420 +32
Branches 27919 27919
=======================================
+ Hits 225437 225473 +36
+ Misses 91782 91778 -4
Partials 169 169
# LobeHub Release (20260528)

**Release Date:** May 28, 2026
**Since v2.2.0:** 220 merged PRs · 15 contributors

> This cycle brings heterogeneous "platform agents" you can dispatch to local or remote devices, a rebuilt onboarding flow, document-centric chat, and a unified model-runtime error model, with new DeepSeek V4 and Gemini 3.5 Flash support along the way.

---

## Highlights

- **More Hetero Agents (OpenClaw / Hermes)**: Create heterogeneous agents and dispatch them to local or remote devices through the device gateway, with an execution-target switcher in the composer and persistent CLI sessions. (#15065, #15179, #15022)
- **iMessage on Desktop**: New iMessage setup and bridge on desktop, plus bot attachments across every platform. (#15228, #15227, #15029)
- **Skills in the Composer**: Drag skill chips into chat, trigger installed skills from the slash menu mid-line, and surface project-level skills in the homogeneous agent runtime. (#15095, #15061, #15110)
- **New Models**: DeepSeek V4 Flash/Pro and Gemini 3.5 Flash across providers, with thinking params for structured output and chat cost estimates. (#15031, #15001, #15051, #14876)
- **Agent Runtime Observability**: OpenTelemetry GenAI semantic conventions plus per-call generation tracing. (#15123, #15124)

---

## Agents & Heterogeneous Runtime

- **Platform agent creation**: OpenClaw/Hermes creation UI, device guard, and remote dispatch backend. (#15065)
- **Execution-target switcher**: Pick local vs remote execution directly in the composer; device-selection UX with actionable guidance. (#15179, #15111)
- **CLI hetero dispatch**: OpenClaw/Hermes dispatch with persistent sessions and a notify protocol. (#15022)
- **Gateway snapshot as source of truth**: Consume the gateway `uiMessages` snapshot at step boundaries to keep chat state consistent. (#15153, #15152)
- **Client sub-agent as a normal tool call**: Simplifies the sub-agent execution path. (#15281)
- **Hermes agent chain**: Implements the Hermes agent chain logic. (#15189)
- **Device registry**: TRPC endpoints to register, list, update, and remove devices. (#15299)
- **Desktop device routing**: Route gateway agent runs through `lh hetero exec`; restore `userId` in gateway dispatch and gate local-system by execution target. (#15132, #15232)
- **Agent signals**: Anchor agent-signal receipts to messages and isolate memory-agent messages into a child thread. (#14969, #14921)

---

## Onboarding

- **Simplified first screen**: Defer topic creation to first send. (#15090)
- **Market Agent Picker**: Added as a classic onboarding step, with template prefetch. (#14980, #15041)
- **Welcome guidance**: Show agent welcome guidance on first run. (#15098)
- **Mobile**: Adapt agent onboarding UI and restore Classic-step padding on mobile. (#15019, #15032)
- **Discovery**: Streamline discovery to a single profession question. (#14987)
- **Analytics**: Track onboarding step events and create-agent modal source. (#15133, #15028)

---

## Documents, Pages & Knowledge

- **Thread chat in preview**: Embed thread chat in the document preview portal. (#15216)
- **Non-markdown rendering**: Render non-markdown docs as a read-only highlight. (#15272)
- **Multi-select**: Multi-select delete in the document tree. (#15125)
- **Page-agent streaming**: Preview `initPage` streaming arguments. (#15039)
- **Per-agent topics**: Per-agent topic management page. (#15207)
- **Server-side category**: Derive document category server-side and drop frontend predicates. (#15076)

---

## Skills & Tools

- **Drag skill chips**: Drag skills into chat input and register agent-document skills. (#15095)
- **Slash menu**: Installed skills appear in the slash menu with a mid-line trigger. (#15061)
- **Project skills**: Recognize project-level skills in the homogeneous agent runtime and surface them regardless of active device. (#15110, #15177)
- **VFS archiving**: Archive oversized tool results to VFS instead of truncating. (#15074)
- **@localfile mentions**: Drag folders into chat input as `@localFile` mentions on desktop. (#15071)

---

## Model Runtime & Providers

- **Error spec registry**: Unify error codes into a spec + pattern registry, split `ProviderBizError` into finer codes, classify Cloud-only codes via a tier digit, and add `DatabasePersistError`. (#15262, #15286, #15278, #15279)
- **New models**: DeepSeek V4 Flash/Pro (opencode-go) and Gemini 3.5 Flash; DeepSeek V4 Pro on SiliconCloud. (#15031, #15001, #15017, #15267)
- **Structured output**: Thinking params for structured output, Bedrock structured generation, and DeepSeek `generateObject` tool choice. (#15051, #15174, #15054)
- **Cost**: Chat cost estimate support; preserve usage cost in custom streams. (#14876, #15218)

---

## Chat & User Experience

- **Follow-up chips**: Extend follow-up chip suggestions to general chat with scene-specific model config. (#15101, #14797)
- **Input drafts**: Persist unsent input drafts across tab switches and prevent repeated draft restore. (#14992, #15024)
- **Command menu**: Order topic/message search by recency and promote inline type filters. (#15094, #14986)
- **Zoom HUD**: Show a zoom-level HUD on Cmd +/- and Cmd 0. (#15294)
- **Copy**: Unescape markdown escapes when copying user messages. (#15253)

---

## Desktop

- **App Nap fix**: Prevent App Nap from dropping the gateway WebSocket during display sleep. (#14994)
- **File preview**: Preview `.cjs`/`.mjs`/no-extension files instead of binary fallback and expand `~` when opening local files. (#15168, #15284)
- **Cross-platform settings**: Open settings via main-window navigation on Windows/Linux and restore the route after an update restart. (#15036, #14922)
- **Token refresh**: Prevent frequent logout from token-refresh retries. (#14928)

---

## Observability

- **OTel GenAI**: Instrument Agent Runtime with OpenTelemetry GenAI semantic conventions. (#15123)
- **Generation tracing**: Per-call `llm_generation_tracing` with a pre-allocated tracingId and recordFeedback router. (#15124, #15146)
- **Error classification**: Persist `ERROR_CODE_SPECS` classification on operation errors. (#15273)

---

## Database Migrations

- **Batch migrations**: Topic usage stats, push tokens, `tasks.editor_data`, and document shares. (#15280)
- **Tracing & eval tables**: Add `llm_generation_tracing` and agent eval experiment tables. (#15126)

> Self-hosted operators should run the database migration (`pnpm db:migrate`, or restart with auto-migrate enabled) after upgrading. The changes are additive and backwards-compatible.

---

## Security & Reliability

- **Security:** Remove the `getPlaintextCred` tool to prevent plaintext credential exposure. (#14998)
- **Security:** Prompt account selection for Google OAuth and add `prompt=consent` to the OIDC authorization URL to fix missing refresh tokens. (#15234, #15010)
- **Reliability:** Preserve streamed content across a mid-stream cancel. (#15173)
- **Reliability:** Bound the Redis command timeout and configure the Anthropic client timeout. (#15091, #15042)
- **Reliability:** Prevent infinite recursion in the assistant chain. (#15288)

---

## Contributors

Huge thanks to **15 contributors** who shipped **220 merged PRs** this cycle.

@AnotiaWang · @sxjeru · @algojogacor · @hardy-one · @arvinxx · @Innei · @tjx666 · @lijian · @AmAzing129 · @rdmclin2 · @neko · @cy948 · @CanisMinor · @sudongyuer · @rivertwilight

Plus @lobehubbot and renovate[bot] for maintenance.

---

**Full Changelog**: v2.2.0...release/weekly-20260528
Change Type

Related Issue

Follow-up to #15262 / #15273. Three Cloud-only `ChatErrorType` codes were reaching the operation dashboards unclassified because they had no `ERROR_CODE_SPECS` entry.

Description of Change

`FreePlanLimit`, `InsufficientBudgetForModel`, and `LobeHubModelDeprecated` are emitted only by the managed LobeHub Cloud gateway. They live in `ChatErrorType` (not `AgentRuntimeErrorType`) and had no spec, so on Cloud traffic they showed up as `uncategorized` on the error dashboards.

Why not a new `cloud` category: the single-digit category prefix (1 = auth … 9 = config) is exhausted. A 10th category would need prefix 10, which breaks the 4-digit `numericId` scheme, the `E####` parse regex, and the `numericId contract` tests.

Instead, encode the tier in the second digit of `numericId`: `0` = open-source / self-host runtime, `9` = Cloud-only. Every existing code already has tier digit `0`, so the change is purely additive: the category leading-digit invariant, the 4-digit range, and the `E####` regex all hold unchanged, and no existing numericId moves.

Changes:

- `taxonomy.ts`: document the digit layout (`<category><tier><seq>`), export `CLOUD_TIER_DIGIT = 9`.
- `specs.ts`: widen the spec key + `code` field to `SpecErrorCode = ILobeAgentRuntimeErrorType | CloudErrorCode`; add the three entries under their semantic categories with tier-9 ids:
  - `FreePlanLimit` → E2901 (quota)
  - `InsufficientBudgetForModel` → E2902 (quota)
  - `LobeHubModelDeprecated` → E4901 (request)

  All three use `attribution: user`, `severity: warning`, `countAsFailure: false`; httpStatus 402/402/404. (Previously `getStatus` returned `NaN` for these string codes; now it resolves a sane status. The Cloud gateway sets its own status independently, so live behavior is unchanged.)
- `match.test.ts`: assert every spec's tier digit is `0` or `9`, and the three Cloud codes resolve under the cloud tier.

How to Test
```bash
bun run type-check # passes
cd packages/model-runtime && bunx vitest run src/errors/ # 49 pass (47 + 2 new tier tests)
```
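The invariants that the new tier tests are described as checking can be approximated outside the test runner. This is a sketch under stated assumptions: `SpecLike`, `findInvariantViolations`, and the two constants are hypothetical names, and the real assertions in `match.test.ts` may be structured differently.

```typescript
// Sketch only: not the contents of match.test.ts.
type SpecLike = { code: string; numericId: number };

const ALLOWED_TIER_DIGITS = new Set([0, 9]); // 0 = open-source, 9 = Cloud-only
const ERROR_ID_PATTERN = /^E\d{4}$/; // the E#### form from the PR

// Return a human-readable message for every spec that breaks an invariant.
function findInvariantViolations(specs: SpecLike[]): string[] {
  const problems: string[] = [];
  for (const { code, numericId } of specs) {
    // A 4-digit integer always renders as E####; anything else fails here.
    if (!ERROR_ID_PATTERN.test(`E${numericId}`)) {
      problems.push(`${code}: numericId ${numericId} is not a 4-digit id`);
      continue;
    }
    const tier = Math.floor(numericId / 100) % 10;
    if (!ALLOWED_TIER_DIGITS.has(tier)) {
      problems.push(`${code}: tier digit ${tier} is not 0 or 9`);
    }
  }
  return problems;
}

// The three Cloud-only ids from this PR satisfy both checks.
const cloudSpecs: SpecLike[] = [
  { code: "FreePlanLimit", numericId: 2901 },
  { code: "InsufficientBudgetForModel", numericId: 2902 },
  { code: "LobeHubModelDeprecated", numericId: 4901 },
];
console.log(findInvariantViolations(cloudSpecs).length === 0);
```

A hypothetical 10th category would surface here as a 5-digit id, which is the breakage the PR description cites as the reason for using a tier digit instead.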
Additional Information

- Locale keys (`response.FreePlanLimit` / `response.InsufficientBudgetForModel` / `response.LobeHubModelDeprecated`) already exist, so there is nothing to add.
- The agent-gateway mirror is updated separately (there, `FreePlanLimit` = E2002 / `InsufficientBudgetForModel` = E2003 sit under quota; those migrate to E2901/E2902, and `LobeHubModelDeprecated` = E4901 is added). Those gateway ids were never published externally, so the append-only contract is not violated.

🤖 Generated with Claude Code