-
-
Notifications
You must be signed in to change notification settings - Fork 79.1k
Fallback provider can become pinned in session metadata and trap a chat on unavailable LM Studio model #90462
Copy link
Copy link
Open
Labels
P1High-priority user-facing bug, regression, or broken workflow.High-priority user-facing bug, regression, or broken workflow.clawsweeper:needs-live-reproClawSweeper needs live local, crabbox, or manual validation to confirm this issue.ClawSweeper needs live local, crabbox, or manual validation to confirm this issue.clawsweeper:needs-maintainer-reviewClawSweeper marked this issue as needing maintainer review before automation.ClawSweeper marked this issue as needing maintainer review before automation.clawsweeper:needs-product-decisionClawSweeper marked this issue as needing a product or behavior decision.ClawSweeper marked this issue as needing a product or behavior decision.clawsweeper:no-new-fix-prClawSweeper does not recommend queueing a new automated fix PR for this issue.ClawSweeper does not recommend queueing a new automated fix PR for this issue.impact:auth-providerAuth, provider routing, model choice, or SecretRef resolution may break.Auth, provider routing, model choice, or SecretRef resolution may break.impact:session-stateSession, memory, transcript, context, or agent state can drift or corrupt.Session, memory, transcript, context, or agent state can drift or corrupt.issue-rating: 🐚 platinum hermitGood issue quality with a plausible reproduction path needing some confirmation.Good issue quality with a plausible reproduction path needing some confirmation.
Metadata
Metadata
Assignees
Labels
P1High-priority user-facing bug, regression, or broken workflow.High-priority user-facing bug, regression, or broken workflow.clawsweeper:needs-live-reproClawSweeper needs live local, crabbox, or manual validation to confirm this issue.ClawSweeper needs live local, crabbox, or manual validation to confirm this issue.clawsweeper:needs-maintainer-reviewClawSweeper marked this issue as needing maintainer review before automation.ClawSweeper marked this issue as needing maintainer review before automation.clawsweeper:needs-product-decisionClawSweeper marked this issue as needing a product or behavior decision.ClawSweeper marked this issue as needing a product or behavior decision.clawsweeper:no-new-fix-prClawSweeper does not recommend queueing a new automated fix PR for this issue.ClawSweeper does not recommend queueing a new automated fix PR for this issue.impact:auth-providerAuth, provider routing, model choice, or SecretRef resolution may break.Auth, provider routing, model choice, or SecretRef resolution may break.impact:session-stateSession, memory, transcript, context, or agent state can drift or corrupt.Session, memory, transcript, context, or agent state can drift or corrupt.issue-rating: 🐚 platinum hermitGood issue quality with a plausible reproduction path needing some confirmation.Good issue quality with a plausible reproduction path needing some confirmation.
Type
Fields
Give feedbackNo fields configured for issues without a type.
Summary
OpenClaw can trap a single chat/topic on an unavailable fallback model after a transient model-call failure.
In our case, one Telegram topic started replying only:
The configured/default model had been restored to
openai/gpt-5.5, but the affected topic continued using an effective session-level route to:The only way to recover the chat was to manually edit session metadata.
Environment
2026.5.28 (e932160)openai/gpt-5.5Observed behavior
During an unrelated network/server incident, a regular OpenAI/Codex request failed while the Telegram topic session was active. The observed provider-side response was a
403from:OpenClaw interpreted this transient network/provider/auth-like failure as a reason to trigger fallback.
Fallback selected:
That fallback route was then persisted in session metadata for the affected Telegram topic. LM Studio could not serve the selected model and returned:
After that, every subsequent message in the topic produced only
Model is unloaded.Running:
did not recover the topic. The visible/default model changed back to OpenAI, but the effective session route was still pinned to LM Studio.
Neighboring Telegram topics remained on
openai/gpt-5.5, so this was not a full gateway outage.Affected state
Local inspection found pinned model/provider metadata on the direct session and one Telegram topic session.
The affected topic metadata had:
Other active Telegram topics still had:
Why this is a bug
Automatic fallback became a persistent per-session route after a transient failure, and the selected fallback model was unavailable.
That made the affected chat unrecoverable through normal user-facing model commands.
Problems observed:
/model openai/gpt-5.5changed the visible/default model but not the effective pinned route;doctordid not clearly report session-level fallback pinning;Model is unloadedwas sent repeatedly instead of a recovery hint.Expected behavior
Automatic fallback should be safe, observable, and reversible.
Fallback should not permanently pin a session unless:
If a fallback model fails with
Model is unloaded, OpenClaw should treat that fallback as failed and either:Suggested fixes
Diagnostics:
openclaw sessions inspect <session-id>showing configured vs effective provider/model, override source, and last automatic fallback reason/time./model statusfor the current chat/topic.doctorwarn when active sessions are pinned to fallback/local providers.Recovery:
/model resetto clear current chat/topic model/provider overrides.openclaw sessions reset-model <session-id>to clear session-level model/provider override metadata.Fallback safety:
Model is unloaded, clear or skip that fallback route instead of repeatedly returning the raw provider error.Related
This is related in spirit to model/session reset/reconcile concerns, but it is a concrete failure mode where automatic fallback persisted an unavailable provider route and trapped one Telegram topic.