-
-
Notifications
You must be signed in to change notification settings - Fork 79.1k
[Bug] Ollama Cloud rate-limit cooldown permanently blocks agents — not released after API recovery #87608
Copy link
Copy link
Closed
Labels
P1High-priority user-facing bug, regression, or broken workflow.High-priority user-facing bug, regression, or broken workflow.bugSomething isn't workingSomething isn't workingclawsweeper:needs-live-reproClawSweeper needs live local, crabbox, or manual validation to confirm this issue.ClawSweeper needs live local, crabbox, or manual validation to confirm this issue.clawsweeper:needs-maintainer-reviewClawSweeper marked this issue as needing maintainer review before automation.ClawSweeper marked this issue as needing maintainer review before automation.clawsweeper:needs-product-decisionClawSweeper marked this issue as needing a product or behavior decision.ClawSweeper marked this issue as needing a product or behavior decision.clawsweeper:no-new-fix-prClawSweeper does not recommend queueing a new automated fix PR for this issue.ClawSweeper does not recommend queueing a new automated fix PR for this issue.impact:auth-providerAuth, provider routing, model choice, or SecretRef resolution may break.Auth, provider routing, model choice, or SecretRef resolution may break.impact:message-lossChannel message delivery can be lost, duplicated, or misrouted.Channel message delivery can be lost, duplicated, or misrouted.impact:session-stateSession, memory, transcript, context, or agent state can drift or corrupt.Session, memory, transcript, context, or agent state can drift or corrupt.issue-rating: 🐚 platinum hermitGood issue quality with a plausible reproduction path needing some confirmation.Good issue quality with a plausible reproduction path needing some confirmation.
Metadata
Metadata
Assignees
Labels
P1High-priority user-facing bug, regression, or broken workflow.High-priority user-facing bug, regression, or broken workflow.bugSomething isn't workingSomething isn't workingclawsweeper:needs-live-reproClawSweeper needs live local, crabbox, or manual validation to confirm this issue.ClawSweeper needs live local, crabbox, or manual validation to confirm this issue.clawsweeper:needs-maintainer-reviewClawSweeper marked this issue as needing maintainer review before automation.ClawSweeper marked this issue as needing maintainer review before automation.clawsweeper:needs-product-decisionClawSweeper marked this issue as needing a product or behavior decision.ClawSweeper marked this issue as needing a product or behavior decision.clawsweeper:no-new-fix-prClawSweeper does not recommend queueing a new automated fix PR for this issue.ClawSweeper does not recommend queueing a new automated fix PR for this issue.impact:auth-providerAuth, provider routing, model choice, or SecretRef resolution may break.Auth, provider routing, model choice, or SecretRef resolution may break.impact:message-lossChannel message delivery can be lost, duplicated, or misrouted.Channel message delivery can be lost, duplicated, or misrouted.impact:session-stateSession, memory, transcript, context, or agent state can drift or corrupt.Session, memory, transcript, context, or agent state can drift or corrupt.issue-rating: 🐚 platinum hermitGood issue quality with a plausible reproduction path needing some confirmation.Good issue quality with a plausible reproduction path needing some confirmation.
Type
Fields
Give feedbackNo fields configured for issues without a type.
Bug type
Regression (worked before, now fails)
Beta release blocker
No
Summary
After an Ollama Cloud 429 rate-limit error, OpenClaw's internal rate-limit cooldown permanently blocks agent sessions. Even after the Ollama API fully recovers (confirmed by direct API test returning HTTP 200), blocked agents cannot start new sessions. Only agents with already-active sessions continue to work. Gateway restarts do not clear the cooldown state. Observed in v2026.5.22, persists after upgrade to v2026.5.26.
Steps to reproduce
Expected behavior
After a temporary 429 rate-limit error clears, OpenClaw should retry API calls and resume normal operation. Agents should be able to start new sessions and respond to messages once the Ollama API returns to normal (HTTP 200). Gateway restarts should clear any internal rate-limit cooldown state. In v2026.5.22 and earlier, temporary 429 errors were handled with exponential backoff and agents recovered automatically.
Actual behavior
Agents Kaho and Tom are permanently blocked with FailoverError:⚠️ API rate limit reached. Please try again later. Direct API test with the same key returns HTTP 200. Gateway logs show EmbeddedAttemptSessionTakeoverError: session file changed while embedded prompt lock was released. The Ollama dashboard confirms 0.6% weekly usage — no real limit reached. Only agents with already-active sessions (Tim, Tea) continue to function on the same API key.
OpenClaw version
2026.5.26
Operating system
Windows 11 (10.0.26200) x64
Install method
npm global (npm install -g openclaw)
Model
ollama/glm-5.1:cloud (primary), ollama/kimi-k2.5:cloud, ollama/minimax-m2.7:cloud (fallbacks)
Provider / routing chain
openclaw -> ollama.com (Ollama Cloud)
Additional provider/model setup details
4 agents share the same Ollama Cloud API key (Pro tier). Config in
~/.openclaw/openclaw.jsonundermodels.providers.ollama. Each agent hasmodelset toollama/glm-5.1:cloudwith fallbacks. Kaho was updated to use model object with fallbacks (primary+fallbacks[]) but the fallbacks also return 429. Direct Node.js API test with the same key returns HTTP 200, confirming the issue is in OpenClaw's internal rate-limit state, not in Ollama.Logs, screenshots, and evidence
Impact and severity
Additional information
Last known good: v2026.5.22 (before upgrade, agents worked intermittently). First known bad: v2026.5.22 (after burst of 429 errors, agents became permanently blocked).
Workaround attempted:
Suggested fix:
openclaw gateway reset-ratelimit)Retry-Afterheaders from Ollama instead of indefinite cooldownhttps.requesttoollama.com/v1/chat/completionswith the same API key returns HTTP 200 immediately, while OpenClaw's internal tracker continues returning FailoverError