Summary
Recurring operational failure (~1-2 times per month): assistant confirms a requested action, but no execution starts (no tool calls, no artifacts), then later reports "no progress".
This is an execution-state bug (false "started"), not a model-quality complaint.
Environment
- OpenClaw:
2026.3.7
- Agent/session:
agent:main:main
- Model path in this case:
openai-codex/gpt-5.3-codex
- OS: Windows
Reproduction (from real session log, detailed timeline)
Session file:
C:\Users\karte\.openclaw\agents\main\sessions\859c8fac-93a6-41f2-af04-3b57519d76a4.jsonl
Timeline (UTC from JSONL):
2026-03-09T05:00:18Z assistant says "Проверю сейчас..." for a quick check request.
2026-03-09T05:07:50Z user asks: проведи глубокий ауди всех свои файлов.
2026-03-09T05:07:54Z assistant replies: "Принял. Проведу глубокий аудит..."
- Between
05:07:54Z and 05:18:52Z there is no execution evidence for that accepted audit action:
- no
toolCall
- no
toolResult
- no artifact path
- no run id / process id / state update
2026-03-09T05:18:52Z assistant reports: "Нет, валидного прогресса нет."
Representative message ids in this segment:
- accepted-action reply:
72f8c694
- no-progress reply:
ae45a24d
Expected behavior
- If execution cannot start for any reason, assistant should fail fast and return
blocked/error immediately.
- It should not acknowledge "started/doing" unless execution actually started.
- Runtime state transition should be deterministic (
accepted -> running only after real action event).
Actual behavior
- Assistant acknowledges execution in natural language.
- No action starts.
- User needs manual polling; later assistant admits no progress.
Why this matters
This creates hidden idle periods and breaks orchestration trust: chat state diverges from runtime truth.
Suggested fix direction
- Enforce execution gate: block "started/doing" replies unless there is a fresh artifact (
toolCall, process start, runId, state file update).
- Add watchdog: if no action event appears within N seconds after acceptance, auto-convert to
blocked/error with concrete reason.
- Expose this clearly in telemetry/state machine so UI/chat cannot drift from runtime.
I can provide additional traces from prior dates with the same pattern if needed.
Summary
Recurring operational failure (~1-2 times per month): assistant confirms a requested action, but no execution starts (no tool calls, no artifacts), then later reports "no progress".
This is an execution-state bug (false "started"), not a model-quality complaint.
Environment
2026.3.7agent:main:mainopenai-codex/gpt-5.3-codexReproduction (from real session log, detailed timeline)
Session file:
C:\Users\karte\.openclaw\agents\main\sessions\859c8fac-93a6-41f2-af04-3b57519d76a4.jsonlTimeline (UTC from JSONL):
2026-03-09T05:00:18Zassistant says "Проверю сейчас..." for a quick check request.2026-03-09T05:07:50Zuser asks:проведи глубокий ауди всех свои файлов.2026-03-09T05:07:54Zassistant replies: "Принял. Проведу глубокий аудит..."05:07:54Zand05:18:52Zthere is no execution evidence for that accepted audit action:toolCalltoolResult2026-03-09T05:18:52Zassistant reports: "Нет, валидного прогресса нет."Representative message ids in this segment:
72f8c694ae45a24dExpected behavior
blocked/errorimmediately.accepted -> runningonly after real action event).Actual behavior
Why this matters
This creates hidden idle periods and breaks orchestration trust: chat state diverges from runtime truth.
Suggested fix direction
toolCall, process start,runId, state file update).blocked/errorwith concrete reason.I can provide additional traces from prior dates with the same pattern if needed.