Skip to content

[Bug]: `Recurring cron jobs with isolated agentTurn + LLM time out when force-run, while equivalent one-shot at jobs succeed #41558

@alexodas

Description

@alexodas

Bug type

Regression (worked before, now fails)

Summary

Recurring cron jobs using isolated agentTurn with an LLM consistently time out when force-run, while equivalent one-shot at jobs succeed. This appears to affect recurring cron execution rather than the model, prompt, or tool use itself.

OpenClaw version
OpenClaw 2026.3.8 (3caab92)

Steps to reproduce

1. Create a recurring cron job with:
- `schedule.kind = "cron"`
- `sessionTarget = "isolated"`
- `payload.kind = "agentTurn"`
- model = `openai-codex/gpt-5.4`
- `timeoutSeconds = 60`

2. Use even a very small prompt, for example:
- `Use the message tool with action='send', channel='slack', to='<target>' and message='DEBUG recurring minimal cron'. Do not write anything else.`

3. Trigger the recurring job manually with force-run.

4. Wait for the run to complete.

5. Observe that it ends with:
- `Error: cron: job execution timed out`

6. Check diagnostic logs and observe:
- `FailoverError: LLM request timed out.`

7. Then create an equivalent one-shot cron job with:
- `schedule.kind = "at"`
- same model
- same `sessionTarget = "isolated"`
- same or equivalent prompt

8. Let the one-shot job run.

9. Observe that the one-shot job succeeds, including cases with tool use and moderately structured prompts.

Expected behavior

Recurring cron jobs using `sessionTarget: "isolated"` and `payload.kind: "agentTurn"` should execute successfully when force-run, just like equivalent one-shot `schedule.kind = "at"` jobs.

If the prompt is valid and the model/tool path works for one-shot jobs, the same logic should also work for recurring cron jobs without timing out.

Actual behavior

Recurring cron jobs consistently fail when force-run, even with very small prompts.

The run ends with:
- `Error: cron: job execution timed out`

Diagnostic logs show:
- `FailoverError: LLM request timed out.`

This happens even for minimal recurring jobs that only call `message.send` with a fixed string.

By contrast, equivalent one-shot `schedule.kind = "at"` jobs succeed.

OpenClaw version

OpenClaw 2026.3.8 (3caab92)

Operating system

Linux 6.17.0-1007-oracle (arm64)

Install method

npm global

Logs, screenshots, and evidence

## Relevant logs

Recurring cron job (force-run):

status: error
error: Error: cron: job execution timed out
durationMs: ~60000



lane task error: lane=cron durationMs=~50 error="FailoverError: LLM request timed out."
lane task error: lane=session:agent:whatsapp:cron:<job-id> durationMs=~59950 error="FailoverError: LLM request timed out."


Equivalent one-shot `schedule.kind = "at"` job:

status: ok
durationMs: ~9000
model: gpt-5.4

Impact and severity

Medium to high.

This breaks confidence in recurring cron jobs that rely on isolated `agentTurn` execution with LLMs, especially when users use force-run to test scheduled automations before relying on them.

The issue can make a recurring automation appear broken even when equivalent one-shot jobs succeed, which is confusing and can cause missed reminders or failed workflows.

Additional information

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't workingregressionBehavior that previously worked and now fails

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions