feat: implement Hybrid Plan + ReAct execution loop #199
Closed
Labels
- prio:medium (Should do, but not blocking)
- scope:large (3+ days of work)
- spec:agent-system (DESIGN_SPEC Section 3 - Agent System)
- spec:task-workflow (DESIGN_SPEC Section 6 - Task & Workflow Engine)
- type:feature (New feature implementation)
Description
Summary
The spec defines a third execution loop (Loop 3: Hybrid) that combines planning with per-step mini-ReAct loops and checkpointing. ReAct (Loop 1) and Plan-and-Execute (Loop 2) are implemented; Hybrid is not.
Design Spec Reference
- §6.5 Agent Execution Loop — Loop 3: Hybrid Plan + ReAct Steps
Scope
- High-level plan generation (3–7 steps)
- Per-step mini-ReAct loop with own turn limit
- Checkpoint after each step (progress summary)
- Optional replanning of remaining steps
- `ExecutionLoop` protocol compliance
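The scope items above can be sketched as a small control loop. This is a minimal illustration, not the project's implementation: `run_hybrid`, `run_mini_react`, `StepResult`, `HybridResult`, and the `plan`/`act`/`replan` callables are all hypothetical names, and the real `ExecutionLoop` protocol signature is not shown in this issue. It also assumes that replanning is triggered only when a step hits its turn limit; the spec may allow replanning after any checkpoint.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional, Tuple

# All names below are hypothetical; they are not taken from the codebase.
Act = Callable[[str, int], Tuple[bool, str]]  # (step, turn) -> (finished, observation)


@dataclass
class StepResult:
    step: str
    done: bool
    turns_used: int
    summary: str  # progress summary stored at the checkpoint


@dataclass
class HybridResult:
    completed: bool
    checkpoints: List[StepResult] = field(default_factory=list)


def run_mini_react(step: str, act: Act, max_turns: int) -> StepResult:
    """Run one step as a bounded ReAct loop with its own turn limit."""
    for turn in range(1, max_turns + 1):
        finished, observation = act(step, turn)
        if finished:
            return StepResult(step, True, turn, observation)
    return StepResult(step, False, max_turns, "turn limit reached")


def run_hybrid(
    task: str,
    plan: Callable[[str], List[str]],
    act: Act,
    replan: Optional[Callable[[str, List[StepResult], List[str]], List[str]]] = None,
    max_turns_per_step: int = 5,
    max_replans: int = 2,
) -> HybridResult:
    steps = list(plan(task))
    if not 3 <= len(steps) <= 7:
        raise ValueError("high-level plan must contain 3-7 steps")

    result = HybridResult(completed=False)
    replans = 0
    while steps:
        step = steps.pop(0)
        outcome = run_mini_react(step, act, max_turns_per_step)
        result.checkpoints.append(outcome)  # checkpoint after each step

        if not outcome.done:
            # Assumption: replan only on a failed step, within a small budget.
            if replan is None or replans >= max_replans:
                return result
            replans += 1
            steps = list(replan(task, result.checkpoints, [step] + steps))
            if not steps:
                return result

    result.completed = True
    return result
```

Passing the planner, the per-turn action, and the replanner as callables keeps the sketch independent of any particular LLM client, and makes the turn limit and replan budget easy to exercise in tests with stub functions.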
Research Context
StructuredAgent AND/OR Trees (arXiv:2603.05294) — Consider as Loop 4 candidate:
- AND nodes (all children succeed), OR nodes (any child suffices), DFS traversal with 3 states
- 52.6% on WebArena vs 46.4% flat ReAct — significant improvement on long-horizon tasks
- Node repair: modify/add children within revision budget before cascading failure up
- Observation summarizer: compress full history into progress + constraint summaries
- Auto-select for complex/epic tasks via `get_loop_type() -> "andor_tree"`
- No code released; adapt algorithms
See research/structuredagent-andor-trees.md in project memory.
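To make the AND/OR semantics in the notes above concrete, here is a minimal sketch of a depth-first traversal over such a tree. Since no reference code was released, everything here is an assumption: the `Node`, `NodeKind`, `NodeState`, and `solve` names are hypothetical, and the three states are guessed to be pending, succeeded, and failed. Node repair and the observation summarizer are deliberately left out.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Callable, List


class NodeKind(Enum):
    AND = "and"    # succeeds only if all children succeed
    OR = "or"      # succeeds if any child succeeds
    LEAF = "leaf"  # executed directly


class NodeState(Enum):  # assumed meaning of the "3 states"
    PENDING = "pending"
    SUCCEEDED = "succeeded"
    FAILED = "failed"


@dataclass
class Node:
    name: str
    kind: NodeKind
    children: List["Node"] = field(default_factory=list)
    state: NodeState = NodeState.PENDING


def solve(node: Node, run_leaf: Callable[[str], bool]) -> bool:
    """Depth-first evaluation; children are tried left to right."""
    if node.kind is NodeKind.LEAF:
        ok = run_leaf(node.name)
    elif node.kind is NodeKind.AND:
        # all() short-circuits: stop at the first failing child.
        ok = all(solve(child, run_leaf) for child in node.children)
    else:
        # any() short-circuits: stop at the first succeeding child.
        ok = any(solve(child, run_leaf) for child in node.children)
    node.state = NodeState.SUCCEEDED if ok else NodeState.FAILED
    return ok
```

Because of the short-circuiting, children that are never visited stay in the pending state, which is one plausible reason to track three states rather than a boolean.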