You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
GitHub Squad independently validates three patterns:
No-self-review: Coordinator prevents original agent from reviewing its own work
Decisions drop-box: Append-only decisions.md for persistent auditable shared knowledge
Versioned agent identities: Agent charters stored as plain text alongside code
Why This Matters
SynthOrg's approval gates should structurally prevent self-review (not just by convention). The decisions drop-box is more durable than current org memory consolidation for cross-agent institutional knowledge. Agent Skill Bus (#55) independently arrived at the same append-only pattern (JSONL queue).
Action Items
Audit approval gates: ensure original agent cannot review its own output
Design an auditable shared-knowledge artifact (versioned org-decisions file/store)
Evaluate append-only pattern vs current org memory consolidation approach
Consider versioning agent identities/charters as first-class artifacts
Key findings that validate and extend the no-self-review pattern:
"Out of the box, Claude is a poor QA agent" -- required iterative prompt refinement to achieve reasonable grading. Separate evaluator agents are "far more tractable than making a generator critical of its own work."
GAN-inspired adversarial dynamic: Generator and Evaluator improve iteratively through adversarial feedback loops
Sprint contract pattern: Generator and Evaluator negotiate expected outcomes before each sprint, bridging high-level specs and testable implementations
20x cost multiplier for multi-agent quality: $9 solo agent vs $200 full harness (3 agents, 6 hours). Budget module needs to account for multi-agent QA cost when review gates are active.
Harness simplification principle: complexity should decrease as models improve. Components encode assumptions about model limitations -- those assumptions need periodic stress testing.
Context
GitHub Squad independently validates three patterns:
decisions.mdfor persistent auditable shared knowledgeWhy This Matters
SynthOrg's approval gates should structurally prevent self-review (not just by convention). The decisions drop-box is more durable than current org memory consolidation for cross-agent institutional knowledge. Agent Skill Bus (#55) independently arrived at the same append-only pattern (JSONL queue).
Action Items
References
Additional Research (2026-03-26)
Generator/Evaluator/Planner Separation
Source: Harness Design for Long-Running Application Development (Anthropic Engineering, 2026-03-24)
Key findings that validate and extend the no-self-review pattern:
Cost Implications