feat: two-layer health monitoring pipeline (judge + triage) for stagnation events

## Context

Deep dive on [Hive](https://github.com/aden-hive/hive) revealed a two-layer health monitoring design that prevents alert fatigue:

1. **Health Judge** (sensitive): Timer-driven, reads tool logs, tracks `steps_since_last_accept`, detects stall/doom-loops. Emits structured `EscalationTicket`.
2. **Queen triage** (conservative): Filters tickets before operator notification. Dismiss: low severity + transient. Intervene: high/critical + doom loop + stall > threshold.

SynthOrg's stagnation detector handles layer 1 but has no post-termination escalation pipeline or alert filtering.

## Action Items

- [ ] After `STAGNATION` or repeated `FAILED` termination, emit structured health event to NotificationSink
- [ ] Design `EscalationTicket` model: severity, cause, evidence, steps_since_last_progress, stall_duration
- [ ] Implement triage filter: dismiss low-severity transient issues, escalate high/critical with evidence
- [ ] Wire to NotificationSink protocol (ntfy.sh research, 2026-03-14)
- [ ] Consider: should triage be a lightweight rule-based filter or an LLM agent?

## Design Notes

Maps directly to the `NotificationSink` protocol recommended in the ntfy.sh research (research log entry #33, 2026-03-14). The two-layer design prevents operator alert fatigue -- the judge is sensitive, the triage filter is conservative.

## References

- [Hive: worker-health-monitoring.md](https://github.com/aden-hive/hive/blob/main/docs/worker-health-monitoring.md)
- Research log: ntfy.sh v2.18.0 (2026-03-14) -- NotificationSink protocol
- Related: #706 (structured failure diagnosis -- provides the data for escalation tickets)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: two-layer health monitoring pipeline (judge + triage) for stagnation events #707

Context

Action Items

Design Notes

References

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

feat: two-layer health monitoring pipeline (judge + triage) for stagnation events #707

Description

Context

Action Items

Design Notes

References

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions