[Feature] Define Run Incident Framework for interrupted runs #808

@Astro-Han

Goal

Define a unified Run Incident Framework for assistant runs that end unexpectedly. This is the umbrella direction for #802, #803, and #804: instead of fixing each observed interruption as an isolated patch, build one shared language for facts, cause, phase, recovery policy, and user-facing presentation.

Why this exists

Recent terminated-session exports showed that PawWork can collect many useful low-level signals, but the signals are still too ad hoc:

The common problem is not just one missing field. The common problem is that PawWork needs a unified run-incident model.

Direction

A run incident should be split into five layers:

  1. Facts — what actually happened.
  2. Cause — why the run ended.
  3. Phase — where the run was interrupted.
  4. Policy — what recovery action is safe.
  5. Presentation — what export reviewers and users should see.
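To make the layering concrete, here is a minimal TypeScript sketch of how the five layers could fit into one record. It is illustrative only and is not the design draft: every type name, field, enum value, and policy rule below is a hypothetical placeholder, not something PawWork defines.

```typescript
// Hypothetical sketch: all names and values are assumptions for illustration.

// 1. Facts: raw observations only, with no interpretation.
interface IncidentFacts {
  runId: string;
  lastEvent: string; // last low-level signal seen before the run ended
}

// 2. Cause and 3. Phase: small closed vocabularies (placeholder values).
type IncidentCause = "user_cancel" | "provider_error" | "process_exit" | "unknown";
type IncidentPhase = "before_first_token" | "streaming" | "tool_call" | "finalizing";

// 4. Policy: which recovery action is considered safe.
type RecoveryPolicy = "auto_retry" | "offer_resume" | "manual_only";

// 5. Presentation: what users and export reviewers see.
interface IncidentPresentation {
  userMessage: string;
  exportSummary: string;
}

interface RunIncident {
  facts: IncidentFacts;
  cause: IncidentCause;
  phase: IncidentPhase;
  policy: RecoveryPolicy;
  presentation: IncidentPresentation;
}

// Policy depends only on cause and phase, never on raw facts directly.
// The rules here are placeholders, not an agreed recovery policy.
function decidePolicy(cause: IncidentCause, phase: IncidentPhase): RecoveryPolicy {
  if (cause === "user_cancel") return "manual_only";
  // A tool call may already have had side effects, so do not retry silently.
  if (phase === "tool_call") return "offer_resume";
  if (cause === "provider_error" && phase === "before_first_token") return "auto_retry";
  return "offer_resume";
}

// Build the full record layer by layer: facts, cause, phase, policy, presentation.
function buildIncident(
  facts: IncidentFacts,
  cause: IncidentCause,
  phase: IncidentPhase,
): RunIncident {
  const policy = decidePolicy(cause, phase);
  const presentation: IncidentPresentation = {
    userMessage:
      policy === "auto_retry"
        ? "The run was interrupted and is being retried."
        : "The run was interrupted. You can resume or start again.",
    exportSummary: `${cause} during ${phase} (run ${facts.runId})`,
  };
  return { facts, cause, phase, policy, presentation };
}
```

Keeping policy a pure function of cause and phase means a new low-level signal only touches the facts layer and the classification step, rather than every recovery path and UI string.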

The first design draft is recorded in the first comment on this issue.

Related work

Direct implementation issues:

Related active or residual reliability work:

Foundational completed work:

Series index:

Scope

In scope:

Out of scope:

  • One giant implementation PR.
  • Provider SDK/network fixes.
  • Telemetry sink integration.
  • Broad UI redesign.

Execution mode

Investigate first and get the design plan approved before implementing. Here, "plan" means the issue-level design and scope proposal, not a PR-level implementation checklist. Once the design is approved, agents may proceed with implementation plans inside the agreed scope. Post a new issue comment and wait for an explicit "approved" only when an implementation would change that design scope.

Metadata

Assignees

No one assigned

    Labels

      • P1: High priority
      • app: Application behavior and product flows
      • enhancement: New feature or request
      • harness: Model harness, prompts, tool descriptions, and session mechanics

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests