Skip to content

feat(ops): SLO baselines + weekly reliability scorecard#80

Merged
dgarson merged 1 commit intofeat/tool-reliability-layerfrom
julia/slo-baselines
Feb 23, 2026
Merged

feat(ops): SLO baselines + weekly reliability scorecard#80
dgarson merged 1 commit intofeat/tool-reliability-layerfrom
julia/slo-baselines

Conversation

@dgarson
Copy link
Owner

@dgarson dgarson commented Feb 22, 2026

Summary

Establishes the SLO baseline framework for ClawdBot tool reliability tracking.

SLOs Defined (6 total)

ID Name Target
TCR Task Completion Rate ≥ 95%
TCSR Tool Call Success Rate ≥ 98%
ASR Agent Stall Rate ≤ 2%
MTTFT Mean Time To First Tool ≤ 3s
EAL Error Acknowledgement Latency ≤ 30s
WRS Weekly Reliability Score ≥ 90

Files

  • docs/ops/slo-baselines.md — SLO definitions, targets, measurement methodology, and phase plan
  • docs/ops/weekly-reliability-scorecard-template.md — Fillable weekly scorecard template
  • scripts/generate-scorecard.ts — Reads session JSONL files and renders a filled-in scorecard automatically

Observation Window

2026-02-22 → 2026-03-08 (2 weeks) — passive observation only, no enforcement yet.

Phase 2 (after March 8)

  • Alerting on SLO breaches
  • Cron-scheduled scorecard generation
  • Dashboard integration

Notes

  • Lint errors in src/agents/tool-reliability.ts are pre-existing on feat/tool-reliability-layer (not introduced here)
  • Formatting fixes for 3 other files included per project style rules

Owner: Julia (CAO)

@dgarson dgarson merged commit a650abd into feat/tool-reliability-layer Feb 23, 2026
2 of 9 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant