Skip to content

contract(gpu-training-backend-v1): GATE-GPUTRAIN-004 verdict pending → pass (v1.4 → v1.5)#1071

Merged
noahgift merged 1 commit into
mainfrom
feat/gputrain-004-gate-verdict-pass
Apr 26, 2026
Merged

contract(gpu-training-backend-v1): GATE-GPUTRAIN-004 verdict pending → pass (v1.4 → v1.5)#1071
noahgift merged 1 commit into
mainfrom
feat/gputrain-004-gate-verdict-pass

Conversation

@noahgift

Copy link
Copy Markdown
Contributor

Summary

  • GATE-GPUTRAIN-004 verdict: pendingpass (370M step-time budget < 500ms on RTX 4090).
  • Contract gpu-training-backend-v1.yaml v1.4.0 → v1.5.0.
  • This is the contractual durable verdict for §19.4 Residual B / §20 live evidence.

Why

GATE-GPUTRAIN-004's paired falsification test FALSIFY-GPUTRAIN-005 has been DISCHARGED since 2026-04-24 with median 101.30 ms (20.3% of 500ms budget). The gate's own verdict: pending was a contract-cosmetic gap — the underlying invariant was already satisfied.

Evidence basis (now cited inline as verdict_basis)

  1. FALSIFY-GPUTRAIN-005 (canonical config seq_len=2048 batch=1): median 101.30 ms / 25 steps — evidence/task-132/
  2. §20 (PR docs(ship-two-001): §20 live CUDA training dispatch evidence — spec v2.65.0 #1070, config seq_len=512 num_steps=50): median 264.74 ms / 100 steps — evidence/task-132-residual-b/

Two evidence files at different config bands show budget compliance is robust at this margin (well below 500ms in both).

Validation

pv validate contracts/entrenar/gpu-training-backend-v1.yaml: 0 errors, 0 warnings.

Test plan

  • Contract version bumped 1.4.0 → 1.5.0 (top-level + metadata.version)
  • pv validate passes
  • PMAT pre-commit gates pass
  • No rule change — only verdict field flip + verdict_basis added

Stacks under

Coverage tally impact

GATE-GPUTRAIN-004 was already PARTIAL_ALGORITHM_LEVEL counted in the 33+12 tally. Promoting to verdict-pass keeps the overall PARTIAL count flat (already counted) and the DISCHARGED count unchanged (the falsifier was already DISCHARGED). The verdict flip is bookkeeping aligning gate state with falsifier state.

🤖 Generated with Claude Code

@noahgift noahgift enabled auto-merge (squash) April 26, 2026 09:46
…→ pass — spec §20 + #1059 evidence — v1.4.0 → v1.5.0

GATE-GPUTRAIN-004 (370M step-time budget < 500ms on RTX 4090) was
marked `verdict: pending` despite its paired falsification test
FALSIFY-GPUTRAIN-005 being DISCHARGED with median 101.30 ms
(20.3% of budget) since 2026-04-24.

This contract bump flips the gate to `verdict: pass` with a
`verdict_basis` field citing both:

1. **FALSIFY-GPUTRAIN-005 evidence** (canonical config seq_len=2048
   batch=1): median 101.30 ms across 25 steps on
   noah-Lambda-Vector RTX 4090 — `evidence/task-132/`.
2. **§20 evidence** (PR #1070, different config seq_len=512):
   median 264.74 ms across 100 steps — `evidence/task-132-residual-b/`.

Both well under the 500ms ceiling. Two evidence files at different
config bands demonstrate budget compliance is robust at this margin.

Contract version v1.4.0 → v1.5.0 (additive metadata, no rule
change). `pv validate`: 0 errors, 0 warnings.

This is a contract-cosmetic flip — GATE-GPUTRAIN-004's underlying
invariant has been satisfied since 2026-04-24; the `verdict: pending`
field was only the gate's own pointer was missing.

References:
- spec §20 (PR #1070): live evidence capture 2026-04-26
- spec §19.4 Residual B: this is the contractual durable verdict
- evidence/task-132/rtx4090-370m-step-budget-and-repro.json
- evidence/task-132-residual-b/cuda-50step-2026-04-26.json

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
@noahgift noahgift force-pushed the feat/gputrain-004-gate-verdict-pass branch from 5f962ce to 11f725c Compare April 26, 2026 13:04
@noahgift noahgift merged commit 83ef3de into main Apr 26, 2026
10 checks passed
@noahgift noahgift deleted the feat/gputrain-004-gate-verdict-pass branch April 26, 2026 13:23
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant