contract(gpu-training-backend-v1): GATE-GPUTRAIN-004 verdict pending → pass (v1.4 → v1.5) by noahgift · Pull Request #1071 · paiml/aprender

noahgift · 2026-04-26T09:46:20Z

Summary

GATE-GPUTRAIN-004 verdict: pending → pass (370M step-time budget < 500ms on RTX 4090).
Contract gpu-training-backend-v1.yaml v1.4.0 → v1.5.0.
This is the contractual durable verdict for §19.4 Residual B / §20 live evidence.

Why

GATE-GPUTRAIN-004's paired falsification test FALSIFY-GPUTRAIN-005 has been DISCHARGED since 2026-04-24 with median 101.30 ms (20.3% of 500ms budget). The gate's own verdict: pending was a contract-cosmetic gap — the underlying invariant was already satisfied.

Evidence basis (now cited inline as `verdict_basis`)

FALSIFY-GPUTRAIN-005 (canonical config seq_len=2048 batch=1): median 101.30 ms / 25 steps — evidence/task-132/
§20 (PR docs(ship-two-001): §20 live CUDA training dispatch evidence — spec v2.65.0 #1070, config seq_len=512 num_steps=50): median 264.74 ms / 100 steps — evidence/task-132-residual-b/

Two evidence files at different config bands show budget compliance is robust at this margin (well below 500ms in both).

Validation

pv validate contracts/entrenar/gpu-training-backend-v1.yaml: 0 errors, 0 warnings.

Test plan

Contract version bumped 1.4.0 → 1.5.0 (top-level + metadata.version)
pv validate passes
PMAT pre-commit gates pass
No rule change — only verdict field flip + verdict_basis added

Stacks under

docs(ship-two-001): §20 live CUDA training dispatch evidence — spec v2.65.0 #1070 (§20 — live CUDA training dispatch evidence)
Independent of feat(sub-ffn-telemetry): 4 new ActivationStats fields on LayerActivation — implements trace-ffn-sub-block-v1.yaml #1066, feat(pretrain): add wall_ms to StepMetrics — Residual B per spec §19.4 #1069 (those are code; this is contract metadata)

Coverage tally impact

GATE-GPUTRAIN-004 was already PARTIAL_ALGORITHM_LEVEL counted in the 33+12 tally. Promoting to verdict-pass keeps the overall PARTIAL count flat (already counted) and the DISCHARGED count unchanged (the falsifier was already DISCHARGED). The verdict flip is bookkeeping aligning gate state with falsifier state.

🤖 Generated with Claude Code

…→ pass — spec §20 + #1059 evidence — v1.4.0 → v1.5.0 GATE-GPUTRAIN-004 (370M step-time budget < 500ms on RTX 4090) was marked `verdict: pending` despite its paired falsification test FALSIFY-GPUTRAIN-005 being DISCHARGED with median 101.30 ms (20.3% of budget) since 2026-04-24. This contract bump flips the gate to `verdict: pass` with a `verdict_basis` field citing both: 1. **FALSIFY-GPUTRAIN-005 evidence** (canonical config seq_len=2048 batch=1): median 101.30 ms across 25 steps on noah-Lambda-Vector RTX 4090 — `evidence/task-132/`. 2. **§20 evidence** (PR #1070, different config seq_len=512): median 264.74 ms across 100 steps — `evidence/task-132-residual-b/`. Both well under the 500ms ceiling. Two evidence files at different config bands demonstrate budget compliance is robust at this margin. Contract version v1.4.0 → v1.5.0 (additive metadata, no rule change). `pv validate`: 0 errors, 0 warnings. This is a contract-cosmetic flip — GATE-GPUTRAIN-004's underlying invariant has been satisfied since 2026-04-24; the `verdict: pending` field was only the gate's own pointer was missing. References: - spec §20 (PR #1070): live evidence capture 2026-04-26 - spec §19.4 Residual B: this is the contractual durable verdict - evidence/task-132/rtx4090-370m-step-budget-and-repro.json - evidence/task-132-residual-b/cuda-50step-2026-04-26.json Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

noahgift enabled auto-merge (squash) April 26, 2026 09:46

noahgift force-pushed the feat/gputrain-004-gate-verdict-pass branch from 5f962ce to 11f725c Compare April 26, 2026 13:04

noahgift merged commit 83ef3de into main Apr 26, 2026
10 checks passed

noahgift deleted the feat/gputrain-004-gate-verdict-pass branch April 26, 2026 13:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

contract(gpu-training-backend-v1): GATE-GPUTRAIN-004 verdict pending → pass (v1.4 → v1.5)#1071

contract(gpu-training-backend-v1): GATE-GPUTRAIN-004 verdict pending → pass (v1.4 → v1.5)#1071
noahgift merged 1 commit into
mainfrom
feat/gputrain-004-gate-verdict-pass

noahgift commented Apr 26, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

noahgift commented Apr 26, 2026

Summary

Why

Evidence basis (now cited inline as verdict_basis)

Validation

Test plan

Stacks under

Coverage tally impact

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Evidence basis (now cited inline as `verdict_basis`)