Skip to content

spec(ship-two-models): v2.89.0 — §44 FALSIFY-CPU-GPU-005 part b impl + distill-train 9/9 sweep close#1444

Merged
noahgift merged 2 commits into
mainfrom
spec/v2-89-cpu-gpu-005-part-b-and-train-sweep-close
May 3, 2026
Merged

spec(ship-two-models): v2.89.0 — §44 FALSIFY-CPU-GPU-005 part b impl + distill-train 9/9 sweep close#1444
noahgift merged 2 commits into
mainfrom
spec/v2-89-cpu-gpu-005-part-b-and-train-sweep-close

Conversation

@noahgift

@noahgift noahgift commented May 3, 2026

Copy link
Copy Markdown
Contributor

Summary

Canonical record of today's continuation cycle. Spec v2.88.0 → v2.89.0 documents the 2-PR chain (#1442 + #1443) that closes both §43.6 next-session pickup items in one amendment.

What §44 records

PR What Effect
#1442 FALSIFY-CPU-GPU-005 part b live implementation wgpu cosine parity gate inline at try_apr_wgpu_inference; symmetric to §41 CUDA parity_gate
#1443 distill-train 9/9 falsifier sweep close TRAIN-007/008 PARTIAL_ALGORITHM_LEVEL + TRAIN-009 BLOCKER_FIXTURE_ABSENT

Coverage flips

  • FALSIFY-CPU-GPU-005: PARTIAL_ALGORITHM_LEVEL (visibility-log+helper) → PARTIAL_ALGORITHM_LEVEL (gate impl in place; live smoke deferred)
  • TRAIN-007 + TRAIN-008: unbound → PARTIAL_ALGORITHM_LEVEL
  • TRAIN-009: unbound → BLOCKER_FIXTURE_ABSENT (honest classification; pending §35)

Net effects

  • MODEL-1 ship %: 88% → 89% (wgpu jidoka armor complete)
  • MODEL-2 ship %: 56% → 57% (last falsifier-binding gap closed; only §35 real-training remains)
  • Coverage tally: 15+35 → 15+37 (+2 PARTIAL closed; TRAIN-009 explicitly blocked)
  • Distill contract: 9/9 falsifiers algorithm-bound; terminal-binding state reached

Five Whys (in §44.4)

  1. Why ship part b now? §43.6 (a) named it; feat(apr-cpu-vs-gpu-output-parity-v1): cpu_vs_gpu_cosine_similarity helper for FALSIFY-CPU-GPU-005 part b #1440 helper unblocked the path.
  2. Why inline impl? Loop body is ~30 LOC; extracting forces 8+ borrowed locals.
  3. Why fail-closed on probe errors? Per jidoka — never ship silent gibberish.
  4. Why BLOCKER for TRAIN-009 not PARTIAL? Honest classification — no test exists, blocker is explicit (§35).
  5. Why bound to 2 PRs? Toyota Way — focused PRs preserve audit story.

Next-session pickup (§44.6)

(a) FALSIFY-CPU-GPU-005 live discharge — operator smoke on canonical 7B teacher (~10min rebuild + 30s + 5s)
(b) MODEL-2 §35 real-training implementation — multi-PR; would simultaneously discharge TRAIN-001/002/009
(c) MODEL-1 SHIP-007 GPU kernel root-cause fix — multi-PR research track

Test plan

  • CI green on required gates

🤖 Generated with Claude Code

…+ distill-train 9/9 sweep close

Canonical record of today's continuation cycle (PRs #1442 + #1443).
Closes the two §43.6 next-session pickup items in one v2.89.0 amendment.

Chain landed (post-§43 v2.88.0):
- #1442: FALSIFY-CPU-GPU-005 part b implementation
  ~70 LOC inline at try_apr_wgpu_inference (gguf_gpu_generate.rs
  ~441-510). Probe-token CPU forward via
  OwnedQuantizedModel::forward_single_with_cache (tiny max_seq=2
  cache) + wgpu single-step replay using the same fwd.forward_layer
  code path the autoregressive loop uses + cosine compare via
  cpu_vs_gpu_cosine_similarity (helper from #1440). < 0.99 → emit
  WGPU_FALLBACK_LOG_PREFIX + return None. Probe error paths
  fail-closed. Symmetric to §41 CUDA parity_gate. Contract
  apr-cpu-vs-gpu-output-parity-v1 v1.2.0 → v1.3.0 ACTIVE.

- #1443: distill-train 9/9 falsifier sweep close
  TRAIN-007 PARTIAL via pv validate (live: 0 errors / 0 warnings).
  TRAIN-008 PARTIAL via cargo test cli_commands registered_commands
  (live: 1 pass; test_no_unregistered_commands enforces the 3-surface
  invariant per feedback_cli_subcommand_three_surface_drift).
  TRAIN-009 BLOCKER_FIXTURE_ABSENT pending §35 real-training impl
  (no val_loss to compare without gradient descent).
  All 9 TRAIN-* falsifiers now have explicit algorithm_evidence
  blocks (8× PARTIAL_ALGORITHM_LEVEL + 1× BLOCKER_FIXTURE_ABSENT) —
  the distill contract has reached terminal-binding state.

§44 documents: what landed (table), coverage flips (FALSIFY-CPU-GPU-005
PARTIAL→PARTIAL deeper, TRAIN-007/008 unbound→PARTIAL, TRAIN-009
unbound→BLOCKER), why for MODEL-1+MODEL-2 (jidoka armor complete +
distill contract terminal-bound), Five Whys, ship % effects (MODEL-1
88→89, MODEL-2 56→57), and next-session pickup options (live
FALSIFY-CPU-GPU-005 discharge OR MODEL-2 §35 real-training OR
MODEL-1 SHIP-007 GPU kernel root-cause fix).

Coverage tally: 15+35 → 15+37 (+2 PARTIAL closed; TRAIN-009 blocked).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
@noahgift noahgift enabled auto-merge (squash) May 3, 2026 23:17
@noahgift noahgift merged commit 0d9cf3a into main May 3, 2026
10 checks passed
@noahgift noahgift deleted the spec/v2-89-cpu-gpu-005-part-b-and-train-sweep-close branch May 3, 2026 23:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant