spec(ship-two-models): v2.89.0 — §44 FALSIFY-CPU-GPU-005 part b impl + distill-train 9/9 sweep close#1444
Merged
noahgift merged 2 commits intoMay 3, 2026
Conversation
…+ distill-train 9/9 sweep close Canonical record of today's continuation cycle (PRs #1442 + #1443). Closes the two §43.6 next-session pickup items in one v2.89.0 amendment. Chain landed (post-§43 v2.88.0): - #1442: FALSIFY-CPU-GPU-005 part b implementation ~70 LOC inline at try_apr_wgpu_inference (gguf_gpu_generate.rs ~441-510). Probe-token CPU forward via OwnedQuantizedModel::forward_single_with_cache (tiny max_seq=2 cache) + wgpu single-step replay using the same fwd.forward_layer code path the autoregressive loop uses + cosine compare via cpu_vs_gpu_cosine_similarity (helper from #1440). < 0.99 → emit WGPU_FALLBACK_LOG_PREFIX + return None. Probe error paths fail-closed. Symmetric to §41 CUDA parity_gate. Contract apr-cpu-vs-gpu-output-parity-v1 v1.2.0 → v1.3.0 ACTIVE. - #1443: distill-train 9/9 falsifier sweep close TRAIN-007 PARTIAL via pv validate (live: 0 errors / 0 warnings). TRAIN-008 PARTIAL via cargo test cli_commands registered_commands (live: 1 pass; test_no_unregistered_commands enforces the 3-surface invariant per feedback_cli_subcommand_three_surface_drift). TRAIN-009 BLOCKER_FIXTURE_ABSENT pending §35 real-training impl (no val_loss to compare without gradient descent). All 9 TRAIN-* falsifiers now have explicit algorithm_evidence blocks (8× PARTIAL_ALGORITHM_LEVEL + 1× BLOCKER_FIXTURE_ABSENT) — the distill contract has reached terminal-binding state. §44 documents: what landed (table), coverage flips (FALSIFY-CPU-GPU-005 PARTIAL→PARTIAL deeper, TRAIN-007/008 unbound→PARTIAL, TRAIN-009 unbound→BLOCKER), why for MODEL-1+MODEL-2 (jidoka armor complete + distill contract terminal-bound), Five Whys, ship % effects (MODEL-1 88→89, MODEL-2 56→57), and next-session pickup options (live FALSIFY-CPU-GPU-005 discharge OR MODEL-2 §35 real-training OR MODEL-1 SHIP-007 GPU kernel root-cause fix). Coverage tally: 15+35 → 15+37 (+2 PARTIAL closed; TRAIN-009 blocked). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Canonical record of today's continuation cycle. Spec v2.88.0 → v2.89.0 documents the 2-PR chain (#1442 + #1443) that closes both §43.6 next-session pickup items in one amendment.
What §44 records
Coverage flips
Net effects
Five Whys (in §44.4)
Next-session pickup (§44.6)
(a) FALSIFY-CPU-GPU-005 live discharge — operator smoke on canonical 7B teacher (~10min rebuild + 30s + 5s)
(b) MODEL-2 §35 real-training implementation — multi-PR; would simultaneously discharge TRAIN-001/002/009
(c) MODEL-1 SHIP-007 GPU kernel root-cause fix — multi-PR research track
Test plan
🤖 Generated with Claude Code