spec(ship-two-models): v2.90.0 — §45 apr-cpu-vs-gpu-output-parity-v1 5/5 LIVE DISCHARGE milestone#1447
Merged
Merged
Conversation
…5/5 LIVE DISCHARGE milestone Canonical record of today's terminal-discharge milestone. The apr-cpu-vs-gpu-output-parity-v1 contract reaches its complete state: all 5/5 falsifiers DISCHARGED with live empirical evidence on the canonical Qwen2.5-Coder-7B teacher. Milestone significance: - First contract in the SHIP-TWO program to reach 5/5 DISCHARGED (complete-evidence terminal state). - Largest single-cycle coverage flip of the SHIP-TWO program: +5 falsifiers DISCHARGED in one 2-PR cycle (#1445 + #1446). - The §41 → §43 → §44 → §45 jidoka chain is contract-complete: silent-GPU-gibberish on canonical broken-GPU is no longer possible. Both impl closure AND end-to-end live verification delivered. Coverage tally: 15+37 → 20+32. Contract: v1.3.0 → v1.5.0 ACTIVE. MODEL-1 ship %: 89% → 91%. §45 documents: - 45.1 What landed (PR table) - 45.2 The complete observed jidoka chain (verbatim from smoke log) - 45.3 Coverage flip (5/5 status table) - 45.4 Why this milestone matters (audit + MODEL-1 + cadence) - 45.5 Five Whys - 45.6 Net effects - 45.7 Next-session pickup (SHIP-007 GPU kernel fix, MODEL-2 §35 real-training, cross-contract sweep — all multi-PR research tracks) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Spec v2.89.0 → v2.90.0 records today's terminal-discharge milestone: `apr-cpu-vs-gpu-output-parity-v1` reaches 5/5 DISCHARGED — first contract in the SHIP-TWO program to achieve complete-evidence terminal state.
Milestone significance
What §45 records
Net effects
Test plan
🤖 Generated with Claude Code