Skip to content

docs(readme): SHIP-007 GPU known-issue warning per FALSIFY-MODEL-1-SHIP-CPU-004#1114

Merged
noahgift merged 22 commits into
mainfrom
docs/readme-ship-007-gpu-known-issue
May 13, 2026
Merged

docs(readme): SHIP-007 GPU known-issue warning per FALSIFY-MODEL-1-SHIP-CPU-004#1114
noahgift merged 22 commits into
mainfrom
docs/readme-ship-007-gpu-known-issue

Conversation

@noahgift

Copy link
Copy Markdown
Contributor

Summary

Adds a "Known Issue (SHIP-007)" callout to the README Quick Start that recommends `--no-gpu` for the canonical 7B Q4K teacher.

Why this matters

Per FALSIFY-MODEL-1-SHIP-CPU-004 (apr-cli-model-1-ship-via-cpu-v1.yaml v1.0.0, PR #1113), the SHIP gate REQUIRES user-facing docs to warn about the GPU known-issue. This PR satisfies that gate.

Without the warning, users running:
```bash
apr run paiml/qwen2.5-coder-7b-apache-q4k-v1 "What is 2+2?"
```
get gibberish ("ampiezza = 1") because the default GPU path triggers SHIP-007.

With the warning, users follow the recommended:
```bash
apr run paiml/qwen2.5-coder-7b-apache-q4k-v1 "What is 2+2?" --no-gpu
```
and get correct output ("2 + 2 equals").

Verification

```
$ grep -rE -- '(--no-gpu|SHIP-007|GPU known issue|use --no-gpu)' README.md docs/
README.md:> Known Issue (SHIP-007): For the canonical 7B Q4K teacher
README.md:> (paiml/qwen2.5-coder-7b-apache-q4k-v1), use `--no-gpu` until the
README.md:> apr run paiml/qwen2.5-coder-7b-apache-q4k-v1 "What is 2+2?" --no-gpu
```

✓ FALSIFY-MODEL-1-SHIP-CPU-004 PASSES.

Plain progress on shipping models

This is the LAST gate-completing piece for the MODEL-1 SHIP-via-CPU chain. With this + #1113 + #1111 (§40 spec) all merged:

  • FALSIFY-MODEL-1-SHIP-CPU-001 (CPU correctness) ✓
  • FALSIFY-MODEL-1-SHIP-CPU-002 (§40 in spec) ✓
  • FALSIFY-MODEL-1-SHIP-CPU-003 (pv validate) ✓
  • FALSIFY-MODEL-1-SHIP-CPU-004 (docs warn) ✓ ← THIS PR
  • FALSIFY-MODEL-1-SHIP-CPU-005 (semver) ✓
  • FALSIFY-MODEL-1-SHIP-CPU-006 (spec back-ref) → needs §40.6 update mentioning the contract; small follow-up

MODEL-1 ships TODAY via CPU once these 4 PRs merge to main.

Methodology adherence

  • Five-whys codified in commit body ✓
  • Per `feedback_fix_root_cause_never_route_around.md`: this is documentation, NOT a workaround. Bug remains tracked in §40 ✓
  • Removes itself when GPU fix lands (commit body §5) ✓

Test plan

  • PMAT pre-commit gates pass
  • FALSIFY-MODEL-1-SHIP-CPU-004 grep verifies live
  • Authored in worktree (no git racing)
  • Independent of other in-flight PRs (clean against main)

🤖 Generated with Claude Code

noahgift and others added 9 commits April 28, 2026 15:23
…-SHIP-CPU-004

Adds a "Known Issue (SHIP-007)" callout to README Quick Start that
recommends `--no-gpu` for the canonical 7B Q4K teacher
(paiml/qwen2.5-coder-7b-apache-q4k-v1).

Per SPEC-SHIP-TWO-001 §40 (PR #1111) + apr-cli-model-1-ship-via-cpu-v1.yaml
(PR #1113), the GPU dispatch path on this specific model currently
produces gibberish output ("ampiezza = 1") while the CPU path produces
correct mathematical reasoning ("2 + 2 equals").

This satisfies FALSIFY-MODEL-1-SHIP-CPU-004 (user-facing docs warn
about GPU known-issue):

  $ grep -rE -- '(--no-gpu|SHIP-007|GPU known issue|use --no-gpu)' \
      README.md docs/ apr-cookbook/
  README.md:> **Known Issue (SHIP-007)**: For the canonical 7B Q4K teacher
  README.md:> (paiml/qwen2.5-coder-7b-apache-q4k-v1), use `--no-gpu` until the
  README.md:> apr run paiml/qwen2.5-coder-7b-apache-q4k-v1 "What is 2+2?" --no-gpu

Five-whys (consistent with §40):
1. Why does the README need this warning? Users running `apr run` on the
   canonical 7B Q4K teacher get gibberish without `--no-gpu`.
2. Why? The default GPU dispatch path has SHIP-007 (GPU FP8/dequant defect).
3. Why now? PR #1113 contract requires user-facing docs warning per
   FALSIFY-MODEL-1-SHIP-CPU-004 — this PR satisfies that gate.
4. Why this README placement? Quick Start is the highest-traffic section;
   users see it first.
5. What removes the warning? GPU fix lands → contract bumps to v2.0.0
   → README warning becomes obsolete and gets removed in same PR.

Spec ref: §40.6 Option A (PR #1111).
Contract ref: apr-cli-model-1-ship-via-cpu-v1 (PR #1113).
Coverage: contributes to MODEL-1 SHIP gate completeness.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
@noahgift noahgift enabled auto-merge (squash) May 11, 2026 18:48
@noahgift noahgift merged commit 38a756e into main May 13, 2026
10 checks passed
@noahgift noahgift deleted the docs/readme-ship-007-gpu-known-issue branch May 13, 2026 06:07
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant