Skip to content

evidence(ship-007): live apr trace --save-tensor smoke on canonical 7B teacher#1419

Merged
noahgift merged 1 commit into
mainfrom
evidence/ship-007-pr-c-real-2026-05-03
May 3, 2026
Merged

evidence(ship-007): live apr trace --save-tensor smoke on canonical 7B teacher#1419
noahgift merged 1 commit into
mainfrom
evidence/ship-007-pr-c-real-2026-05-03

Conversation

@noahgift

@noahgift noahgift commented May 3, 2026

Copy link
Copy Markdown
Contributor

Summary

End-to-end live smoke of apr trace --save-tensor on the canonical Qwen2.5-Coder-7B-Instruct-Q4K teacher (RTX 4090, lambda-labs). APRT byte format produced verbatim for both embedding and lm_head stages.

Result

[PMAT-171] Loaded embedded BPE tokenizer: 152064 vocab, 151387 merges, 25 special tokens
Test prompt:  "What is 2+2?"
Token ids:    [3838, 374, 220, 17, 10, 17, 30] (7 tokens)

Wrote 2 stage tensor file(s):
  /tmp/save-tensor-smoke/layer-0/embedding.bin (100364 bytes)
  /tmp/save-tensor-smoke/lm_head.bin (608268 bytes)

Forward pass succeeded — 28 layer activations, 152064 logits

xxd confirms APRT magic + correct layer/dim header fields:

  • embedding.bin: layer=0, dim=25088 (=7×3584 hidden_dim) ✓
  • lm_head.bin: layer=0xFFFFFFFF (WHOLE_MODEL_LAYER), dim=152064 ✓

Why

Empirically discharges three FALSIFY-APR-TRACE-SAVE-* gates from PARTIAL_ALGORITHM_LEVEL → FUNCTIONAL_DISCHARGED:

  • FALSIFY-APR-TRACE-SAVE-009 (apr_diff_values_compat — APRT format)
  • FALSIFY-APR-TRACE-SAVE-010 (LmHead step-2 capture)
  • FALSIFY-APR-TRACE-SAVE-011 (CLI dispatch wire-up)

Contract v1.4.0 bump promoting these gates follows once #1418 v1.3.0 paperwork lands. This evidence file is the source of truth for that bump.

Test plan

  • Live smoke executed on RTX 4090
  • APRT byte format verified via xxd
  • Forward pass produces 28 layer activations + 152,064 logits
  • CI green (markdown-only change)
  • Auto-merge on green

🤖 Generated with Claude Code

… 7B teacher

End-to-end live smoke of `apr trace --save-tensor` against the canonical
Qwen2.5-Coder-7B-Instruct-Q4K teacher on RTX 4090 (lambda-labs).

Outcome: APRT byte format produced verbatim for `embedding,lm_head` stages.
- `layer-0/embedding.bin`: 100,364 bytes = 12-byte header + (7×3584)×4
- `lm_head.bin`: 608,268 bytes = 12-byte header + 152,064×4

xxd-confirmed APRT magic + correct layer/dim header fields. Forward pass
succeeded — 28 layer activations + 152,064 logits — same as `apr run`.

Discharges three FALSIFY-APR-TRACE-SAVE-* gates from PARTIAL_ALGORITHM_LEVEL
to FUNCTIONAL_DISCHARGED level (009/010/011). Contract v1.4.0 bump follows
once #1418 v1.3.0 paperwork lands.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
@noahgift noahgift enabled auto-merge (squash) May 3, 2026 10:06
@noahgift noahgift merged commit 32cbe7e into main May 3, 2026
11 checks passed
@noahgift noahgift deleted the evidence/ship-007-pr-c-real-2026-05-03 branch May 3, 2026 10:30
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant