Skip to content

contracts(llama-370m-sovereign): GATE-ARCH-370M-002 PARTIAL_ALGORITHM_LEVEL (AC-SHIP2-005 uplift)#1682

Merged
noahgift merged 2 commits into
mainfrom
feat/model-2-ac-ship2-005-uplift
May 15, 2026
Merged

contracts(llama-370m-sovereign): GATE-ARCH-370M-002 PARTIAL_ALGORITHM_LEVEL (AC-SHIP2-005 uplift)#1682
noahgift merged 2 commits into
mainfrom
feat/model-2-ac-ship2-005-uplift

Conversation

@noahgift

Copy link
Copy Markdown
Contributor

Summary

Uplifts GATE-ARCH-370M-002 (binds AC-SHIP2-005) from an undated `verdict: pass` to PARTIAL_ALGORITHM_LEVEL with live evidence from the first real MODEL-2 pretraining checkpoint. Closes the §22 "STRUCTURALLY DISCHARGED" note that has lacked a contract-level evidence binding since 2026-04-26.

Live evidence (re-verified 2026-05-15)

`apr 0.33.0` (crates.io install) → `apr inspect --json` on the canonical artifact:

Field Value
file `/mnt/nvme-raid0/runs/model-2-from-scratch-006-50k-tuned/ckpt/epoch-002.apr`
size 1.39 GiB (1,494,053,060 bytes)
apr inspect exit 0
valid `true`
format `"APR v2"`
tensor_count 219
architecture `LlamaForCausalLM`
checksum_valid `true`

The .apr-format invariants (magic, header, tensor manifest, checksum) are all satisfied on a real on-disk artifact produced by the actual training loop — not a synthetic fixture, not the stub.

What this discharges and what it doesn't

Discharges (algorithm-level):

  • AC-SHIP2-005: checkpoint weights saved as .apr (native format, no PyTorch)
  • GATE-ARCH-370M-002: every INV-ARCH-370M-* invariant verified against real on-disk artifact

Does NOT discharge (still PARTIAL or BLOCKER on capacity):

  • AC-SHIP2-003: val_loss=9.78 vs target 2.2 (§34 capacity ceiling; needs §49 pivot)
  • AC-SHIP2-007/008: model output quality (gibberish at val_loss=9.78)
  • Full discharge of GATE-ARCH-370M-002 itself: requires `apr qa --arch-contract ` subcommand binding (~50 LOC follow-up; data fixture is in place)

Contract diff

`llama-370m-sovereign-v1.yaml` v1.10.0 → v1.11.0:

  • GATE-ARCH-370M-002 gains `evidence_discharged_by`, `discharge_status: PARTIAL_ALGORITHM_LEVEL`, `partial_discharge_note`, `full_discharge_blocks_on`
  • New changelog entry 1.11.0
  • 2 new evidence files under `evidence/section-22-ac-ship2-005-uplift-2026-05-15/`

MODEL-2 ship % movement

57% → 60% (1 of 10 PARTIAL_ALGORITHM_LEVEL ACs gains a structured contract-level evidence binding; the others share this fixture or remain blocked on AC-SHIP2-003's val_loss capacity ceiling per §34).

🤖 Generated with Claude Code

…_LEVEL

Uplift GATE-ARCH-370M-002 (binds AC-SHIP2-005) from undated `verdict: pass`
to PARTIAL_ALGORITHM_LEVEL with live evidence from the first real MODEL-2
pretraining checkpoint, closing the §22 "STRUCTURALLY DISCHARGED" note
that lacked a contract-level evidence binding until now.

Evidence on canonical artifact `epoch-002.apr` (§22 / PR #1073, 2026-04-26):
  - file:           /mnt/nvme-raid0/runs/model-2-from-scratch-006-50k-tuned/ckpt/epoch-002.apr
  - size:           1.39 GiB (1494053060 bytes)
  - apr inspect:    exit 0
  - valid:          true
  - format:         "APR v2"
  - tensor_count:   219
  - architecture:   LlamaForCausalLM
  - checksum_valid: true

Re-verified live 2026-05-15 against `apr 0.33.0` (crates.io install).

The .apr-format invariants (magic, header, tensor manifest, checksum)
are all satisfied on a real on-disk artifact produced by the actual
training loop — not a synthetic fixture, not the stub. Full discharge
PENDING on `apr qa --arch-contract <name>` subcommand binding
(~50 LOC + 1 test follow-up; the underlying data fixture is already
in place per this evidence dir).

Contract llama-370m-sovereign v1.10.0 → v1.11.0:
  - GATE-ARCH-370M-002 gains:
      evidence_discharged_by, discharge_status: PARTIAL_ALGORITHM_LEVEL,
      partial_discharge_note, full_discharge_blocks_on
  - New changelog entry 1.11.0 documenting the uplift

MODEL-2 ship %: 57% → 60% (1 of 10 PARTIALs gains a structured
contract-level evidence binding; the 9 others either share this fixture
or are blocked on AC-SHIP2-003's val_loss capacity ceiling per §34).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
@noahgift noahgift enabled auto-merge (squash) May 15, 2026 05:58
@noahgift noahgift merged commit 5654acf into main May 15, 2026
10 checks passed
@noahgift noahgift deleted the feat/model-2-ac-ship2-005-uplift branch May 15, 2026 07:33
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant