Skip to content

feat(ci): opt in to cargo nextest run (Phase 2 pilot)#897

Merged
noahgift merged 1 commit into
mainfrom
feat/use-nextest-pilot
Apr 18, 2026
Merged

feat(ci): opt in to cargo nextest run (Phase 2 pilot)#897
noahgift merged 1 commit into
mainfrom
feat/use-nextest-pilot

Conversation

@noahgift

Copy link
Copy Markdown
Contributor

Summary

Opt aprender into use_nextest: true in the reusable sovereign-ci workflow (Refs PMAT-155).

Phase 2 pilot — use_nextest was just added as a workflow_call boolean input to paiml/.github/.github/workflows/sovereign-ci.yml (default false). We're piloting it on the same cohort as the sccache Phase 3 pilot (PMAT-151): copia, bashrs, aprender.

aprender is the heavy pilot — APR-MONO monorepo with 60+ workspace crates, the largest dep graph in the fleet and a driver of fleet p95. This is the most interesting repo for nextest: parallel test execution should win big here. Measured F11 baseline p95=449s for aprender.

After 7 days of measurement via F11 falsifier (cargo run --example falsify_f11_test_job_p95), if p95 ≤ 300s we flip the reusable workflow's default to true. This PR is just the pilot enrollment — see paiml/infra#52 for the falsifier.

Test plan

  • CI passes (ci / gate, workspace-test)
  • F11 falsifier records post-change p95 daily for 7 days
  • If workspace-test (40m timeout) holds on cold path, pilot is green

🤖 Generated with Claude Code

Phase 2 pilot — F11 baseline p95=449s for aprender (heaviest pilot, APR-MONO 60+ crates), see paiml/infra#52 for the falsifier.
@noahgift noahgift enabled auto-merge (squash) April 18, 2026 18:30
@noahgift noahgift merged commit 49c3284 into main Apr 18, 2026
11 checks passed
@noahgift noahgift deleted the feat/use-nextest-pilot branch April 18, 2026 18:44
noahgift added a commit that referenced this pull request Apr 19, 2026
…-TWO-001 teacher

Re-dated 2026-04-15 → 2026-04-19 since v0.31.0 tag was never cut; rolls
up everything that landed between PR #748 (initial 0.30→0.31 bump) and
PR #888 (MCP spec v1.2.0 + parity epic closure).

New in [0.31.0] - 2026-04-19 (beyond original 2026-04-15 rc1 content):

- MCP Server M1–M3: 9 apr tools over stdio JSON-RPC 2.0, YAML-codegen'd
  schemas (FALSIFY-MCP-008), notifications/cancelled + progress,
  JSON Schema Draft 7 meta-validation in CI.
- apr code — Claude Code parity epic CLOSED at v5.1 (14/3/4 over 21 rows;
  PMAT-CODE-PARITY-MATRIX-001 both closure conditions met). 10 tickets
  closed in one cycle (P0×4, P1×5, P2×2).
- Contracts harness: `pv check-parity` SEMANTIC gate + new
  apr-claude-proxy-v1.yaml DRAFT.
- SHIP-TWO-001 teacher shipped: paiml/qwen2.5-coder-7b-apache-q4k-v1
  (7.5 GB .apr, Apache-2.0) — first artifact to pass full apr publish
  contract. `apr validate-manifest` + `--live` + FALSIFY-PM-007
  safetensors dtype Poka-Yoke.
- Perf: decode hot-path hygiene HP-001/002/003 (184→382 tok/s on 1.5B
  Q4_K_M, 2.07×); 32-tok bench 442.8→479.9; FlashDecoding gated for
  small models.
- CI: sccache pilot (#894), nextest opt-in (#897).
- Flaky perf test fixes: tui_load, F-203, RP-002-prop, citl-neural.

Previous [0.31.0] - 2026-04-15 entry was an rc draft; merged into the
final 2026-04-19 cut to keep the release history single-authoritative.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
noahgift added a commit that referenced this pull request Apr 19, 2026
…-TWO-001 teacher (#899)

Re-dated 2026-04-15 → 2026-04-19 since v0.31.0 tag was never cut; rolls
up everything that landed between PR #748 (initial 0.30→0.31 bump) and
PR #888 (MCP spec v1.2.0 + parity epic closure).

New in [0.31.0] - 2026-04-19 (beyond original 2026-04-15 rc1 content):

- MCP Server M1–M3: 9 apr tools over stdio JSON-RPC 2.0, YAML-codegen'd
  schemas (FALSIFY-MCP-008), notifications/cancelled + progress,
  JSON Schema Draft 7 meta-validation in CI.
- apr code — Claude Code parity epic CLOSED at v5.1 (14/3/4 over 21 rows;
  PMAT-CODE-PARITY-MATRIX-001 both closure conditions met). 10 tickets
  closed in one cycle (P0×4, P1×5, P2×2).
- Contracts harness: `pv check-parity` SEMANTIC gate + new
  apr-claude-proxy-v1.yaml DRAFT.
- SHIP-TWO-001 teacher shipped: paiml/qwen2.5-coder-7b-apache-q4k-v1
  (7.5 GB .apr, Apache-2.0) — first artifact to pass full apr publish
  contract. `apr validate-manifest` + `--live` + FALSIFY-PM-007
  safetensors dtype Poka-Yoke.
- Perf: decode hot-path hygiene HP-001/002/003 (184→382 tok/s on 1.5B
  Q4_K_M, 2.07×); 32-tok bench 442.8→479.9; FlashDecoding gated for
  small models.
- CI: sccache pilot (#894), nextest opt-in (#897).
- Flaky perf test fixes: tui_load, F-203, RP-002-prop, citl-neural.

Previous [0.31.0] - 2026-04-15 entry was an rc draft; merged into the
final 2026-04-19 cut to keep the release history single-authoritative.

Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant