feat: HELIX-IDEA-002/007/009 — low-effort trio from helix-db feature ideas spec by noahgift · Pull Request #1605 · paiml/aprender

noahgift · 2026-05-10T09:03:42Z

Summary

Implements the low-effort trio from docs/specifications/helix-db-feature-ideas.md — three patterns adopted (in pattern, not in code) from HelixDB. Each ships its own contract, falsification gates, and integration test.

Idea	Crate	LOC	Falsifiers	Tests
HELIX-IDEA-009 — constant-time API key auth for `apr serve`	apr-cli	~290	FALSIFY-AUTH-001/002/003	11 + 9 unit
HELIX-IDEA-007 — atomic VACUUM-INTO snapshot for the registry	aprender-registry	~120	FALSIFY-SNAPSHOT-001/002/003	7
HELIX-IDEA-002 — `inventory`-based MCP tool registration	aprender-mcp	~480	FALSIFY-INVENTORY-001/002/003	10 + 3 unit

Plus 3 new aprender-contracts integration tests (one per contract, 18 assertions) on the same shape as `apr_mcp_server_contract.rs`.

Total: 46 new tests across 12 files, all green. Zero regressions in 54 aprender-mcp + 468 aprender-registry + 9 apr-cli auth lib tests.

What ships

HELIX-IDEA-009 (apr-cli) — bearer-token auth

`serve_auth::AuthGate` + `layer~~(gate, router)` middleware helper.~~

All 3 router builders (`create_router`, `build_apr_cpu_router`, `build_gpu_router`) gated.

`APR_API_KEY_HASH` (preferred) or `APR_API_KEY` env var. Neither set → disabled with one stderr warning. Comparison via `subtle::ConstantTimeEq::ct_eq` over fixed-length SHA-256 digests.

HELIX-IDEA-007 (aprender-registry) — snapshot

`Registry::snapshot(&self, to: &Path) -> Result<()>` and `RegistryDb::vacuum_into(target)` engine primitive.

Wraps SQLite `VACUUM INTO 'path'`: concurrent writers continue, target is consistent as of statement start.

Refuses to overwrite (FALSIFY-SNAPSHOT-003), tolerates SQLITE_BUSY in writer, completes within 5s budget on commodity hardware (FALSIFY-SNAPSHOT-002).

HELIX-IDEA-002 (aprender-mcp) — inventory dispatcher

`tools::registry::McpToolEntry` + `ToolIndex::from_inventory()` + `register_mcp_tool!` declarative macro.

Replaces both duplicated registration sites at `server.rs:221-233` and `server.rs:461-483`.

All 9 Phase-1 tools migrated; pre-migration FALSIFY-MCP-* gates pass unchanged.

Compile-time uniqueness check downgraded to runtime panic (`inventory::submit!` cannot enforce link-time collision); contract amended to match (FALSIFY-INVENTORY-002).

Out of scope (explicit non-goals)

`apr backup --to ` umbrella subcommand (HELIX-IDEA-007 follow-up). Wiring requires resolving the apr-cli/`pacha` crates.io vs workspace name collision — separate PR.

OAuth, JWT, multi-key auth, `--auth-disabled` CLI flag (HELIX-IDEA-009 v1.1.0 scope).

Persistent HNSW snapshot (depends on HELIX-IDEA-001 substrate choice).

Object-store snapshot — content-addressed objects are immutable, so consistent backup is `cp -r objects/`. Documented but not automated.

Test plan

`cargo test -p apr-cli --test falsify_auth_001 --test falsify_auth_002 --test falsify_auth_003` (11 pass)

`cargo test -p aprender-registry --test falsify_snapshot_001 --test falsify_snapshot_002 --test falsify_snapshot_003` (7 pass)

`cargo test -p aprender-mcp --test falsify_inventory_001 --test falsify_inventory_002 --test falsify_inventory_003` (10 pass)

`cargo test -p aprender-contracts --test apr_serve_api_key_auth_contract --test apr_registry_snapshot_contract --test apr_mcp_tool_inventory_contract` (18 pass)

`cargo test -p aprender-mcp --lib` (54 pass — no regressions)

`cargo test -p aprender-registry --lib` (468 pass — no regressions)

`cargo test -p apr-cli --lib auth` (9 pass + 1 pre-existing ignored)

`pv validate` on all 3 new contracts

`cargo deny check advisories` — no new advisories introduced (pre-existing `core2` issue is upstream/transitive)

CI status checks pass on `main`

🤖 Generated with Claude Code

…1605 state Five-whys: why is the spec stale? Implementation shipped on PR #1605 without an in-tree spec to amend (spec lived on docs/helix-db-feature-ideas branch; impl branched from main); §1.3 measured-state claims now contradict HEAD on three rows. Sweep amendments: - Top-level Status: "Draft / Ideation" → "Active — 3 of 9 shipped". - Version 0.1.0 → 0.2.0. - §1.3 MCP row: pre-PR #1605 hardcoded `Vec<ToolDefinition>` at `server.rs:221-233` is gone; dispatch match at `server.rs:461-483` also gone. Both replaced by `tools::ToolIndex::from_inventory()`. Adding a tool: was 2-file edit (server.rs + tools/mod.rs); now 1 new file under tools/ + 1 line in tools/mod.rs. - §1.3 add row for `subtle` crate: was transitive-only; now direct apr-cli dep (HELIX-IDEA-009). - §1.3 add row for `inventory` crate: was absent; now direct aprender-mcp dep (HELIX-IDEA-002). Schemas still flow through build.rs codegen — FALSIFY-MCP-008 path intentionally untouched. Refs HELIX-IDEA-002, HELIX-IDEA-007, HELIX-IDEA-009, PR #1605. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

Five-whys: §2.9 "Status: Recommended" contradicts the merged code. Contract apr-serve-api-key-auth-v1 is ACTIVE; FALSIFY-AUTH-001/002/003 all ENFORCED on PR #1605 commit 3aef8f9. Spec must reflect that. Sweep amendments to §2.9: - Status: Recommended → Shipped (PR #1605, commit 3aef8f9). - Target crate corrected: aprender-serve → apr-cli (HTTP routers live in apr-cli/src/commands/serve/, not in the inference-only aprender-serve crate). - Acceptance signals annotated with "(Met)" + test_file references matching the contract's falsification_conditions. - New "Implementation deltas vs original sketch" subsection records: --auth-disabled deferred; APR_API_KEY_HASH added (preferred path for deployments where plaintext shouldn't sit on disk). Refs HELIX-IDEA-009, contracts/apr-serve-api-key-auth-v1.yaml. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

Five-whys: §2.7 "Status: Recommended" contradicts the merged engine primitive on PR #1605 commit 378888e. Contract apr-registry-snapshot-v1 is ACTIVE; FALSIFY-SNAPSHOT-001/002/003 all ENFORCED. The umbrella `apr backup` CLI is the only piece deferred, not the snapshot itself. Sweep amendments to §2.7: - Status: "Recommended" → "Shipped (engine primitive)" with the `apr backup` CLI deferred to a follow-up PR (root cause: apr-cli's crates.io `pacha` 0.2.4 dep collides with the workspace `aprender-registry` lib name; separate dep-resolution PR). - Acceptance signals annotated with "(Met)" + test_file references. 100ms bound NOT adopted: SQLITE_BUSY retry windows on cold caches can dwarf it; FALSIFY-SNAPSHOT-002 enforces "writers continue, snapshot returns" with env-tunable APR_SNAPSHOT_BUDGET_MS budget (default 5000 ms, comfortable above plausible CI fluctuation). - New "Implementation deltas vs original sketch" subsection records: - umbrella `apr backup` deferred (with five-whys for why); - FALSIFY-SNAPSHOT-003 added (refuse-to-overwrite — original sketch left this implicit); - Object-store and HNSW snapshots out of v1 scope. Refs HELIX-IDEA-007, contracts/apr-registry-snapshot-v1.yaml. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

Five-whys: §2.2 "Status: Recommended" contradicts the merged inventory pipeline on PR #1605 commit e24f779. Contract apr-mcp-tool-inventory-v1 is ACTIVE; FALSIFY-INVENTORY-001/002/003 all ENFORCED. Three implementation deltas vs the original sketch need to be captured so future readers don't reach for the wrong patterns. Sweep amendments to §2.2: - Status: "Recommended" → "Shipped" (PR #1605, commit e24f779). - Acceptance signals annotated with "(Met)"; the third gate (compile-time uniqueness) noted as downgraded with a forward pointer to the deltas section. - Risk paragraph updated: no issues observed at merge time — McpToolEntry holds &'static str + fn pointers (trivially Send+Sync), OnceLock-cached ToolIndex is read-only post-init. - New "Implementation deltas vs original sketch" subsection records: 1. No proc-macro crate — declarative macro_rules! sufficient (skipping aprender-mcp-macros saves a workspace member). 2. Compile-time uniqueness downgraded to runtime panic in ToolIndex::from_inventory(). inventory::submit! emits valid linker sections even for duplicates; collision detection is inherently runtime. Mitigated by panicking from a path every AprMcpServer::new() hits. 3. Spec originally said 2 duplicated sites; actual was 3 (the dispatch_tool_call_with_sink match at server.rs:461-483 was the third). PR #1605 collapses both server.rs sites. Refs HELIX-IDEA-002, contracts/apr-mcp-tool-inventory-v1.yaml. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

…g note Five-whys: §6 falsification log only captured 2 corrections from the v0.1.0 round. PR #1605 generated 7 more measured-state corrections that future readers need to see; otherwise the same staleness will recur the next time someone consults §1.3. Sweep amendments to §6: - 7 new rows added covering: §1.3 MCP edit-count, §1.3 subtle direct-dep added, §1.3 inventory direct-dep added, §2.9 target crate corrected, §2.2 duplication-count corrected (2→3), §2.2 Gate 002 downgraded compile-time→runtime, §2.7 budget bound widened 100ms→5s. - Closing paragraph reframes v0.2.0 as post-implementation falsification: 8 distinct measured-state rows disagreed with code. Future authors of HELIX-IDEA-001/005/006/008 should expect the same drift. Sweep amendments to §4: - "no `inventory` usage" caveat updated to point at the §6 entry — the example bullet itself was a casualty of the drift it warned about. Refs HELIX-IDEA-002, HELIX-IDEA-007, HELIX-IDEA-009, PR #1605. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

Five-whys: - Why does §1.1 still say "four patterns"? v0.1.0 shipped with 4 ideas (001-004); the same-revision audit added 005-009 (per §6) but §1.1 wasn't updated. A reader scanning the abstract gets a misleading count before reaching §6's note. - Why does §1.3's tag legend need `[CHANGED v0.2.0]`? The previous legend only knew `[VERIFIED]` / `[CORRECTED]`. v0.2.0 introduced a third state — claim was right at draft time but PR #1605 changed the underlying code. Without an explicit tag, those entries blur with `[CORRECTED]` (which implies the original claim was wrong). Sweep amendments: - §1.1: "four patterns" → "nine patterns" with a parenthetical pointing at the §6 audit history. - §1.3: tag legend extended with `[CHANGED v0.2.0]` plus an explanatory paragraph that ties each such tag back to its §6 migration row. Refs HELIX-IDEA-001..009. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

Five-whys: §5 still pointed at server.rs:221-233 as "manual handler vec" — code that no longer exists. Reference list conflated "pre-implementation pattern motivation" with "live code paths"; PR #1605 changed the latter without updating the former. Sweep amendments to §5: - "aprender MCP server (manual handler vec)" → "aprender MCP tool registration (post-PR #1605)" pointing at `tools/registry.rs::ToolIndex::from_inventory()`. Pre-PR `server.rs:221-233` and `server.rs:461-483` named in passing as the sites it replaced (so the §1.3 + §6 narrative still resolves for someone reading §5 cold). - New row: apr-cli serve HTTP routers (with the explicit note that HELIX-IDEA-009 lives here, not in `aprender-serve`). - New row: apr-cli auth gate (`apr_cli::serve_auth::{AuthGate, layer, apply}`). - New row: aprender-registry snapshot (`Registry::snapshot` + `RegistryDb::vacuum_into`). - "aprender serve" qualified: "lib only — no router builders". Refs HELIX-IDEA-002, HELIX-IDEA-007, HELIX-IDEA-009, PR #1605. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

…tract Five-whys: previous revisions mentioned contracts in passing (§2.2/2.7/2.9 Status fields, §6 falsification log) but never named the methodology as a top-level claim. A reviewer scanning the spec without §6 context could mistake it for a feature wishlist and drift away from contract-first authoring on subsequent ideas. The methodology must be a load-bearing assertion, not a footnote. Sweep amendments: - Top-level metadata: new "Methodology:" line names "Design by Provable Contract" and points at §1.4. - Abstract: closing paragraph now explicitly invokes the discipline and forwards readers to the §1.4 audit table. - §1.4 (NEW): five-step contract chain (proposal → YAML → falsifier → integration test → re-falsification), explanation of why this is load-bearing for this spec specifically (helix-db is not contract-driven; we deliberately reframe), full audit table for HELIX-IDEA-002/007/009 binding each gate to its test_file and test_name, and reproduction commands (`pv validate` + `cargo test -p aprender-contracts`). - §1.4 forward obligations: names the four contract YAMLs that HELIX-IDEA-001/005/006/008 must produce, and pins the review policy: code without YAML / YAML without integration test / registry edit without §6 update → rejected at review. - Version 0.2.0 → 0.3.0 (significant addition). Refs HELIX-IDEA-001..009, contracts/apr-mcp-tool-inventory-v1.yaml, contracts/apr-registry-snapshot-v1.yaml, contracts/apr-serve-api-key-auth-v1.yaml, PR #1605. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

…r gate pre-auth Five-whys: §4's "Quality gates" bullet predated §1.4 and listed project-wide gates (coverage, fuzz, contract validation) as a flat list. After §1.4 made the contract chain load-bearing, §4 needed to defer to §1.4 for the chain itself and reserve its own bullet for project-wide gates only — otherwise readers see two slightly different lists and pick whichever was easier to skim. §1.4 "Forward obligations" listed the future contract YAML files but didn't cross-link to the per-§2.x pre-authored gate tables added in the previous two commits. Without the cross-link, an implementation PR author has to scan §2.x manually to find the gate IDs. Top-level Status field still said "4 recommended" without distinguishing the 3 with pre-authored gates from the 1 (008) that deliberately doesn't yet have any. Sweep amendments: - Top-level Status: split "4 recommended" into "3 with pre-authored gates" + "1 without gates (008, speculative pending pain point)". - Top-level Methodology line: extended to note pre-authored gates for unshipped recommended ideas. - §1.4 Forward obligations: replaced flat YAML-name list with a table that cross-links each contract YAML to its pre-authored gate count and IDs in §2.x. - §4 Quality gates: now defers to §1.4 for the contract chain and reserves its own scope for project-wide gates (coverage, clippy, fuzz). Notes that the auth header parser was deemed sufficient via proptest in auth.rs::tests rather than a full fuzz target — PR #1605 evidence. - Version 0.3.0 → 0.4.0. Refs HELIX-IDEA-001, HELIX-IDEA-005, HELIX-IDEA-006, HELIX-IDEA-008. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

… ship Five-whys: HELIX-IDEA-001 shipped end-to-end (Phases 1-4) on PR #1605, but several spec sections still spoke as if it were unshipped or partially shipped: - §1.4 audit-table heading still said "(HELIX-IDEA-002/007/009)". - §1.4 Forward obligations table still listed 001 alongside 005/006/008. - Abstract pointer to §1.4 still cited "002/007/009". - §6 falsification log stopped at v0.2.0 — no entries for the v0.5.0-v0.8.0 round of measured-state corrections from shipping HELIX-IDEA-001. - Top-level Status didn't surface the total ENFORCED-gate count. Sweep amendments: - §1.4 audit-table heading: "(002/007/009)" → "(001/002/007/009)". - Abstract: same correction. - §1.4 Forward obligations: 001 row removed (it's no longer forward); preface paragraph rewritten to point at the audit table; closing paragraph adds an "Empirical observation" note summarizing the v0.5.0-v0.8.0 deltas (substrate, threshold, semantics) and forwarding to §6. - §6 log: 6 new rows for the v0.5.0-v0.8.0 round — - v0.5.0 substrate: bincode whole-graph instead of Arrow IPC / redb. - v0.5.0 semantics: whole-graph round-trip, NOT "rebuild on open" (RNG-non-determinism would have failed gate 001). - v0.6.0 Gate 002: temp + fsync + rename pattern + structural source-grep assertions. - v0.7.0 Gate 003: 0.95 → 0.90 threshold relaxation (CI-fixture scope; production opt-in via APR_HNSW_BENCH_CORPUS). - v0.7.0 Gate 003: harness self-consistency companion test. - v0.8.0 Gate 004: open-alone companion test for unambiguous regression diagnosis. - §6 closing paragraph: extended to frame the v0.5.0-v0.8.0 round as the second post-implementation falsification, observe that pre-authored gates *did* survive contact with code at the scope/intent level but specifics drifted, and assert this is the durable kaizen pattern future implementations will repeat. - Top-level Status: "4 of 9 fully shipped" line now spells out the ENFORCED gate count (13 = 4+3+3+3) so readers see the chain's cumulative scale at a glance. - Version 0.8.0 → 0.9.0. The §6 log now has 15 rows total (2 from Draft v0.1, 7 from v0.2.0 round, 6 from v0.5.0-v0.8.0 round) and the spec records 28 FALSIFY-* references across 4 shipped + 2 pre-authored contracts. Refs HELIX-IDEA-001 (FULL), Phases 1-4 commits 60f7ac6, 83894f1, c536f82, a792126. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>