docs: agent secret gateway — holistic architecture + adversarial tests by bglusman · Pull Request #16 · bglusman/calciforge

bglusman · 2026-04-24T04:07:43Z

Summary

Draft RFC pulling together scattered secret/gateway notes into one doc
designed to be falsified:

Threat model stated explicitly (§1)
Post-consolidation component inventory (§2) — one gateway, not two
Data flow for outbound substitution (§3)
Agent bootstrap via zeroclawed-MCP (discovery only, no `get_secret`) (§4)
User input via `!secure` chat commands (§5)
Installer audit table: which components wired today vs pending (§6)
T1–T10 adversarial test plan mapped to claims they try to break (§7)
Known unknowns (§8), sequencing (§9), explicit skepticism log (§10)

Each section ends with "what could go wrong" so the planned tests have
concrete targets. Explicitly in draft form — feedback on the attack
surfaces you think I'm missing is the most valuable thing to get now,
before code lands.

Test plan

Markdown renders OK
Human review of claims in §3 (substitution surfaces we cover/don't)
Human review of §10 skepticism items — are there others?

🤖 Generated with Claude Code

… test plan Consolidates scattered notes (security-gateway.md, vault-integration-plan.md, model-gateway-primitives.md) into one falsification-friendly document: - States the threat model and goal explicitly - Inventories post-consolidation components (one gateway, not two) - Walks the outbound substitution flow end-to-end - Describes agent bootstrap via zeroclawed-MCP (discovery only, no get_secret surface — by design) - Describes user-side input via !secure chat commands - Audits install.sh vs the full vision (table of gaps per component) - Sketches 10 adversarial tests (T1–T10) mapped to the claims they try to falsify - Ends with an explicit skepticism log of claims the author is not yet sure about Written to be broken. Each §ends with "what could go wrong" so future tests have a concrete target. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Copilot

Pull request overview

Adds a draft RFC consolidating the “agent secret gateway” architecture into a single, falsifiable design doc, including an explicit threat model and an adversarial test plan to validate key security claims.

Changes:

Introduces a holistic component inventory (fnox, security-proxy, clashd, zeroclawed, planned zeroclawed-MCP) and intended consolidation plan.
Documents the intended outbound substitution data flow and the !secure user-input flows.
Adds a T1–T10 adversarial test plan mapped back to specific claims in the document.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Adds §11–§15 to the draft RFC, broadening from code-correctness to user-story and indirect threat models per review: - §11 Indirect threat models (10 scenarios): substituted-value exfil by upstream, upstream logging, agent-to-agent exfil, pre-substitution artifacts, memory persistence, error-message side-channels, indirect disclosure bypass (chaos-paper #3), adversarial third-party messages, name-leakage as signal, and a mapping of all 8 chaos lessons to our secret-gateway risk surface. - §12 User story failures (10 scenarios): first-run UX, .env migration, key rotation, "I need the value" (HMAC/JWT signing), non-HTTP protocols, blocked legitimate requests, cross-machine secret sync, mobile-without-LAN `!secure request`, request preview/dry-run, and perceived complexity cost. - §13 Legitimate cases we struggle with: HMAC/JWT, binary bodies, streams, WebSocket sessions, OAuth device-flow, mTLS certs, per-user per-request secrets. - §14 Explicitly out of scope: host compromise, fnox root compromise, user-side misuse, compromised model weights, supply chain, timing. - §15 Research pointers: 1Password op:// refs, Doppler, AWS Secrets Manager per-secret destination binding, Vault response-wrapping, SPIFFE/SPIRE, chaos-engineering mindset. Key architectural implication from §11.1 + §11.8: substitution must be bound per-secret to per-destination, and eligibility to substitute must be tagged at the ref site in code the agent wrote, not blindly at every outbound request. Spike item before committing substantial code to §29 (substitution implementation). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Three polish fixes from the Copilot review: - §3 substitution scope — the original said "unsupported content-types pass through unchanged, log warning", which creates a §11.8-shaped bypass (agent claims multipart/form-data with `{{secret:` in the body). Rewrote the bullet: a cheap raw-bytes scan runs FIRST; if the bytes contain `{{secret:` we fail-closed. Only bodies with no ref-shaped content pass through. - §6 installer audit table — replaced "✅ (this PR)" with concrete branch/PR references ("feat/fnox-integration, PR #15") so the attribution stays correct when read outside this PR. - §§5/7 — path consistency on the `security-gateway.md` reference; use `docs/security-gateway.md` everywhere so the link is unambiguous. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

bglusman · 2026-04-25T18:37:43Z

Subsumed by #44 (squashed to 9ed51fbc on main). All commits from this branch are present in the squash. Closing as redundant rather than merging again.

Copilot AI review requested due to automatic review settings April 24, 2026 04:07

Copilot AI reviewed Apr 24, 2026

View reviewed changes

Comment thread docs/rfcs/agent-secret-gateway.md Outdated

Comment thread docs/rfcs/agent-secret-gateway.md

Comment thread docs/rfcs/agent-secret-gateway.md Outdated

bglusman and others added 2 commits April 24, 2026 00:15

Copilot AI review requested due to automatic review settings April 24, 2026 14:36

Copilot started reviewing on behalf of bglusman April 24, 2026 14:36 View session

Copilot AI reviewed Apr 24, 2026

View reviewed changes

Comment thread docs/rfcs/agent-secret-gateway.md

Comment thread docs/rfcs/agent-secret-gateway.md

Comment thread docs/rfcs/agent-secret-gateway.md

bglusman marked this pull request as ready for review April 24, 2026 16:06

bglusman closed this Apr 25, 2026

bglusman deleted the docs/agent-secret-gateway branch May 1, 2026 17:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: agent secret gateway — holistic architecture + adversarial tests#16

docs: agent secret gateway — holistic architecture + adversarial tests#16
bglusman wants to merge 3 commits intomainfrom
docs/agent-secret-gateway

bglusman commented Apr 24, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

bglusman commented Apr 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

bglusman commented Apr 24, 2026

Summary

Test plan

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

bglusman commented Apr 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants