Problem
Two architecture issues identified in docs/ARCHITECTURE-IMPROVEMENTS.md:
- tests/run-all.js uses a hardcoded list of test files. New tests are never run unless someone manually updates the list.
- Agent/skill/command counts drift between README.md, AGENTS.md, and actual filesystem contents.
Scope
- Replace the hardcoded test list in tests/run-all.js with glob-based discovery (**/*.test.js under tests/)
- Add a catalog sync script (e.g. scripts/ci/catalog.js) that scans agents/*.md, commands/*.md, skills/*/SKILL.md and verifies that the counts match the documentation
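A minimal sketch of what the discovery half could look like, assuming the test files are standalone Node scripts that exit non-zero on failure (the issue does not say how tests/run-all.js currently launches them). The names discoverTests and runAll are hypothetical, and a plain recursive directory walk stands in for a glob library so that no dependency is needed:

```javascript
'use strict';
const fs = require('fs');
const path = require('path');
const { spawnSync } = require('child_process');

// Recursively collect every file ending in ".test.js" under rootDir,
// which is equivalent to the pattern **/*.test.js.
// The result is sorted so the run order is deterministic.
function discoverTests(rootDir) {
  const found = [];
  for (const entry of fs.readdirSync(rootDir, { withFileTypes: true })) {
    const fullPath = path.join(rootDir, entry.name);
    if (entry.isDirectory()) {
      if (entry.name === 'node_modules') continue; // never descend into dependencies
      found.push(...discoverTests(fullPath));
    } else if (entry.isFile() && entry.name.endsWith('.test.js')) {
      found.push(fullPath);
    }
  }
  return found.sort();
}

// ASSUMPTION: each test file is run as its own Node process and
// signals failure through a non-zero exit code.
function runAll(rootDir) {
  let failures = 0;
  for (const file of discoverTests(rootDir)) {
    const result = spawnSync(process.execPath, [file], { stdio: 'inherit' });
    if (result.status !== 0) failures += 1;
  }
  return failures;
}
```

With this shape, tests/run-all.js could end with something like process.exitCode = runAll(__dirname) > 0 ? 1 : 0, so a newly added foo.test.js is picked up without touching the runner.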
Acceptance Criteria
- New test files are automatically discovered and run
- CI validates documentation counts match filesystem reality
- Counts in README.md and AGENTS.md are accurate
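The count validation in these criteria could be sketched roughly as follows. This is an illustration only: countCatalog and findMismatches are hypothetical names, and the sketch assumes the documentation states counts in the form "<number> agents", "<number> commands", "<number> skills", which may not match the actual wording in README.md or AGENTS.md.

```javascript
'use strict';
const fs = require('fs');
const path = require('path');

// Count catalog entries on disk, mirroring the patterns named in the scope:
// agents/*.md, commands/*.md, skills/*/SKILL.md.
function countCatalog(repoRoot) {
  const countMarkdown = (dir) => {
    const full = path.join(repoRoot, dir);
    if (!fs.existsSync(full)) return 0;
    return fs.readdirSync(full, { withFileTypes: true })
      .filter((e) => e.isFile() && e.name.endsWith('.md')).length;
  };
  const skillsDir = path.join(repoRoot, 'skills');
  const skills = fs.existsSync(skillsDir)
    ? fs.readdirSync(skillsDir, { withFileTypes: true })
        .filter((e) => e.isDirectory() &&
          fs.existsSync(path.join(skillsDir, e.name, 'SKILL.md'))).length
    : 0;
  return {
    agents: countMarkdown('agents'),
    commands: countMarkdown('commands'),
    skills,
  };
}

// ASSUMPTION: documented counts appear as "<number> <kind>", e.g. "12 agents".
// Returns one record per documented number that disagrees with the filesystem.
function findMismatches(docText, actual) {
  const mismatches = [];
  for (const [kind, count] of Object.entries(actual)) {
    const pattern = new RegExp(`(\\d+)\\s+${kind}\\b`, 'gi');
    for (const match of docText.matchAll(pattern)) {
      const documented = Number(match[1]);
      if (documented !== count) {
        mismatches.push({ kind, documented, actual: count });
      }
    }
  }
  return mismatches;
}
```

A CI step could read README.md and AGENTS.md, pass each file's text to findMismatches together with the result of countCatalog, and exit non-zero if any mismatch is returned.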