Skip to content

add dogfood skill for agent-driven exploratory qa#538

Merged
ctate merged 7 commits into
mainfrom
ctate/dogfood
Feb 24, 2026
Merged

add dogfood skill for agent-driven exploratory qa#538
ctate merged 7 commits into
mainfrom
ctate/dogfood

Conversation

@ctate

@ctate ctate commented Feb 24, 2026

Copy link
Copy Markdown
Collaborator
  • Adds a dogfood skill that agents can use to systematically test any web app and produce a structured report with reproduction evidence (screenshots, videos, step-by-step repro steps) for every issue found
  • Adds structural tests (test/e2e/dogfood.test.ts) validating skill file integrity and cross-consistency (runs in CI, no API key needed)
  • Adds an E2E eval (test/e2e/dogfood.eval.ts) that runs the full dogfood workflow against a local HTML fixture with intentional bugs, using the Claude Agent SDK and AI Gateway. Outputs report, screenshots, videos, and chat log to test/e2e/.dogfood-output/ for inspection.

@vercel

vercel Bot commented Feb 24, 2026

Copy link
Copy Markdown
Contributor

The latest updates on your projects. Learn more about Vercel for GitHub.

Project Deployment Actions Updated (UTC)
agent-browser Ready Ready Preview, Comment Feb 24, 2026 5:29pm

@ctate ctate merged commit c0e2b80 into main Feb 24, 2026
17 checks passed
timothywangdev pushed a commit to timothywangdev/banana-browser that referenced this pull request Mar 17, 2026
* dogfood skill

* evals

* haiku

* fixes

* caching

* fixes

* don't use npx
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant