Ramp

Accessibility audit → fix → PR. axe detects; Ramp understands and fixes.

Lighthouse and axe-core find WCAG violations and stop at a report. Ramp closes the loop: it audits a real rendered page, reasons about the relevant WCAG criterion, writes the fix, verifies it with axe-core, and opens a merge-ready pull request. It also catches semantic issues that axe cannot see, such as alt text that just says "image" or links that say "click here".
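
The semantic gap can be illustrated with a small heuristic. This is a minimal sketch, not Ramp's semantic review (which this README does not show): the types, the `findSemanticIssues` function, and the word lists are all hypothetical. It only demonstrates why a name that is present, and so passes a presence check, can still be meaningless.

```typescript
// Hypothetical heuristic: flag accessible names that exist but carry no meaning.
// The word lists are illustrative assumptions, not Ramp's actual rules.
type ElementKind = "img" | "link" | "button";

interface NamedElement {
  kind: ElementKind;
  name: string; // alt text or accessible name as rendered
}

interface SemanticIssue extends NamedElement {
  reason: string;
}

const GENERIC_NAMES: Record<ElementKind, string[]> = {
  img: ["image", "photo", "picture", "img", "graphic"],
  link: ["click here", "here", "read more", "more", "link"],
  button: ["button", "click", "click here"],
};

// Lowercase, drop punctuation, and collapse whitespace so "Click here!" matches.
function normalize(text: string): string {
  return text.toLowerCase().replace(/[^a-z0-9\s]/g, "").replace(/\s+/g, " ").trim();
}

function findSemanticIssues(elements: NamedElement[]): SemanticIssue[] {
  const issues: SemanticIssue[] = [];
  for (const el of elements) {
    const raw = el.name.trim();
    if (raw === "") continue; // empty names are out of scope for this sketch
    if (el.kind === "img" && /\.(png|jpe?g|gif|svg|webp)$/i.test(raw)) {
      issues.push({ ...el, reason: "alt text is a file name" });
      continue;
    }
    if (GENERIC_NAMES[el.kind].includes(normalize(raw))) {
      issues.push({ ...el, reason: `"${raw}" does not describe the ${el.kind}` });
    }
  }
  return issues;
}

const sample: NamedElement[] = [
  { kind: "img", name: "image" },
  { kind: "img", name: "Bar chart of quarterly revenue" },
  { kind: "link", name: "Click here!" },
  { kind: "img", name: "IMG_1234.jpg" },
];
console.log(findSemanticIssues(sample).length); // 3
```

A fixed word list is brittle, which is one reason to hand this kind of judgment to a model that can read the surrounding page context.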

axe: 0 violations · Ramp: 12 semantic issues axe can't see (across 3 demo pages).

Three pillars

| Pillar | Package | What it does |
| --- | --- | --- |
| A11y-Bench | `packages/bench` | 51 ground-truth tasks mined from real merged a11y PRs; scores naked LLM vs harness on recall and precision, split by `html-live` / `source-code`. |
| Harness | `packages/harness` | Drives a headless page through Playwright + axe-core + accessibility tree + screen-reader simulation + contrast/focus inspectors + semantic review; an LLM agent reasons over the evidence (`runAudit`). |
| Auto-fix loop | `packages/control-plane` | Sandbox checkout → audit → Claude Code fix → axe verify (before/after score) → GitHub PR. Sentry monitors the loop. |
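
One of the harness inspectors listed above checks contrast. As a rough illustration of what such a check measures, the sketch below computes the WCAG 2.x contrast ratio from two hex colors. The formula is the one defined by WCAG; the function names (`contrastRatio`, `meetsAA`) are hypothetical, and Ramp's inspector may be implemented differently.

```typescript
// WCAG 2.x contrast ratio between two sRGB colors given as 6-digit hex strings.
// Function names are illustrative; they are not taken from packages/harness.
type Rgb = [number, number, number];

function parseHex(hex: string): Rgb {
  const digits = hex.trim().replace(/^#/, "");
  if (!/^[0-9a-fA-F]{6}$/.test(digits)) {
    throw new Error(`expected a 6-digit hex color, got "${hex}"`);
  }
  const n = parseInt(digits, 16);
  return [(n >> 16) & 0xff, (n >> 8) & 0xff, n & 0xff];
}

// Relative luminance: linearize each sRGB channel, then take the weighted sum.
function relativeLuminance([r, g, b]: Rgb): number {
  const linearize = (channel: number): number => {
    const c = channel / 255;
    return c <= 0.03928 ? c / 12.92 : Math.pow((c + 0.055) / 1.055, 2.4);
  };
  return 0.2126 * linearize(r) + 0.7152 * linearize(g) + 0.0722 * linearize(b);
}

// Ratio of the lighter to the darker luminance, each offset by 0.05.
function contrastRatio(foreground: string, background: string): number {
  const a = relativeLuminance(parseHex(foreground));
  const b = relativeLuminance(parseHex(background));
  return (Math.max(a, b) + 0.05) / (Math.min(a, b) + 0.05);
}

// Level AA requires 4.5:1 for normal text and 3:1 for large text.
function meetsAA(foreground: string, background: string, largeText = false): boolean {
  return contrastRatio(foreground, background) >= (largeText ? 3 : 4.5);
}

console.log(contrastRatio("#000000", "#ffffff").toFixed(2)); // 21.00
console.log(meetsAA("#777777", "#ffffff")); // false (about 4.48:1)
console.log(meetsAA("#767676", "#ffffff")); // true (about 4.54:1)
```

The hard part on a rendered page is not this arithmetic. It is resolving the effective foreground and background colors through transparency, gradients, and images, which is why the check runs against a real browser.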

Real fix PRs (verified, before → after)

| Repo | Fix | Score | PR |
| --- | --- | --- | --- |
| `bad.html` (fixture) | alt + contrast + button names | 60 → 96 | yangzhang75/Ramp#7 |
| semantic (fixture) | meaningless alt/link/button names axe passes | semantic 5 → 0 | yangzhang75/Ramp#10 |
| `aigov-ops…` (real OSS) | landmarks + skip link + `<main>` | 92 → 100 | PR#1 |
| `caelaria` (real OSS) | unlabeled `<select>` controls | 84 → 92 | PR#1 |
| `Whatifarcade` (real OSS) | form label + `<main>` landmark | 96 → 100 | PR#1 |

PRs are opened on forks, so Ramp audits and fixes the real page without spamming upstream maintainers.
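
The before → after comparison can be thought of as a gate in front of the PR step. The sketch below is an assumption about how such a gate might look, not Ramp's verify code. The `AuditSnapshot` shape, the `verifyFix` function, and the rule that a fix must raise the score without introducing new violations are all hypothetical, and the scoring formula itself is not reproduced here.

```typescript
// Hypothetical verify gate: accept a fix only if the score improves
// and no axe rule that passed before is failing afterwards.
interface AuditSnapshot {
  score: number; // 0 to 100, higher is better (formula not shown in this README)
  violationIds: string[]; // axe rule ids still failing, e.g. "image-alt"
}

interface VerifyResult {
  verified: boolean;
  delta: number;
  fixed: string[];
  introduced: string[];
}

function verifyFix(before: AuditSnapshot, after: AuditSnapshot): VerifyResult {
  const fixed = before.violationIds.filter((id) => !after.violationIds.includes(id));
  const introduced = after.violationIds.filter((id) => !before.violationIds.includes(id));
  const delta = after.score - before.score;
  return { verified: delta > 0 && introduced.length === 0, delta, fixed, introduced };
}

// Illustrative values only; they are not taken from the table above.
const result = verifyFix(
  { score: 70, violationIds: ["image-alt", "label"] },
  { score: 90, violationIds: ["label"] },
);
console.log(result.verified, result.delta, result.fixed); // true 20 [ 'image-alt' ]
```

Rejecting fixes that introduce new violations keeps a higher score from hiding a regression elsewhere on the page.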

Architecture

flowchart LR
  USER["Frontend repo / URL"] --> API

  subgraph CP["Control Plane · packages/control-plane"]
    API["node:http API<br/>POST /audit · /benchmark"]
    DB[("Drizzle + SQLite<br/>runs · findings · scores")]
    API --- DB
  end

  subgraph H["Harness · packages/harness"]
    PW["Playwright page"]
    TOOLS["axe-core · a11y tree<br/>screen-reader · contrast<br/>focus-order · semantic review"]
    AGENT(["LLM audit agent<br/>runAudit · gpt-4o-mini"])
    PW --> TOOLS --> AGENT
  end

  subgraph FL["Fix Loop · packages/control-plane"]
    SB["sandbox checkout<br/>fork @ base commit"]
    FIX["Claude Code fix<br/>claude -p"]
    VER["axe verify<br/>before → after score"]
    PR["GitHub PR · Octokit<br/>opened on fork"]
    SB --> FIX --> VER --> PR
  end

  API --> PW
  AGENT -->|"findings + before score"| DB
  AGENT --> SB
  VER -->|"after score"| DB
  PR --> GH["GitHub<br/>merge-ready PR"]
  SENTRY{{"Sentry monitoring"}} -.-> CP
  SENTRY -.-> H
  SENTRY -.-> FL
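
The flowchart's main path (audit → sandbox → fix → verify → PR) is a sequence in which each stage should run only if the previous one succeeded. The following is a minimal sketch of that fail-fast ordering under stated assumptions: `Stage`, `runFixLoop`, and the stage names are hypothetical, and the stages are synchronous stubs, whereas real stages that drive a browser, a CLI, and the GitHub API would be asynchronous.

```typescript
// Hypothetical fail-fast runner: stop at the first stage that fails,
// so a PR is never opened for a fix that did not verify.
type StageName = "audit" | "sandbox" | "fix" | "verify" | "pull-request";

interface StageResult {
  ok: boolean;
  detail: string;
}

interface Stage {
  name: StageName;
  run: () => StageResult;
}

interface LoopReport {
  completed: StageName[];
  failedAt: StageName | null;
  detail: string;
}

function runFixLoop(stages: Stage[]): LoopReport {
  const completed: StageName[] = [];
  for (const stage of stages) {
    let result: StageResult;
    try {
      result = stage.run();
    } catch (err) {
      // Treat a thrown error like a failed stage so the caller gets one report shape.
      result = { ok: false, detail: String(err) };
    }
    if (!result.ok) {
      return { completed, failedAt: stage.name, detail: result.detail };
    }
    completed.push(stage.name);
  }
  return { completed, failedAt: null, detail: "all stages passed" };
}

// Stub stages standing in for the real tools.
const report = runFixLoop([
  { name: "audit", run: () => ({ ok: true, detail: "findings recorded" }) },
  { name: "sandbox", run: () => ({ ok: true, detail: "fork checked out" }) },
  { name: "fix", run: () => ({ ok: true, detail: "patch written" }) },
  { name: "verify", run: () => ({ ok: false, detail: "score did not improve" }) },
  { name: "pull-request", run: () => ({ ok: true, detail: "PR opened" }) },
]);
console.log(report.failedAt, report.completed.length); // verify 3
```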

Quick start

pnpm install

# 1. Self-contained demo — repair bad.html and score it (free, no API keys)
pnpm --filter @ramp/control-plane fix:demo                    # 60 → 96

# 2. Detection benchmark — naked LLM vs harness, recall + precision
pnpm --filter @ramp/bench score:fixtures                      # needs OPENAI_API_KEY
pnpm --filter @ramp/scoring leaderboard

# 3. Real-repo fix loop — fork → audit → fix → verify → open PR
TASK_ID=ramp-048 pnpm --filter @ramp/control-plane fix:repo   # needs OPENAI_API_KEY + GITHUB_TOKEN

# 4. Web UI (landing + demo dashboard)
pnpm dev:control-plane     # :8787 — API (optional, for Live Run / benchmark tabs)
pnpm dev:dashboard         # :5173 — Home tab + axe vs Ramp · Auto-fix · Scores · …

Tech stack

Playwright · axe-core · Vercel AI SDK (`ai` + `@ai-sdk/openai`, `gpt-4o-mini`) · Claude Code (`claude -p`, headless fixer) · Drizzle ORM + SQLite · React + Vite · Sentry · Octokit · `node:http` · TypeScript + pnpm workspaces.

Monorepo layout

| Path | Role |
| --- | --- |
| `packages/shared` | Types · Drizzle schema · DB client |
| `packages/harness` | Audit tools + `runAudit` agent |
| `packages/scoring` | Recall/precision metrics + leaderboard |
| `packages/bench` | A11y-Bench tasks + miners/curators |
| `packages/control-plane` | HTTP API + fix loop + GitHub PRs |
| `apps/dashboard` | React + Vite site: Home (product landing) + interactive demo tabs |
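
For readers unfamiliar with the two metrics that `packages/scoring` reports, the sketch below computes recall and precision for one task. It is an illustration under assumptions, not the package's code: `scoreFindings` is a hypothetical name, findings are matched by an opaque string key, and the empty-set conventions are a choice made for this example.

```typescript
// Hypothetical scorer: compare reported findings against ground truth by key.
interface DetectionScore {
  truePositives: number;
  falsePositives: number;
  falseNegatives: number;
  recall: number; // share of ground-truth issues that were found
  precision: number; // share of reported issues that were real
}

function scoreFindings(expected: string[], reported: string[]): DetectionScore {
  // Sets deduplicate keys so a finding reported twice is counted once.
  const truth = new Set(expected);
  const found = new Set(reported);
  let truePositives = 0;
  found.forEach((key) => {
    if (truth.has(key)) truePositives += 1;
  });
  return {
    truePositives,
    falsePositives: found.size - truePositives,
    falseNegatives: truth.size - truePositives,
    // Convention for this sketch: an empty denominator scores 1.
    recall: truth.size === 0 ? 1 : truePositives / truth.size,
    precision: found.size === 0 ? 1 : truePositives / found.size,
  };
}

// Four known issues; the auditor reports two of them plus one false alarm.
const taskScore = scoreFindings(["a", "b", "c", "d"], ["a", "b", "x"]);
console.log(taskScore.recall, taskScore.precision.toFixed(2)); // 0.5 0.67
```

Reporting both numbers matters for an auditor that feeds a fix loop: low recall leaves real issues unfixed, while low precision produces PRs that change things that were never broken.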

Detect → Score → Fix → Validate → Pull Request. The artifact isn't a report; it's a reviewable PR.

