The Soul Problem — EQ Apologies Data Collection

Expert-labeled benchmark of LLM responses to apology-writing prompts.

Quickstart

cp .env.example .env.local and fill in Supabase + Anthropic keys
Apply supabase/migrations/0001_init.sql in the Supabase SQL editor
npm install
npm run seed — populates scenarios, screener, and model responses (one-time, ~80 Claude calls)
Insert a test expert in Supabase: insert into experts (invite_token) values ('test-token-abc');
npm run dev → visit /invite/test-token-abc

Verify setup

npx tsx scripts/verify-supabase.ts prints env + schema status.

Routes

Path	Purpose
`/invite/[token]`	Expert consent
`/onboarding`	Name + background
`/screener`	EQ-Bench intensity test (5 items, MAD-graded)
`/project`	Labeling landing — shows assigned scenarios + progress
`/project/scenario/[i]`	Per-scenario labeling (blinded A–D, 3-dim Likert)
`/done`	Completion
`/admin`	Dashboard + export (basic auth via `ADMIN_PASSWORD`)
`/admin/export.jsonl`	Full dataset download

Exporting data

Visit /admin/export.jsonl authenticated as admin (any username, password = ADMIN_PASSWORD). One JSON object per (expert, response) row.

Assignment model

Each scenario targets 3 graders. When an expert passes the screener, they're assigned the 10 scenarios with the fewest current graders (tie-broken by scenario id). This naturally balances coverage as experts arrive.

Rubric

Each response is rated on three 1-5 Likert dimensions:

Accountability — takes responsibility without deflection or JADE
Specificity — names the actual transgression and its impact
Warmth — honest emotional register; not robotic or saccharine

Plus optional free-text: "What would make this apology better?"

Ethics

Model identity is blinded per expert per scenario
Explicit consent before labeling; name field stripped from public export
Rubric encodes one Western-therapeutic model of "good apology" — documented as a limitation
RLS enabled on every table; only service_role (server-side) can read/write

Scripts

npm run dev / build / start — standard Next
npm test — vitest (12 tests covering scoring, blinding, assignments, export)
npm run seed — scenarios + screener + responses
npx tsx scripts/verify-supabase.ts — env + schema check

Stack

Next.js 16 App Router · TypeScript · Tailwind · Supabase Postgres · @supabase/supabase-js · zod · @anthropic-ai/sdk · vitest · Vercel

Name		Name	Last commit message	Last commit date
Latest commit History 93 Commits
app		app
data		data
grief-rag		grief-rag
lib		lib
prompts		prompts
public		public
scripts		scripts
supabase/migrations		supabase/migrations
test_results		test_results
tests		tests
.env 2.example		.env 2.example
.env.example		.env.example
.gitignore		.gitignore
AGENTS 2.md		AGENTS 2.md
AGENTS.md		AGENTS.md
CLAUDE 2.md		CLAUDE 2.md
CLAUDE.md		CLAUDE.md
README 2.md		README 2.md
README.md		README.md
eslint.config 2.mjs		eslint.config 2.mjs
eslint.config.mjs		eslint.config.mjs
middleware 2.ts		middleware 2.ts
middleware.ts		middleware.ts
next-env.d 2.ts		next-env.d 2.ts
next-env.d.ts		next-env.d.ts
next.config 2.ts		next.config 2.ts
next.config.ts		next.config.ts
package 2.json		package 2.json
package-lock 2.json		package-lock 2.json
package-lock.json		package-lock.json
package.json		package.json
postcss.config 2.mjs		postcss.config 2.mjs
postcss.config.mjs		postcss.config.mjs
tsconfig 2.json		tsconfig 2.json
tsconfig 2.tsbuildinfo		tsconfig 2.tsbuildinfo
tsconfig.json		tsconfig.json
tsconfig.tsbuildinfo		tsconfig.tsbuildinfo
vercel.ts		vercel.ts
vitest.config 2.ts		vitest.config 2.ts
vitest.config.ts		vitest.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The Soul Problem — EQ Apologies Data Collection

Quickstart

Verify setup

Routes

Exporting data

Assignment model

Rubric

Ethics

Scripts

Stack

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

The Soul Problem — EQ Apologies Data Collection

Quickstart

Verify setup

Routes

Exporting data

Assignment model

Rubric

Ethics

Scripts

Stack

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages