🤟 Phone With Hand

Berkeley AI Hackathon 2026 — Accessibility bridge for Deaf / ASL users on phone calls.

Prerequisites

Node.js 18.18+ (the minimum supported by Next.js 15; the project was developed with Node 24)
Chrome (recommended; MediaPipe WASM and WebRTC work best there)
Webcam connected and accessible to the browser

Install & Run

# 1. Install dependencies
npm install

# 2. Start the dev server
npm run dev

# 3. Open in Chrome (`open` is the macOS command; on other systems, paste the URL into Chrome)
open http://localhost:3000

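That's it. No API keys and no backend are needed: recognition runs entirely in the browser. The MediaPipe WASM runtime and gesture model are downloaded from public CDNs when the camera view loads, so a network connection is required.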

Granting Camera Permission in Chrome

1. Open http://localhost:3000/demo
2. Chrome shows a camera permission prompt. Click Allow.
3. If you accidentally denied it: click the camera icon (🎥) in the address bar → select Allow → refresh.

Pages

| URL | Description |
| --- | --- |
| `http://localhost:3000` | Home page: contacts and the Train signs button |
| `http://localhost:3000/train` | Sign trainer: teach the app custom ASL handshapes |
| `http://localhost:3000/call/dr-smith` | Scripted demo call |
| `http://localhost:3000/call/testing-call` | Testing Call: sign playground (pretrained gestures work out of the box) |
| `http://localhost:3000/demo` | Redirects to the wired call route |

Training your own signs (`/train`)

The trainer is browser-only: your webcam frames and the trained model never leave the device.

1. Open http://localhost:3000/train and allow camera access.
2. Pick a trainable sign from the Vocabulary list on the right. The 7 pretrained gestures are marked and need no training; MediaPipe recognises them directly.
3. Make the handshape and hold the Record button (or press Space) to grab ~30 frames. Vary the angle and distance slightly for robustness. The per-label sample count updates live.
4. Watch Live prediction. It shows what the current model thinks your hand is and turns green when it matches the selected sign.
5. Use Clear (trash icon / "Clear …" button) to redo a label, or Clear all to start over.
6. Export downloads the model as JSON; Import loads one back. Trained signs immediately drive sign→speech in the call screens.

Where the model is stored

The model is kept in localStorage under the key `pwh.signModel.v1`, so it survives page refreshes and browser restarts.
Export writes it to a file for backup or for sharing between machines; Import restores it.
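
A minimal sketch of what persistence under this key could look like. Only the key `pwh.signModel.v1` comes from this README; the `SerializedModel` shape and the `saveModel`, `loadModel`, and `createMemoryStore` helpers are hypothetical names used for illustration and may not match `lib/signStore.ts`. The store is passed in so the same code works with `window.localStorage` in the browser or with an in-memory stand-in elsewhere.

```typescript
// Hypothetical on-disk shape: label -> list of feature vectors.
type SerializedModel = { version: 1; samples: Record<string, number[][]> };

// The subset of the Web Storage API this sketch relies on.
interface KeyValueStore {
  getItem(key: string): string | null;
  setItem(key: string, value: string): void;
}

const MODEL_KEY = "pwh.signModel.v1"; // key named in this README

function saveModel(store: KeyValueStore, model: SerializedModel): void {
  store.setItem(MODEL_KEY, JSON.stringify(model));
}

function loadModel(store: KeyValueStore): SerializedModel | null {
  const raw = store.getItem(MODEL_KEY);
  if (raw === null) return null; // nothing trained yet
  try {
    const parsed = JSON.parse(raw) as SerializedModel;
    return parsed.version === 1 ? parsed : null;
  } catch {
    return null; // corrupted entry: behave as if nothing was stored
  }
}

// In-memory stand-in for localStorage (assumption: used only outside the browser).
function createMemoryStore(): KeyValueStore {
  const data: Record<string, string> = {};
  return {
    getItem: (key) => (key in data ? data[key] : null),
    setItem: (key, value) => {
      data[key] = value;
    },
  };
}
```

In the browser, `saveModel(window.localStorage, model)` would write to the same key the app reads on startup, assuming the stored shape matches.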

How recognition works (classifier-agnostic)

components/HandTracker.tsx owns the camera + MediaPipe Hands pipeline and emits, per frame, the 21 hand landmarks and MediaPipe's pretrained gesture. Landmarks are normalized to be translation- and scale-invariant (lib/landmarks.ts: wrist-centered, scaled by hand size) before classification.
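
The normalization step can be sketched as follows. This is an illustration, not the code in `lib/landmarks.ts`: the function name `normalizeLandmarks` and the choice of "hand size" (the wrist to middle-finger MCP distance) are assumptions. The landmark indices follow MediaPipe's hand model, where index 0 is the wrist and index 9 is the middle-finger MCP joint.

```typescript
type Landmark = { x: number; y: number; z: number };

const WRIST = 0; // MediaPipe hand landmark index for the wrist
const MIDDLE_MCP = 9; // MediaPipe hand landmark index for the middle-finger MCP joint

// Returns a flat 63-number feature vector (21 landmarks x 3 coordinates).
function normalizeLandmarks(landmarks: Landmark[]): number[] {
  if (landmarks.length !== 21) {
    throw new Error("expected 21 hand landmarks, got " + landmarks.length);
  }
  const wrist = landmarks[WRIST];

  // Translation invariance: express every point relative to the wrist.
  const centered = landmarks.map((p) => ({
    x: p.x - wrist.x,
    y: p.y - wrist.y,
    z: p.z - wrist.z,
  }));

  // Scale invariance: divide by an assumed hand-size measure.
  const ref = centered[MIDDLE_MCP];
  const size = Math.sqrt(ref.x * ref.x + ref.y * ref.y + ref.z * ref.z) || 1; // avoid dividing by zero

  const features: number[] = [];
  for (const p of centered) {
    features.push(p.x / size, p.y / size, p.z / size);
  }
  return features;
}
```

With this scheme, the same handshape held closer to the camera or in a different part of the frame maps to approximately the same vector, which is what lets a single-frame classifier work from a small number of samples.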

All recognition goes through the SignClassifier interface (lib/classifier/types.ts) — train(), addSample(), predict(), export()/import(). The current implementation is a single-frame KNN (lib/classifier/knn.ts); swap it for an LSTM later without rewriting the trainer UI or the call pages — just satisfy the same interface. lib/signStore.ts holds the one app-wide instance + persistence.
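
To illustrate the interface idea, here is a hedged sketch of a classifier contract and a small KNN that satisfies it. The method names come from the list above; the parameter and return types, the `ModelData` and `Prediction` types, and the `KnnClassifier` class are assumptions and may differ from `lib/classifier/types.ts` and `lib/classifier/knn.ts`.

```typescript
type ModelData = Record<string, number[][]>; // assumed export format: label -> samples
type Prediction = { label: string; confidence: number };

interface SignClassifier {
  addSample(label: string, features: number[]): void;
  train(): void;
  predict(features: number[]): Prediction | null;
  export(): ModelData;
  import(data: ModelData): void;
}

function euclidean(a: number[], b: number[]): number {
  let sum = 0;
  for (let i = 0; i < a.length; i++) sum += (a[i] - b[i]) * (a[i] - b[i]);
  return Math.sqrt(sum);
}

class KnnClassifier implements SignClassifier {
  private samples: { label: string; features: number[] }[] = [];
  private k: number;

  constructor(k = 3) {
    this.k = k;
  }

  addSample(label: string, features: number[]): void {
    this.samples.push({ label, features: features.slice() });
  }

  train(): void {
    // KNN is a lazy learner: the stored samples are the model, so there is nothing to fit.
  }

  predict(features: number[]): Prediction | null {
    if (this.samples.length === 0) return null;
    // Take the k closest stored samples, nearest first.
    const nearest = this.samples
      .map((s) => ({ label: s.label, dist: euclidean(s.features, features) }))
      .sort((a, b) => a.dist - b.dist)
      .slice(0, this.k);
    // Majority vote; on a tie the label that took the lead first (the nearer one) wins.
    const votes: Record<string, number> = {};
    let best = nearest[0].label;
    for (const n of nearest) {
      votes[n.label] = (votes[n.label] ?? 0) + 1;
      if (votes[n.label] > votes[best]) best = n.label;
    }
    return { label: best, confidence: votes[best] / nearest.length };
  }

  export(): ModelData {
    const data: ModelData = {};
    for (const s of this.samples) {
      if (!data[s.label]) data[s.label] = [];
      data[s.label].push(s.features.slice());
    }
    return data;
  }

  import(data: ModelData): void {
    this.samples = [];
    for (const label of Object.keys(data)) {
      for (const features of data[label]) this.addSample(label, features);
    }
  }
}
```

Because callers depend only on `SignClassifier`, a sequence model could replace `KnnClassifier` by doing real work in `train()` while the trainer UI and call pages keep calling the same five methods.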

Key Files

app/
  page.tsx                  Home page (contacts + Train signs button)
  train/page.tsx            ← Sign trainer (KNN, capture, export/import)
  call/[id]/page.tsx        In-call sign→speech experience
  globals.css               Tailwind base

components/
  HandTracker.tsx           ← CORE: webcam + MediaPipe Hands, emits landmarks
  CameraSignDetector.tsx    In-call readout: pretrained + KNN over HandTracker
  GlossPanel.tsx            Animated ASL gloss cards (Framer Motion)

lib/
  landmarks.ts              Translation/scale-invariant landmark normalization
  classifier/types.ts       SignClassifier interface (classifier-agnostic)
  classifier/knn.ts         KNN implementation of SignClassifier
  signStore.ts              App-wide model: localStorage + file import/export

data/
  signs.ts                  Vocabulary: pretrained + KNN labels, phrases, tones

How CameraSignDetector works

1. Calls getUserMedia for the webcam stream
2. Dynamically imports @mediapipe/tasks-vision (avoids SSR issues)
3. Loads the GestureRecognizer WASM from the jsDelivr CDN and the gesture model from Google Storage; no API key needed
4. Runs recognizeForVideo() in a requestAnimationFrame loop
5. Draws green skeleton connectors and red landmark dots on a <canvas> overlay
6. Shows the number of hands detected, the gesture name (Open_Palm / Closed_Fist / Victory / etc.), and the confidence as a percentage
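
The final readout step can be sketched as a pure function over the recognizer's per-frame result. The `RecognitionResult` type below mirrors only the `landmarks` and `gestures` fields of the MediaPipe result; the `summarize` function, the `Readout` type, the `minScore` threshold, and the treatment of the `"None"` label as "no gesture" are assumptions made for this example, not the component's actual code.

```typescript
// Structural subset of what GestureRecognizer.recognizeForVideo() returns.
type GestureCategory = { categoryName: string; score: number };
type RecognitionResult = {
  landmarks: { x: number; y: number; z: number }[][]; // one list of points per detected hand
  gestures: GestureCategory[][]; // ranked gesture candidates per detected hand
};

// Hypothetical view model for the on-screen readout.
type Readout = {
  handCount: number;
  gesture: string | null;
  confidencePct: number | null;
};

function summarize(result: RecognitionResult, minScore = 0.5): Readout {
  const handCount = result.landmarks.length;
  // Only the first hand's top candidate is reported in this sketch.
  const candidate = result.gestures.length > 0 ? result.gestures[0][0] : undefined;
  if (!candidate || candidate.categoryName === "None" || candidate.score < minScore) {
    return { handCount, gesture: null, confidencePct: null };
  }
  return {
    handCount,
    gesture: candidate.categoryName,
    confidencePct: Math.round(candidate.score * 100),
  };
}
```

Keeping this logic separate from the requestAnimationFrame loop makes it easy to check without a camera, and the same summary could feed both the overlay text and any downstream sign→speech step.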

Tech Stack

| Layer | Tool |
| --- | --- |
| Framework | Next.js 15 (App Router) |
| Language | TypeScript |
| Styles | Tailwind CSS |
| Animations | Framer Motion |
| Hand tracking | @mediapipe/tasks-vision (GestureRecognizer, browser WASM) |

What's Mocked (Next Iterations)

| Feature | Status |
| --- | --- |
| Hand landmark detection | ✅ Live (MediaPipe) |
| Gesture → ASL phrase mapping | 🔜 Next iteration |
| ASL interpretation (LLM) | 🔜 Next iteration |
| TTS voice output | 🔜 Next iteration |
| Sign avatar animation | 🔜 Next iteration |
| Two-device WebSocket link | 🔜 Next iteration |

Ethics Note

This is an accessibility guide / prototype, NOT a certified ASL interpreter. It does not replace human interpreters, Video Relay Services (VRS), or CART services. In high-stakes conversations (medical, legal, financial), use a certified human interpreter.
