GitHub - abacushacks/adahacks-2026

Real-time AI memory aid for people with dementia, face blindness, and memory impairment.

Demo • Features • How It Works • Getting Started • Architecture • Roadmap

"Because no one should forget who they love."

Faces watches your camera and listens to your conversations in real time. When someone walks up to you, it recognizes their face and shows you their name, your relationship, and every important detail from past conversations — instantly, with zero effort on your part.

The Problem

Over 55 million people worldwide live with dementia, projected to hit 139 million by 2050. Another 10 million have prosopagnosia (face blindness), 2 million live with aphasia, and nearly 40% of people over 65 experience age-associated memory impairment. There is no real-time, frictionless tool that helps someone know who is standing in front of them.

Demo

Click here to view demo.

Features

Face Recognition — Always On

Continuous real-time face detection and identification at full frame rate
Handles multiple faces simultaneously in a single frame, each with independent labels
Tracks people seamlessly as they move, turn, step out, and come back
Persistent identity — recognizes the same person across sessions

Conversational Learning

Say "My name is Sarah" → name updates on screen instantly
Say "I'm your daughter" → relationship badge changes live
Mention "doctor appointment on Tuesday" → captured as a key detail
No hardcoded phrases — the AI understands natural introductions and relationship declarations

Smart Key Info Extraction

Extracts dates, appointments, events, people, places, health info from natural speech
Maintains a living memory list — corrects outdated info instead of duplicating
Filters out small talk and filler — only stores what actually matters
All details persist across sessions and server restarts

Cross-Platform, Zero Setup

Runs entirely in the browser — no install, no native dependencies
Works on any device with a camera, microphone, and a browser
Open a URL, allow permissions, done

How It Works

Camera captures video → face-api.js detects and identifies faces client-side using 128-dim descriptors
Microphone streams raw PCM audio over WebSocket → custom audio gate filters silence → Whisper transcribes
Zen API (LLM) runs two parallel extraction tasks per transcription:
- Identity extraction — names and relationships from natural speech
- Key info extraction — dates, events, appointments, people, health details
Results are stored in SQLite and pushed to the frontend in real-time via WebSocket

Getting Started

Prerequisites

Python 3.9+
pip
A modern browser (Chrome recommended)
AUTH API key (get one here)

Installation

# Clone the repository
git clone https://github.com/abacushacks/adahacks-2026.git
cd adahacks-2026

# Install Python dependencies
cd backend
pip install -r requirements.txt

# Run migrations
python3 manage.py migrate

# Set your AUTH API key
export AUTH_KEY="your--api-key"

# Start the server
python3 manage.py runserver

Usage

Allow camera and microphone access
Click anywhere on the screen to start the audio pipeline
Faces are detected automatically — start talking and watch the magic happen

Architecture

Frontend

File	Purpose
`frontend/js/app.js`	Main app — camera setup, render loop, popup positioning, active face tracking
`frontend/js/face-tracker.js`	Face detection, recognition, descriptor matching, popup management
`frontend/js/audio-manager.js`	WebSocket connection, audio streaming, speech gate
`frontend/css/style.css`	Popup overlay styling, key info bullet list

Backend

File	Purpose
`backend/audio_stream/consumers.py`	WebSocket consumer — handles audio, face descriptors, transcription, extraction
`backend/audio_stream/zen_service.py`	Zen API (LLM) — name/relationship parsing, key info extraction
`backend/audio_stream/services.py`	Face matching service — Euclidean distance on 128-dim descriptors
`backend/audio_stream/models.py`	Face model — label, descriptor, name, relationship, metadata
`backend/audio_stream/constants.py`	Tunable parameters — buffer sizes, thresholds, Whisper config

AI Pipeline

The AI context engine is the core intelligence of Faces. Every transcription chunk triggers two parallel LLM calls via asyncio.gather():

Identity extraction — detects name/relationship introductions from natural speech. No regex, no hardcoded patterns — the model understands conversational phrasing.
Key info extraction — receives the full current memory state and returns an updated, curated list. Replaces outdated entries, filters noise, only keeps what matters.

Both are race-condition-safe: the face recognition pipeline re-reads the database after its API calls return to avoid overwriting speech-derived updates.

Tech Stack

Layer	Technology
Face Detection	face-api.js (SSD MobileNet + 128-dim descriptors)
Speech-to-Text	OpenAI Whisper (local, base.en model)
LLM	Zen API (OpenAI-compatible, kimi-k2.5)
Backend	Django + Django Channels + Daphne
Transport	WebSockets (raw PCM audio + JSON)
Database	SQLite
Frontend	Vanilla JavaScript + HTML5 Canvas + WebAudio API

Roadmap for the future

AR glasses integration — context in your actual field of view
Lip tracking — supplement audio with visual speech recognition
Multi-language support
Caregiver dashboard — family members can add context before visits
Emotion & mood detection
On-device LLM inference — fully offline, no API dependency

For our Community

This project was built for people who need it most:

Condition	Affected Population
Dementia	55 million worldwide (139M by 2050)
Prosopagnosia (face blindness)	10 million
Aphasia	2 million (US alone)
Age-related memory impairment	~40% of people over 65
Unpaid dementia caregivers	11 million (US alone)

License

See LICENSE for details.

Built by Abacus at AdaHacks 2026

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
backend		backend
frontend		frontend
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The Problem

Demo

Features

Face Recognition — Always On

Conversational Learning

Smart Key Info Extraction

Cross-Platform, Zero Setup

How It Works

Getting Started

Prerequisites

Installation

Usage

Architecture

Frontend

Backend

AI Pipeline

Tech Stack

Roadmap for the future

For our Community

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

The Problem

Demo

Features

Face Recognition — Always On

Conversational Learning

Smart Key Info Extraction

Cross-Platform, Zero Setup

How It Works

Getting Started

Prerequisites

Installation

Usage

Architecture

Frontend

Backend

AI Pipeline

Tech Stack

Roadmap for the future

For our Community

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages