Skip to content

dgne58/jarvis

Repository files navigation

🕶️ Jarvis — AI Networking Assistant for Meta Glasses

“Because LinkedIn is for amateurs.”
Jarvis transforms Meta Glasses into a real-time networking assistant — recognizing faces, recalling names, and generating AI-powered conversation cues during in-person interactions.

🚀 Overview

Cluely uses live speech transcription, facial recognition, and conversational AI to enhance real-world interactions.
It helps you remember people, recall context, and sound sharp — instantly.

Core Capabilities

  • Facial Recognition – Identify and recall people in real time
  • Live Transcription – Multi-speaker diarization with low latency
  • AI Conversation Hints – Context-aware, adaptive dialogue prompts
  • Voice Commands – Say “banana” to trigger recognition
  • On-Glasses UI – Optimized interface for Meta Glasses streaming

🧠 Tech Stack

Frontend

  • React 19 + Vite – Fast, modular UI
  • WebRTC APIs – Camera and mic access for live recognition
  • CSS3 (Glassmorphism) – Lightweight visual effects

Backend

  • FastAPI + SQLModel + PostgreSQL – Async Python stack
  • Docker Compose – Unified deployment
  • Face Recognition API – Custom image-matching service

AI & APIs

  • Deepgram Nova-3 – Real-time speech-to-text
  • OpenRouter (Grok-4-Fast) – Conversation intelligence model

⚙️ Setup

Prerequisites

  • Node.js 18+ and npm/pnpm
  • Python 3.10+ with UV
  • Deepgram + OpenRouter API keys
  • Docker (optional)

Frontend

npm install
npm run dev

Backend

cd backend
uv sync --dev
uv run uvicorn app.main:app --reload

Access the app at:

  • Frontend → http://localhost:5173
  • Backend → http://localhost:8000

📂 Project Structure

.
├── src/           # React frontend
│   ├── App.jsx
│   ├── main.jsx
│   └── styles/
├── backend/       # FastAPI service
│   ├── app/
│   ├── Dockerfile
│   └── pyproject.toml
└── public/        # Static assets

🔑 Environment Variables

Variable Description
VITE_DEEPGRAM_API_KEY Deepgram speech-to-text
VITE_OPENROUTER_API_KEY OpenRouter AI models
DATABASE_URL PostgreSQL connection string
SECRET_KEY Backend secret key
ENVIRONMENT local, staging, or production

🧩 Usage

  1. Enable Camera → Allow facial recognition access
  2. Start Transcription → Begin real-time analysis
  3. Say “banana” → Trigger face identification
  4. View Suggestions → Watch AI conversation prompts appear

🧪 Development Commands

Frontend

npm run build
npm run preview
npm run lint

Backend

make run
make check
make test

🧾 License

This project was developed for Calhacks 12.0.
© 2025

About

winner @ calhacks 12.0

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors