Inspiration

Every day, millions of professionals waste 40+ minutes managing email during dead time - commuting, walking, waiting. We experienced this firsthand as hackers. When OpenAI's voice models crossed the quality threshold needed for natural conversation, we realized the technology was finally ready to solve this problem.

What We Built

Nomad is an autonomous voice AI agent that manages your entire inbox hands-free. Say "What are my urgent emails?" and within 1.2 seconds, Nomad reads, prioritizes, and summarizes what matters most. It learns your writing style, drafts contextual replies, and can send emails - all through natural voice commands.

How We Built It

Architecture:

  • Voice Pipeline: OpenAI Whisper → Custom NLP → Action Engine → OpenAI TTS
  • Email Intelligence: Real-time IMAP sync → ML prioritization → Context extraction → Response generation
  • Autonomous Loop: Background monitoring → Pattern learning → Proactive drafting

Key Technical Achievements: // Sub-1.5s voice response achieved through parallel processing const [transcription, emailData] = await Promise.all([ whisper.transcribe(audio), emailStore.prefetch() ]);

ML Prioritization Algorithm: Analyzes sender importance, keywords, and historical patterns

Challenges We Faced

  1. Latency Optimization: Initial response time was 4+ seconds. We achieved 1.2s through:
  2. Parallel API calls
  3. Email pre-caching
  4. Optimized Whisper parameters

  5. Voice Context Understanding: Natural language is ambiguous. We built a custom intent parser that handles variations like "Reply to Jane" vs "Send Jane a response about the contract."

  6. Email Authentication: IMAP/SMTP integration required handling OAuth2, app passwords, and 2FA across Gmail, Outlook, and corporate servers.

  7. Autonomy Balance: Making the agent proactive without being intrusive - we implemented confidence thresholds and user confirmation for critical actions.

Nomad is reimagining productivity for the voice-first era.

Built With

Share this project:

Updates