Inspiration

Recent cloud outages showed how easily a single failure can ripple across systems and disrupt millions of users around the world. As developers, we’ve personally experienced that moment of panic when something breaks. Logs explode, services go dark, and it takes far too long to figure out what happened and how to fix it.

Continuum is a tool that does more than alert you: it helps you understand what is going on.

What it does

Continuum is an AI-powered incident detection and triage dashboard that helps developers maintain uptime.

Continuum can:

  • Detect server errors in real time
  • Notify developers by phone call when a critical error occurs
  • Generate a root cause summary and suggested fixes using Gemini
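
The detection and escalation steps above can be sketched roughly as follows. This is a minimal illustration, not the project's actual agent code: the log line format, the `Incident` type, the `detect_incident` function, and the choice of which status codes count as critical are all assumptions made for the example.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Hypothetical log format, e.g. "2024-05-01T12:00:00Z GET /api/users 503".
LOG_LINE = re.compile(
    r"^(?P<ts>\S+) (?P<method>[A-Z]+) (?P<path>\S+) (?P<status>\d{3})$"
)


@dataclass
class Incident:
    timestamp: str
    path: str
    status: int
    critical: bool  # True when the incident should trigger a phone call


def detect_incident(
    line: str, critical_statuses: frozenset = frozenset({500, 503})
) -> Optional[Incident]:
    """Return an Incident for 5xx log lines, or None for anything else."""
    match = LOG_LINE.match(line.strip())
    if match is None:
        return None  # not a line in the assumed format
    status = int(match["status"])
    if status < 500:
        return None  # only server errors are treated as incidents
    return Incident(
        timestamp=match["ts"],
        path=match["path"],
        status=status,
        critical=status in critical_statuses,
    )
```

In a setup like this, only incidents with `critical=True` would be passed on to the calling step, while every incident could still be stored and summarized.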

How we built it

  • Frontend (Next.js + React + TypeScript): We built a responsive dashboard that displays uptime graphs, error timelines, and live incident data. The frontend communicates with our backend through a typed API layer (Zod, Drizzle).
  • Database Layer (Neon + Postgres): Socket.io handles real-time event streaming, while Postgres stores historical logs, AI summaries, and server state. Our API layer connects both and exposes clean endpoints to the frontend and agent.
  • Agent Container (Python + Docker + Socket.io)
  • Voice Agent (Twilio + ElevenLabs)

Challenges we ran into

  • Generating Twilio and ElevenLabs calls with dynamic input
  • Drizzle ORM schemas and data parsing
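
One part of the dynamic-input problem is that incident text can contain characters that break the XML Twilio expects. The sketch below builds a TwiML document using Twilio's `<Response>` and `<Say>` verbs and escapes the dynamic text first. It is an assumption-laden illustration: `build_twiml`, its parameters, and the length cap are hypothetical, and it uses Twilio's built-in speech rather than ElevenLabs audio, which the project may have handled differently.

```python
from xml.sax.saxutils import escape


def build_twiml(service: str, summary: str, max_chars: int = 300) -> str:
    """Build a TwiML response that reads an incident summary aloud.

    `service` and `summary` are dynamic values (hypothetical names), so they
    are XML-escaped before being placed inside the <Say> element.
    """
    spoken = f"Critical incident on {service}. {summary}"
    if len(spoken) > max_chars:
        # Keep the call short; the cap is an arbitrary assumption.
        spoken = spoken[:max_chars].rstrip() + "..."
    return (
        '<?xml version="1.0" encoding="UTF-8"?>'
        f"<Response><Say>{escape(spoken)}</Say></Response>"
    )
```

The resulting string could then be supplied as the call instructions when placing the outbound call.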

What we learned

  • Integrating Twilio and ElevenLabs into an automated calling system triggered by specific events
  • Building our own historical uptime graph by adapting Atlassian’s uptime logic
  • Managing containers and capturing their logs and outputs
  • Hands-on server management concepts: health checks, crashes, restarts
  • WebSockets with Socket.io
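
For the uptime graph, one common approach is to compute a per-day uptime percentage from recorded outage intervals, counting partial outages at a reduced weight. The sketch below shows that idea only. The `Outage` type, the `daily_uptime_percent` function, and the default weight of 0.3 are assumptions for illustration and are not taken from the project's code.

```python
from dataclasses import dataclass
from typing import Iterable

SECONDS_PER_DAY = 86_400


@dataclass
class Outage:
    start: int   # seconds since the start of the day
    end: int     # seconds since the start of the day
    major: bool  # True for a full outage, False for a partial one


def daily_uptime_percent(
    outages: Iterable[Outage], partial_weight: float = 0.3
) -> float:
    """Return the uptime percentage for one day, rounded to two decimals.

    Major outages count in full; partial outages count at `partial_weight`
    (an assumed default). Overlapping outages are not merged in this sketch.
    """
    downtime = 0.0
    for outage in outages:
        # Clamp each interval to the boundaries of the day.
        start = max(0, outage.start)
        end = min(SECONDS_PER_DAY, outage.end)
        duration = max(0, end - start)
        downtime += duration * (1.0 if outage.major else partial_weight)
    downtime = min(downtime, SECONDS_PER_DAY)
    return round(100.0 * (1 - downtime / SECONDS_PER_DAY), 2)
```

Each day's value can then be mapped to one bar in the graph, for example by coloring days below a chosen threshold differently.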

What's next for Continuum

  • Run stress tests on active servers
  • Implement agentic RAG to trace the exact failing commits in the GitHub repository
