SQL Whisper

Talk to any database in plain English. Voice or text. With memory.

SQL Whisper is a voice-and-text AI database assistant built for SCU Hack-A-Stack 2026. Connect any MySQL or TDSQL-compatible database, ask questions in natural language, refine them in conversation, and get back real SQL, real results, charts, and semantic insights — no SQL knowledge required.

Built in 24 hours by three Santa Clara University students.

What it does

Type or speak a question. Get back an answer. Then ask a follow-up.

You: "Show me my top 5 customers by spending"

Whisper: Here are your highest-spending customers...

Rank Customer Total

1 Matthew White $4,817.37

2 William Smith $3,926.47

...

You: "Now only show the ones from California"

Whisper: William Smith is your top California customer with $3,926.47 in spending. Interestingly, he was also #2 on your overall top 5 list!

The agent introspects the database schema, generates safe SQL, runs it, explains the result, and remembers what you asked previously. Follow-up questions reference prior context automatically.

Architecture

┌─────────────────────────────────────────────────────────┐
│                      Frontend (React)                   │
│           Atomic Command Surface architecture           │
│   Connections │ Query Input │ Voice │ History │ Results │
└──────────────────────────┬──────────────────────────────┘
                           │ HTTPS
┌──────────────────────────▼──────────────────────────────┐
│                   Backend (Flask + Python)              │
│                                                         │
│  /ask  ──▶ Claude agent (tool-use loop)                 │
│              ├─▶ get_schema  ──▶ INFORMATION_SCHEMA     │
│              └─▶ run_query   ──▶ Safety pipeline:       │
│                                   • Comment strip       │
│                                   • SELECT-only         │
│                                   • Single-statement    │
│                                   • Row cap (500)       │
│                                   • Dual-layer timeout  │
│                                                         │
│  /trtc/*  ──▶ Tencent TRTC + StartAITranscription       │
│  /connections, /history, /schema, /tables, /query       │
│  /conversation/reset                                    │
└──────────────────────────┬──────────────────────────────┘
                           │
              ┌────────────┼────────────┐
              ▼            ▼            ▼
        App DB         Redis      User DB(s)
        (MySQL)      (cache +     (MySQL/TDSQL)
        encrypted    conversation  Schema introspected
        credentials  state)       Read-only access
        + history

Tech stack

Backend

Flask + flask-cors
pymysql + cryptography (Fernet)
Anthropic Claude (claude-haiku-4-5) via the official SDK
Tencent Cloud TRTC + StartAITranscription
Redis 7 for schema caching and multi-turn conversation state
MySQL 8 / TDSQL-compatible

Frontend

React with Atomic Command Surface architecture
TRTC SDK v5 for voice input
Tailwind CSS for styling
Recharts for auto-rendered data visualization

Infrastructure

Docker for local databases and Redis
ngrok for live frontend-backend tunnel during development

Features

Natural language → SQL

A Claude-powered agent calls tool definitions (get_schema, run_query) in a loop until it has enough information to answer. Generates correctly-joined multi-table SQL on the first try for most questions.

Multi-turn conversation

Ask a question, then refine. "Show me top customers" → "Now only the California ones" → "What did they buy?" Each follow-up references prior context. Conversation state is persisted in Redis per connection with a 10-minute TTL. The UI shows a "Follow-up Mode" indicator and a "New Conversation" button to reset.

Semantic stat cards

A second Claude call produces three editorialized stat cards per query — not just row counts, but computed insights like top values, group averages, and qualitative tags. Each card carries a sentiment color (positive, warning, neutral) for the UI.

Auto-rendered charts

Result data renders as charts when the shape supports it (bar, line, etc.), driven by the data structure rather than manual selection.

SQL safety pipeline

Every query passes through:

Comment stripping (so /*SELECT*/ DROP can't fool the validator)
First-token SELECT allowlist
Single-statement enforcement (no SELECT 1; DROP TABLE film)
Automatic row cap injection (LIMIT 501 — 500 + 1 to detect truncation)
Dual-layer 5-second timeouts (client-side thread + MySQL MAX_EXECUTION_TIME)

Redis caching

Two load-bearing Redis usages:

Schema cache — schemas introspected on first query and cached for 5 minutes. Repeat queries pay ~10ms instead of ~200ms.
Conversation state — multi-turn conversation messages cached per connection with 10-minute TTL.

Credentials encryption

Database credentials are encrypted at rest with Fernet symmetric encryption. Plaintext passwords are decrypted just-in-time for each query and never persisted to logs.

Voice input

Tencent Cloud TRTC integration. The user joins a TRTC room, an ASR bot joins as a phantom user and streams transcripts via TRTC.EVENT.CUSTOM_MESSAGE, and the final sentence gets fed into the same /ask pipeline that handles typed queries.

Multi-database support

Connect multiple databases simultaneously. Switch between them in the sidebar. Each connection has its own history and independent conversation state.

Auto-saved history

Every successful query is saved to query_history with the prompt, results, and stat cards. Click any past entry to replay without re-running the SQL. Delete individual entries.

API reference

Connection management

POST /connections — register a database
GET /connections — list connections
POST /connections/<id>/test — verify connection works
DELETE /connections/<id> — remove (cascades to history)

Schema

GET /schema — full compressed schema (Redis-cached, for the LLM)
GET /tables — table list with row counts (for sidebars)

Query

POST /ask — natural language → SQL + results + stats (multi-turn aware)
POST /query — raw SQL (safety-validated)
POST /conversation/reset — clear conversation history for a connection

History

GET /history?connection_id=<id> — list
GET /history/<id> — single entry
DELETE /history/<id> — remove

Voice

POST /trtc/usersig — mint a TRTC UserSig
POST /trtc/start-transcription — spawn ASR bot into room
POST /trtc/stop-transcription — tear down ASR bot

All data endpoints require X-Connection-Id: <uuid> header.

Running locally

Backend

cd backend
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt

Create a .env file:

ENCRYPTION_KEY=<run: python -c "from cryptography.fernet import Fernet; print(Fernet.generate_key().decode())">
APP_DB_HOST=localhost
APP_DB_PORT=3316
APP_DB_USER=root
APP_DB_PASS=hackathon
APP_DB_NAME=hackastack_app
REDIS_HOST=localhost
REDIS_PORT=6379
ANTHROPIC_API_KEY=<your key>
TRTC_SDK_APP_ID=<from console.trtc.io>
TRTC_SDK_SECRET_KEY=<from console.trtc.io>
TENCENT_SECRET_ID=<from CAM console>
TENCENT_SECRET_KEY=<from CAM console>
TRTC_REGION=ap-singapore

Spin up databases and Redis:

docker run --name app-db-mysql -e MYSQL_ROOT_PASSWORD=hackathon -e MYSQL_DATABASE=hackastack_app -p 3316:3306 -d mysql:8
docker run --name demo-db-mysql -e MYSQL_ROOT_PASSWORD=demopass -e MYSQL_DATABASE=sakila -p 3326:3306 -d mysql:8
docker run --name whisper-redis -p 6379:6379 -d redis:7

Load the Sakila demo data:

curl -O https://downloads.mysql.com/docs/sakila-db.tar.gz
tar -xzf sakila-db.tar.gz
docker exec -i demo-db-mysql mysql -uroot -pdemopass sakila < sakila-db/sakila-schema.sql
docker exec -i demo-db-mysql mysql -uroot -pdemopass sakila < sakila-db/sakila-data.sql

(Optional) Load the synthetic small business demo data:

python3 seed_small_business.py

Initialize the app DB schema (run once):

docker exec -i app-db-mysql mysql -uroot -phackathon hackastack_app <<EOF
CREATE TABLE user_connections (
  id CHAR(36) PRIMARY KEY,
  name VARCHAR(255) NOT NULL,
  host VARCHAR(255) NOT NULL,
  port INT DEFAULT 3306,
  db_name VARCHAR(255) NOT NULL,
  username VARCHAR(255) NOT NULL,
  password_enc TEXT NOT NULL,
  db_type VARCHAR(50) DEFAULT 'mysql',
  created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
);

CREATE TABLE query_history (
  id CHAR(36) PRIMARY KEY,
  connection_id CHAR(36) NOT NULL,
  name VARCHAR(255),
  prompt TEXT NOT NULL,
  response_payload JSON NOT NULL,
  created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
  INDEX idx_connection_created (connection_id, created_at DESC)
);
EOF

Run Flask:

python3 app.py
# Backend now live at http://localhost:5001

Frontend

cd frontend
npm install
npm run dev

Built for SCU Hack-A-Stack 2026

Tracks targeted:

Tencent Cloud — TRTC voice input + TRTC AI transcription + TDSQL-compatible architecture
Redis - not an track but a sponsor, used for context saving and multi-turn conversations

What we learned

Agentic AI with proper tool definitions is dramatically more capable than single-shot prompting for SQL generation. Letting Claude call get_schema and run_query separately lets it adapt to unknown databases without prompt engineering for every schema.
SQL safety is a layered problem. Comment stripping must run before validation — otherwise an attacker can hide DROP TABLE inside /* */ and slip past the SELECT allowlist.
Encrypt-at-rest with just-in-time decryption is the right pattern for credential storage even in a hackathon. Fernet makes this five lines of Python.
Coordinating three developers writing to the same Flask file requires a clear API contract negotiated up front. We aligned on field names and response shapes before either side wrote glue code.
Multi-turn conversation requires serializing Anthropic SDK objects (TextBlock, ToolUseBlock) before persisting to Redis. model_dump() is your friend.

What's next

Native Tencent Cloud TDSQL deployment
Team workspaces with per-user auth
Cross-database queries ("compare customers in DB A vs DB B")
Query result export (CSV, Sheets)

Team

Matthew Moyer — Backend architecture, Tencent TRTC integration, AI agent, Redis caching, multi-turn implementation
Zach Malinoski — Frontend, UX, Atomic Command Surface architecture, voice UI, charts
Joshua Cao — Agent design, prompt engineering, endpoint development, agentic tooling

Name		Name	Last commit message	Last commit date
Latest commit History 92 Commits
backend		backend
frontend/sql_whisper_frontend		frontend/sql_whisper_frontend
.gitignore		.gitignore
README.md		README.md
agentic_workflow.pdf		agentic_workflow.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SQL Whisper

What it does

Architecture

Tech stack

Features

Natural language → SQL

Multi-turn conversation

Semantic stat cards

Auto-rendered charts

SQL safety pipeline

Redis caching

Credentials encryption

Voice input

Multi-database support

Auto-saved history

API reference

Connection management

Schema

Query

History

Voice

Running locally

Backend

Frontend

Built for SCU Hack-A-Stack 2026

What we learned

What's next

Team

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SQL Whisper

What it does

Architecture

Tech stack

Features

Natural language → SQL

Multi-turn conversation

Semantic stat cards

Auto-rendered charts

SQL safety pipeline

Redis caching

Credentials encryption

Voice input

Multi-database support

Auto-saved history

API reference

Connection management

Schema

Query

History

Voice

Running locally

Backend

Frontend

Built for SCU Hack-A-Stack 2026

What we learned

What's next

Team

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages