RAG-powered analytics assistant grounded in project docs, schemas, and KPI definitions.
- Grounded retrieval (knowledge-index backed) instead of unconstrained generation.
- Guardrailed SQL suggestion limited to approved analytics marts.
- Simple evaluation harness for retrieval quality tracking.
- Primary fit: AI Engineer, Applied AI Engineer, Software Engineer (AI platform)
- Showcase focus: grounded retrieval, safe SQL guardrails, evaluation harness
- Resume mapping: see
PROOF.mdandRESUME_BULLETS.md
- Grounded Q&A over repository docs.
- Safe SQL suggestion generation from known schema.
- Evaluation set for answer quality.
- Python FastAPI
- Lexical retrieval pipeline with evaluation harness
- Safe SQL template guardrails
app/retrieval/indexer.py: chunking + index creation.app/retrieval/retriever.py: lexical/fuzzy scoring and top-k retrieval.app/sql_guardrails.py: safe template selection for SQL suggestions.app/api/routes.py: ask/index/eval/sql endpoints.
app/retrieval/ # Index + retrieval logic
app/api/ # FastAPI routes
data/knowledge/ # Grounding corpus
data/eval/ # Evaluation questions
tests/ # API tests
GET /healthPOST /v1/index/rebuildPOST /v1/askPOST /v1/sql/suggestGET /v1/eval
python3 -m venv --clear .venv
source .venv/bin/activate
pip install -r requirements.txt
python scripts/build_index.py
pytest -q
./scripts/run_local.shcurl -X POST http://localhost:8090/v1/ask \\
-H 'Content-Type: application/json' \\
-d '{\"question\":\"What is conversion_rate?\",\"top_k\":3}'curl http://localhost:8090/v1/evalGitHub Actions builds index + runs tests on push/PR:
.github/workflows/ci.yml
- Roadmap:
ROADMAP.md - Changelog:
CHANGELOG.md - Proof mapping:
PROOF.md - Resume bullets:
RESUME_BULLETS.md
- Planned gap-coverage work is tracked in
STACK_COVERAGE_PLAN.md.
- AWS:
docs/deploy/aws.md - Azure:
docs/deploy/azure.md
- BI semantic references:
data/knowledge/bi_semantics.md - Cloud operation references:
data/knowledge/cloud_platforms.md - Evaluation set:
data/eval/questions.json