Senior Data Engineer · Manchester, UK · 4.5 years experience
Databricks Certified · Google Cloud Certified · MSc Computer Science, University of Glasgow
Databricks and Google Cloud Certified Data Engineer with 4.5 years building cloud-native data platforms across Azure and GCP. I specialise in scalable ETL/ELT pipelines, medallion architecture, and data quality frameworks — and I'm expanding into GenAI and agentic systems to make data products smarter.
Proven track record of delivering measurable outcomes: £1M+ revenue uplift, 70% reduction in deployment failures, and 40% faster support resolution through GenAI solutions.
- 🔭 Currently a Data Engineer at Footasylum, Manchester
- 🤖 Building agentic analytics on Databricks with Claude Sonnet + Mosaic AI
- 🌱 Exploring RAG, vector search, and LLM-powered data tooling
- 🛂 Apache Airflow contributor — 4 merged PRs
- 📬 guptadhananjay090@gmail.com
Languages & scripting
Data engineering & pipelines
Cloud platforms
Data warehouses & storage
GenAI & ML
DevOps & tooling
BI & visualisation
🎧 BookTalk AI — AI-native audiobook platform with multilingual narration, RAG-powered Q&A grounded in book content, and clickable audio citations. FastAPI pgvector OpenAI ElevenLabs Celery Valkey
🔍 LLM-Based Data Quality Analyser — GenAI tool that detects nulls, outliers and structural anomalies in CSV/Parquet files on S3, generating natural-language summaries and automated fixes. Enabled 70% faster QA checks. FastAPI OpenAI Pandas Streamlit AWS S3
🏗️ Customer360 Pipeline — Event-triggered pipeline validating and routing multi-source order data using ADF and Databricks, with PySpark transformations and Azure Key Vault credential management. ADF Databricks ADLS Gen2 PySpark
⚡ FastAPI CRUD Service — Production-ready REST API with full CRUD, auth, and validation. FastAPI SQLModel PostgreSQL
| Certification | Issuer | Year |
|---|---|---|
| Databricks Data Engineer Professional | Databricks | 2026 |
| Google Cloud Professional Data Engineer | Google Cloud | 2025 |
| GitLab Certified Associate | GitLab | 2024 |
| Database and SQL for Data Science | IBM | 2024 |
| Data Analysis with Python | IBM | 2024 |
🛂 Apache Airflow contributor — 4 merged issues


