Skip to content
View Fury0508's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Fury0508

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Fury0508/README.md

Hey, I'm Dhananjay 👋

Senior Data Engineer · Manchester, UK · 4.5 years experience
Databricks Certified · Google Cloud Certified · MSc Computer Science, University of Glasgow

    profile views


🧑‍💻 About me

Databricks and Google Cloud Certified Data Engineer with 4.5 years building cloud-native data platforms across Azure and GCP. I specialise in scalable ETL/ELT pipelines, medallion architecture, and data quality frameworks — and I'm expanding into GenAI and agentic systems to make data products smarter.

Proven track record of delivering measurable outcomes: £1M+ revenue uplift, 70% reduction in deployment failures, and 40% faster support resolution through GenAI solutions.

  • 🔭 Currently a Data Engineer at Footasylum, Manchester
  • 🤖 Building agentic analytics on Databricks with Claude Sonnet + Mosaic AI
  • 🌱 Exploring RAG, vector search, and LLM-powered data tooling
  • 🛂 Apache Airflow contributor — 4 merged PRs
  • 📬 guptadhananjay090@gmail.com

🛠️ Tech stack

Languages & scripting

Python SQL PySpark Bash

Data engineering & pipelines

Apache Spark Spark Streaming Apache Airflow dbt Delta Lake

Cloud platforms

Azure Databricks Azure Data Factory GCP BigQuery

Data warehouses & storage

Snowflake PostgreSQL ADLS Gen2

GenAI & ML

OpenAI Gemini LangChain Qdrant pgvector

DevOps & tooling

Docker GitLab CI Terraform Azure DevOps

BI & visualisation

Power BI


🚀 Featured projects

🎧 BookTalk AI — AI-native audiobook platform with multilingual narration, RAG-powered Q&A grounded in book content, and clickable audio citations. FastAPI pgvector OpenAI ElevenLabs Celery Valkey

🔍 LLM-Based Data Quality Analyser — GenAI tool that detects nulls, outliers and structural anomalies in CSV/Parquet files on S3, generating natural-language summaries and automated fixes. Enabled 70% faster QA checks. FastAPI OpenAI Pandas Streamlit AWS S3

🏗️ Customer360 Pipeline — Event-triggered pipeline validating and routing multi-source order data using ADF and Databricks, with PySpark transformations and Azure Key Vault credential management. ADF Databricks ADLS Gen2 PySpark

FastAPI CRUD Service — Production-ready REST API with full CRUD, auth, and validation. FastAPI SQLModel PostgreSQL


🏅 Certifications

Certification Issuer Year
Databricks Data Engineer Professional Databricks 2026
Google Cloud Professional Data Engineer Google Cloud 2025
GitLab Certified Associate GitLab 2024
Database and SQL for Data Science IBM 2024
Data Analysis with Python IBM 2024

🛂 Apache Airflow contributor — 4 merged issues


📈 GitHub stats

 

Popular repositories Loading

  1. fastAPIPractice fastAPIPractice Public

    Practicing CRED operations using FastAPI.

    Python

  2. Pysparkpractice Pysparkpractice Public

    Jupyter Notebook

  3. DataAnalysis DataAnalysis Public

    Jupyter Notebook

  4. DataStructure- DataStructure- Public

    I am going to practice data structure.

    Python

  5. InformationVisualisation InformationVisualisation Public

    Python

  6. metriport metriport Public

    Forked from metriport/metriport

    Metriport is an open-source universal API for healthcare data.

    JavaScript