boccileonardo/README.md

Hi there 👋

I'm Leo, a data engineer at P&G working on the customer data pipeline.
You can view my personal projects on my site, https://leobocci.pages.dev/, or in my GitHub repos.

  • 🔨 Currently working on:
    Real-time lakehouse from Kafka to Databricks Unity Catalog with Spark streaming
    Python CI/CD (GitHub Actions, Ruff, uv packaging, Databricks Asset Bundles)
    Benchmarking innovative projects (Polars/DuckDB vs. Spark for cloud infrastructure cost savings)
    Metadata & data quality (Great Expectations, Soda Core, data contracts, configuration as code)
  • ⚡ Currently learning:
    Rust

Pinned

  1. airstream-fm Public

    AirStream FM - Discover New Music

    Python

  2. databricks-bundle-decorators Public

    Adds orchestration syntax to Databricks Asset Bundles, inspired by the Airflow TaskFlow API, so DAG dependencies can be declared in code.

    Python 1

  3. f1toolbox Public

    Modern data platform on GCP (Dagster, dbt, BigQuery, Airbyte, Metabase)

    Python

  4. f1toolbox-infra Public

    Terraform to deploy data platform services on GCP

    HCL

  5. agoralewski/ce-it-hub-hackathon-2025 Public

    HTML 3

  6. Ahmed-Haitham/CE-IT-Hub-Hackathon-2024 Public

    JavaScript 3