Row level lineage at Carbonfact
🎥 python data-eng
No pain no startup
📝 showerthought
Scraping Google Calendar events
📝 python scraping
Warmshowers sparks joy
📝 bike-touring showerthought
Do LLMs identify fonts?
📝 llm scraping
Thoughts on DuckLake
📝 data-eng
The total derivative of a metric tree
📝 data-science
Minimizing the runtime of a SQL DAG
📝 data-engineering python
Introducing icanexplain @ PyData Paris 2024
🎥 analytics-engineering python
LCA software: exit the matrix
📝 sustainability python
Cutting up shoes to measure their footprint
📝 sustainability data-science
A training set for bike sharing forecasting
📝 data-eng machine-learning
Decomposing funnel metrics
📝 data-science
Efficient ELT refreshes
📝 data-eng
Online machine learning on the road @ IDE+A, TH Köln
🎞️ online-machine-learning
Measuring the carbon footprint of pizzas
📝 sustainability python
Graph components with DuckDB
📝 data-science sql
For analytics, don't use dynamic JSON keys
📝 data-eng sql
Online gradient descent written in SQL
📝 online-machine-learning sql
Using SymPy in Python doctests
📝 python
Online active learning in 80 lines of Python
📝 online-machine-learning
Are Airbnb guests less energy efficient than their host?
📝 sustainability data-science
The future of River
📝 online-machine-learning
Parsing garment descriptions with GPT-3
📝 text-processing
NLP at Carbonfact: how would you do it?
📝 text-processing
Matrix inverse mini-batch updates
📝 online-machine-learning
A rant against dbt ref
📝 data-eng sql rant
First IRL meetup with the River developers
📝 online-machine-learning
Online machine learning with River @ GAIA
🎥 online-machine-learning
Fuzzy regex matching in Python
📝 text-processing
OCR spelling correction is hard
📝 text-processing
Comic book panel segmentation
📝 image-processing
Online machine learning in practice @ PyData PDX
🎞️ online-machine-learning
The online machine learning predict/fit switcheroo
📝 online-machine-learning
Online machine learning in practice @ Applied AI
🎥 online-machine-learning
Online machine learning in practice @ LVMH
🎞️ online-machine-learning
Web scraping, upside down
📝 scraping
One year at Alan
📝 job-log
Manipulating ephemeral data with git
🎞️ scraping
Dashboards and GROUPING SETS
📝 data-eng sql
Automated document processing at Alan
📝 text-processing
Text classification by data compression
📝 machine-learning text-processing
Reducing the memory footprint of a scikit-learn text classifier
📝 machine-learning text-processing
An overview of dataset time travel
📝 data-eng
What my PhD was about
📝 job-log
Unsupervised text classification with word embeddings
📝 machine-learning text-processing
Focal loss implementation for LightGBM
📝 machine-learning
A few intermediate pandas tricks
📝 data-eng
The correct way to evaluate online machine learning models
📝 online-machine-learning
Our solution to the IDAO 2020 qualifiers
🎞️ competitive-machine-learning
Speeding up scikit-learn for single predictions
📝 machine-learning
Machine learning for streaming data with creme
📝 online-machine-learning
Bayesian linear regression for practitioners
📝 machine-learning
Under-sampling a dataset with desired ratios
📝 machine-learning
The benefits of online machine learning @ Quantmetry
🎞️ online-machine-learning
The benefits of online machine learning @ Element AI
🎞️ online-machine-learning
Finding fuzzy duplicates with pandas
📝 data-eng
A smooth approach to putting machine learning into production
📝 machine-learning data-eng
The benefits of online machine learning @ Airbus Bizlab
🎞️ online-machine-learning
Skyline queries in Python
📝 data-eng
Online machine learning with creme @ PyData Amsterdam
🎞️ online-machine-learning
SQL subquery enumeration
📝 sql
Morellet crosses with JavaScript
📝 generative-art
Streaming groupbys in pandas for big datasets
📝 online-machine-learning
Target encoding done the right way
📝 machine-learning
Stella triangles with JavaScript
📝 generative-art
Unknown pleasures with JavaScript
📝 generative-art
Docker for data science @ HelloFresh Berlin
🎞️ data-science
Halftoning with Go - Part 2
📝 image-processing
Grid paintings à la Mondrian with JavaScript
📝 generative-art
Challenge Big Data @ TSE
🎥 competitive-machine-learning
Halftoning with Go - Part 1
📝 image-processing
Predire la disponibilité des Velib' @ Toulouse Data Science Meetup
🎥 data-science machine-learning data-viz
Recursive polygons with JavaScript
📝 generative-art
The Naïve Bayes classifier
📝 machine-learning
An introduction to genetic algorithms
📝 machine-learning
Visualizing bike stations live data
📝 data-viz