This repository compares how to build dataflows with the macro orchestrator Dagster and the micro orchestrator Apache Hamilton.
See the side-by-side comparison in the Apache Hamilton documentation.
dagster_code/ includes code from the Dagster tutorial to load data and compute statistics from the website HackerNews. hamilton_code/ is a refactor of dagster_tutorial/ using the Apache Hamilton framework.
Each directory contains instructions on how to run the code. We suggest going through the Dagster code first, then reading the Apache Hamilton refactor.
- Create a virtual environment and activate it:
  python -m venv venv && . venv/bin/activate
- Install requirements for both Dagster and Apache Hamilton:
  pip install -r requirements.txt
- Dagster-specific instructions are found under dagster_code/