polars

Classic Apache Hamilton Hello World

In this example we show you how to create a simple hello world dataflow that creates a polars dataframe as a result. It performs a series of transforms on the input to create columns that appear in the output.

File organization:

my_functions.py houses the logic that we want to compute. Note (1) how the functions are named, and what input parameters they require. That is how we create a DAG modeling the dataflow we want to happen.
my_script.py houses how to get Apache Hamilton to create the DAG, specifying that we want a polars dataframe and exercise it with some inputs.

To run things:

> python my_script.py

You can even run this example in Google Colab:

Visualizing Execution

Here is the graph of execution - which should look the same as the pandas example:

Caveat with Polars

There is one major caveat with Polars to be aware of: THERE IS NO INDEX IN POLARS LIKE THERE IS WITH PANDAS.

What this means is that when you tell Apache Hamilton to execute and return a polars dataframe if you are using the provided results builder, i.e. hamilton.plugins.h_polars.PolarsResultsBuilder, then you will have to ensure the row order matches the order you expect for all the outputs you request. E.g. if you do a filter, or a sort, or a join, or a groupby, you will have to ensure that when you ask Apache Hamilton to materialize an output that it's in the order you expect.

If you have questions, or need help with this example, join us on slack, and we'll try to help!

Otherwise if you have ideas on how to better make Apache Hamilton work with Polars, please open an issue or start a discussion!

Name		Name	Last commit message	Last commit date
parent directory ..
lazyframe		lazyframe
materialization		materialization
with_columns		with_columns
README.md		README.md
my_functions.py		my_functions.py
my_script.py		my_script.py
notebook.ipynb		notebook.ipynb
polars.png		polars.png
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Classic Apache Hamilton Hello World

Visualizing Execution

Caveat with Polars

FilesExpand file tree

polars

Directory actions

More options

Directory actions

More options

Latest commit

History

polars

Folders and files

parent directory

README.md

Classic Apache Hamilton Hello World

Visualizing Execution

Caveat with Polars