deltaio/delta-docker

Sponsored OSS

By Delta Lake

Updated 4 months ago

Delta Lake docker with Python, Jupyter, PySpark, Scala Spark, Rust, and ROAPI samples included.

Image
Machine learning & AI
Data science
Databases & storage
24

10K+

deltaio/delta-docker repository overview

Delta Lake Logo

Test License PyPI PyPI - Downloads

Delta Lake is an open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs for Scala, Java, Rust, Ruby, and Python.

Docker Versions
TagPlatformPythonRustDelta-SparkSparkJupyterLabPandasPolarsROAPI
1.0.0_3.0.0amd640.12.0latest3.0.03.5.03.6.31.5.3x0.9.0
1.0.0_3.0.0_arm64arm640.12.0latest3.0.03.5.03.6.31.5.3x0.9.0
4.0.0arm64/amd641.1.141.1.144.0.04.0.04.4.6x1.33.10.12.6
latestarm64/amd641.1.141.1.144.0.04.0.04.4.6x1.33.10.12.6

** Note: Starting in version 4.0.0, we started providing multi-platform builds. (amd64/arm64)

References

The following are some of the more popular Delta Lake integrations, refer to delta.io/integrations for the complete list:

  • Apache Spark™: This connector allows Apache Spark™ to read from and write to Delta Lake.
  • Apache Flink: This connector allows Apache Flink to write to Delta Lake.
  • PrestoDB: This connector allows PrestoDB to read from Delta Lake.
  • Trino: This connector allows Trino to read from and write to Delta Lake.
  • Apache Hive: This connector allows Apache Hive to read from Delta Lake.
  • Delta Rust API: This library allows Rust (with Python and Ruby bindings) low level access to Delta tables and is intended to be used with data processing frameworks like datafusion, ballista, rust-dataframe, vega, etc.

Tag summary

Content type

Image

Digest

sha256:59f64f592

Size

1.9 GB

Last updated

4 months ago

docker pull deltaio/delta-docker

This week's pulls

Pulls:

104

Last week