Delta Lake Blogs

New features in the Python deltalake 0.12.0 release
This post explains the new features in the Python deltalake 0.12.0 release

Delta Lake 3.0.0
By Carly Akerly
This post describes the exiting features in the Delta Lake 3.0.0 release

Delta Lake vs. Parquet Comparison
This post compares the stengths and weaknesses of Delta Lake vs Parquet.

Delta Lake vs. ORC Comparison
By Avril Aysha
This post compares the stengths and weaknesses of Delta Lake vs ORC.

Unlock Delta Lakes for PyTorch Training with DeltaTorch
By Daniel Liden , Michael Shtelma
This post demonstrates how to create PyTorch DataLoaders using Delta tables as data sources for training deep learning models.

Introducing Delta Lake Table Features
By Nick Karpov
This introduces Delta Lake Table Features, a discrete feature-based compatibility scheme that replaces the traditional integer protocol versioning for Delta Lake tables and clients.

Delta Lake Change Data Feed (CDF)
By Nick Karpov , Matthew Powers
This blog shows how to enable and use the Delta Lake Change Data Feed.

Delta Lake’s transaction log protocol and its implementations
This blog explains the Delta Lake transaction log protocol and its various implementation.

Delta Lake Deletion Vectors
By Nick Karpov
This blog introduces the new Deletion Vectors table feature for Delta Lake tables, and explains how Deletion Vectors speed up operations that modify existing data in your lakehouse.

Using Ibis with PySpark on Delta Lake tables
By Marlene Mhangami , Matthew Powers
This post explains how to use Ibis to query Delta tables with PySpark

Delta Lake Z Order
This post explains how to use Delta Lake Z Order to make your queries run faster

Delta Lake 2.3.0 Released
By Allison Portis , Matthew Powers
This post explains some of the key features in the Delta Lake 2.3.0 release