Skip to content

File

Delta Lake logo Delta Lake Apache Hudi logo Apache Hudi

Attribute Delta Lake Apache Hudi
Name Delta Lake Apache Hudi
Description Delta Lake is an open-source storage framework that enables building a Lakehouse architecture. Apache Hudi is a transactional data lake platform that brings database and data warehouse capabilities to the data lake. Utilises data stored in either parquet or orc.
License Apache license 2.0 Apache license 2.0
Source code https://github.com/delta-io/delta https://github.com/apache/hudi
Website https://delta.io/ https://hudi.apache.org/
Year created 2019 2016
Company Databricks Uber
Language support scala, java, python, rust
Use cases Write once read many, Analytics, Efficient storage, ACID transactions Incremental data processing, Data upserts, Change Data Capture (CDC), ACID transactions
Is human readable
no
no
Orientation column column or row
Has type system
yes
yes
Has nested structure support
yes
yes
Has native compression
yes
yes
Has encoding support
yes
yes
Has constraint support
yes
yes
Has acid support
yes
yes
Has metadata
yes
yes
Has encryption support
maybe
maybe
Data processing framework support Apache Drill, Apache Flink, Apache Spark, Apache Spark, Apache Flink,
Analytics query support Apache Hive, AWS Athena, Azure Synapse, BigQuery, Clickhouse, Dremio, Presto, Trino, Apache Hive, Apache Impala, AWS Athena, BigQuery, Clickhouse, Presto, Trino,