Skip to content

File

Delta Lake logo Delta Lake Apache Parquet logo Apache Parquet

Attribute Delta Lake Apache Parquet
Name Delta Lake Apache Parquet
Description Delta Lake is an open-source storage framework that enables building a Lakehouse architecture. Apache Parquet is an open source, column-oriented data file format designed for efficient data storage and retrieval.
License Apache license 2.0 Apache license 2.0
Source code https://github.com/delta-io/delta https://github.com/apache/parquet-format
Website https://delta.io/ https://parquet.apache.org/
Year created 2019 2013
Company Databricks Twitter, Cloudera
Language support scala, java, python, rust java, scala, c++, python, r, php
Use cases Write once read many, Analytics, Efficient storage, ACID transactions Write once read many, Analytics, Efficient storage, Column based queries
Is human readable
no
no
Orientation column column
Has type system
yes
yes
Has nested structure support
yes
yes
Has native compression
yes
yes
Has encoding support
yes
yes
Has constraint support
yes
no
Has acid support
yes
no
Has metadata
yes
yes
Has encryption support
maybe
yes
Data processing framework support Apache Drill, Apache Flink, Apache Spark, Apache Beam, Apache Drill, Apache Flink, Apache Spark,
Analytics query support Apache Hive, AWS Athena, Azure Synapse, BigQuery, Clickhouse, Dremio, Presto, Trino, Apache Hive, Apache Impala, Apache Druid, Apache Pinot, AWS Athena, Azure Synapse, BigQuery, Clickhouse, Dremio, DuckDB, Firebolt,