Apache Iceberg vs Apache ORC

Attribute Apache Iceberg Apache ORC
Name Apache Iceberg Apache ORC
Description Iceberg is a high-performance format for huge analytic tables. It uses data stored in Parquet, Avro, or ORC files. ORC is a self-describing, type-aware columnar file format designed for Hadoop workloads.
License Apache License 2.0 Apache License 2.0
Source code https://github.com/apache/iceberg https://github.com/apache/orc
Website https://iceberg.apache.org/ https://orc.apache.org/
Year created 2017 2013
Company Netflix Hortonworks, Facebook
Language support Java, Scala, C++, Python
Use cases Write once read many, Analytics, Efficient storage, ACID transactions Write once read many, Analytics, Efficient storage, ACID transactions
Is human readable no no
Orientation column or row column
Has type system yes yes
Has nested structure support yes yes
Has native compression yes yes
Has encoding support yes yes
Has constraint support no no
Has ACID support yes no
Has metadata yes yes
Has encryption support maybe yes
Data processing framework support Apache Iceberg: Apache Drill, Apache Flink, Apache Gobblin, Apache Pig, Apache Spark. Apache ORC: Apache Flink, Apache Gobblin, Apache Hadoop, Apache NiFi, Apache Pig, Apache Spark.
Analytics query support Apache Iceberg: Apache Impala, Apache Druid, Apache Hive, AWS Athena, BigQuery, ClickHouse, Dremio, DuckDB, Presto, Trino. Apache ORC: Apache Impala, Apache Druid, Apache Hive, Apache Pinot, AWS Athena, BigQuery, ClickHouse, Firebolt, Presto, Trino.
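
The orientation and encoding attributes in the comparison can be made concrete with a small sketch. The Python below is a toy illustration only and does not reflect how ORC or Iceberg actually lay out bytes on disk; the sample records and the helper names `to_columns` and `run_length_encode` are hypothetical. It shows why a column-oriented layout lets a reader touch a single field, and why repeated values in a column tend to encode compactly.

```python
from itertools import groupby

# Hypothetical sample records in row orientation: one dict per record.
rows = [
    {"id": 1, "country": "NL", "amount": 10.0},
    {"id": 2, "country": "NL", "amount": 12.5},
    {"id": 3, "country": "US", "amount": 7.0},
    {"id": 4, "country": "US", "amount": 3.5},
]


def to_columns(records):
    """Pivot row-oriented records into one list per field (column orientation)."""
    columns = {}
    for record in records:
        for name, value in record.items():
            columns.setdefault(name, []).append(value)
    return columns


def run_length_encode(values):
    """Collapse consecutive repeats into (value, count) pairs."""
    return [(value, len(list(group))) for value, group in groupby(values)]


columns = to_columns(rows)

# An aggregate over one field only needs that field's column.
total_amount = sum(columns["amount"])

# Repeated neighbouring values shrink once they sit next to each other.
encoded_country = run_length_encode(columns["country"])

print(total_amount)     # 33.0
print(encoded_country)  # [('NL', 2), ('US', 2)]
```

Real columnar formats combine several encodings with block compression; run-length encoding is used here only because it is the simplest one to show in a few lines.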