Skip to content

File

Apache ORC logo Apache ORC CSV logo CSV

Attribute Apache ORC CSV
Name Apache ORC CSV
Description ORC is a self-describing type-aware columnar file format designed for Hadoop workloads. Comma-Separated Values (CSV) is a text file format that uses commas to separate values in plain text.
License Apache license 2.0 N/A
Source code https://github.com/apache/orc
Website https://orc.apache.org/ https://www.rfc-editor.org/rfc/rfc4180.html
Year created 2013 0
Company Hortonworks, Facebook
Language support java, scala, c++, python java, scala, c++, python, r, php, go
Use cases Write once read many, Analytics, Efficient storage, ACID transactions
Is human readable
no
yes
Orientation row row
Has type system
yes
no
Has nested structure support
yes
no
Has native compression
yes
no
Has encoding support
yes
no
Has constraint support
no
no
Has acid support
no
no
Has metadata
yes
no
Has encryption support
yes
no
Data processing framework support Apache Flink, Apache Gobblin, Apache Hadoop, Apache NiFi, Apache Pig, Apache Spark, Apache Beam, Apache Drill, Apache Flink, Apache Gobblin, Apache Hive, Apache NiFi, Apache Pig, Apache Spark,
Analytics query support Apache Impala, Apache Druid, Apache Hive, Apache Pinot, AWS Athena, BigQuery, Clickhouse, Firebolt, Presto, Trino, Apache Impala, Apache Druid, Apache Pinot, AWS Athena, Azure Synapse, BigQuery, Clickhouse, Dremio, DuckDB, Firebolt,