What is Apache ORC?
ORC is a self-describing, type-aware columnar file format designed for Hadoop workloads. It is optimized for large streaming reads and includes integrated support for finding required rows quickly. Storing data in a columnar format lets the reader read, decompress, and process only the values required for the current query. Because ORC files are type-aware, the writer chooses the most appropriate encoding for each type and builds an internal index as the file is written.
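The idea of reading only the required columns and using an index to skip rows can be illustrated with a small sketch. This is a toy in-memory model, not ORC's actual file layout or any ORC library API. The names `Stripe`, `write_stripes`, and `read_column` are hypothetical and exist only for illustration. It assumes rows are grouped into fixed-size chunks and that each chunk records a minimum and maximum per column.

```python
from dataclasses import dataclass


@dataclass
class Stripe:
    """Toy row group (hypothetical): values stored per column, plus min/max stats."""
    columns: dict  # column name -> list of values
    stats: dict    # column name -> (min, max)


def write_stripes(rows, stripe_size):
    """Split row dicts into stripes, storing values column by column."""
    stripes = []
    for start in range(0, len(rows), stripe_size):
        chunk = rows[start:start + stripe_size]
        columns = {name: [row[name] for row in chunk] for name in chunk[0]}
        # The "index" is built while writing: one (min, max) pair per column.
        stats = {name: (min(vals), max(vals)) for name, vals in columns.items()}
        stripes.append(Stripe(columns, stats))
    return stripes


def read_column(stripes, wanted, filter_col, lower_bound):
    """Return values of `wanted` for rows where filter_col >= lower_bound.

    Only two columns are touched, and any stripe whose maximum is below
    the bound is skipped without scanning its values.
    """
    out = []
    skipped = 0
    for stripe in stripes:
        _, col_max = stripe.stats[filter_col]
        if col_max < lower_bound:
            skipped += 1
            continue
        for value, key in zip(stripe.columns[wanted], stripe.columns[filter_col]):
            if key >= lower_bound:
                out.append(value)
    return out, skipped


# Made-up sample data: nine rows with three columns.
rows = [{"id": i, "name": f"user{i}", "score": i * 10} for i in range(9)]
stripes = write_stripes(rows, stripe_size=3)
names, skipped = read_column(stripes, "name", "score", 60)
print(names, skipped)
```

In this sketch the `id` column is never read, and two of the three stripes are skipped based on their statistics alone. A real ORC reader applies the same principle to compressed, encoded column data on disk.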
Technical Details
| Mobile Application | No |
|---|---|
FAQs
What is Apache ORC?
Apache ORC is a columnar file format presented as the smallest, fastest columnar storage for Hadoop workloads.
How much does Apache ORC cost?
Apache ORC starts at $0. It is open-source software released under the Apache License 2.0.