There are some cases your data is stored in Hive Table, and you may want to process the data using Apache Pig. In this post, I use an example to describe how to read Hive ORC data using Apache Pig.
- We first create Hive table stored as ORC, and load some data into the table.
- Then, we develop a Apache Pig script to load the data from the Hive ORC table.
Optimized Row Columnar (ORC) file format
The Optimized Row Columnar (ORC) file format provides a highly efficient way to store Hive data.[Read More...]