...

Parquet (http://parquet.io/) is an ecosystem-wide columnar format for Hadoop. Read "Dremel made simple with Parquet" for a good introduction to the format itself. At the time of this writing, Parquet supports the following engines and data description languages:

Engines

  • Apache Hive
  • Apache Drill
  • Cloudera Impala
  • Apache Crunch
  • Apache Pig
  • Cascading

...