Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: link to maven instructions

...

  • !SerDe is a short name for "Serializer and Deserializer."
  • Hive uses SerDe (and !FileFormat) to read and write table rows.
  • HDFS files -!InputFileFormat)--> <key, value> --(Deserializer-> Row object
  • Row object -Serializer)--> <key, value> --(!OutputFileFormat-> HDFS files

...

A helpful overview of the Hive query processor can be found in this Hive Anatomy slide deck.

Compiling and Running Hive

h3 Hive now uses maven for build, see updated hive maven build instructions

Hive can be made to compile against different versions of Hadoop.

...