Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: make trademarked terms bold to match style prior to Confluence upgrade

Apache Hive

The Apache HiveTM data warehouse software facilitates querying and managing large datasets residing in distributed storage. Built on top of Apache HadoopTM, it provides

  • Tools to enable easy data extract/transform/load (ETL)
  • A mechanism to impose structure on a variety of data formats
  • Access to files stored either directly in Apache HDFSTM or in other data storage systems such as Apache HBaseTM 

  • Query execution via MapReduce

...