Apache Hive

The Apache Hive™ data warehouse software facilitates querying and managing large datasets residing in distributed storage. Built on top of Apache Hadoop™, it provides

  • Tools to enable easy data extract/transform/load (ETL)
  • A mechanism to impose structure on a variety of data formats
  • Access to files stored either directly in Apache HDFS™ or in other data storage systems such as Apache HBase™

  • Query execution via MapReduce
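
To make two of the ideas above concrete, here is a minimal Python sketch of imposing structure on raw delimited text at read time and then answering a grouped count with a map step and a reduce step. It is an illustration only, not Hive's API or implementation: the sample lines, the `SCHEMA` list, and every function name are hypothetical, and the whole thing runs in a single process rather than on Hadoop.

```python
from collections import defaultdict

# Hypothetical raw lines, as they might sit in a tab-delimited file in storage.
RAW_LINES = [
    "1\talice\tUS",
    "2\tbob\tDE",
    "3\tcarol\tUS",
]

# Hypothetical schema: column names and types applied only when data is read.
SCHEMA = [("user_id", int), ("name", str), ("country", str)]


def apply_schema(line, schema=SCHEMA, delimiter="\t"):
    """Turn one raw text line into a typed record (structure imposed on read)."""
    fields = line.split(delimiter)
    return {name: cast(value) for (name, cast), value in zip(schema, fields)}


def map_phase(records):
    """Emit (country, 1) pairs, like the map step of a count grouped by country."""
    for record in records:
        yield record["country"], 1


def reduce_phase(pairs):
    """Sum the emitted values per key, like the reduce step."""
    totals = defaultdict(int)
    for key, value in pairs:
        totals[key] += value
    return dict(totals)


def count_by_country(lines):
    """Run the schema, map, and reduce steps over an iterable of raw lines."""
    records = (apply_schema(line) for line in lines)
    return reduce_phase(map_phase(records))
```

Calling `count_by_country(RAW_LINES)` groups the three sample rows by their third column. The raw text itself is never rewritten; the column names and types live only in the schema that is applied while reading.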

...