Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Minor edits

1

Table of Contents

Apache Hive

...

Hive provides standard SQL functionality, including many of the later 2003 and 2011 features for analytics.  HiveHive's SQL can also be extended with user code via User Defined Functions user defined functions (UDFs), user defined aggregates (UDAFs), and user defined table functions (UDTFs).

There is not a single "Hive format" in which data must be stored.  Hive Hive comes with built in connectors for comma-separated values (CSV) text files, Apache ParquetTMApache ORCTM, and other formats.  Users Users can extend Hive with connectors for other formats.  Please Please see File Formats and Hive SerDe in the Developer Guide for details.

Hive is not designed for online transaction processing (OLTP) workloads. It is best used for traditional data warehousing tasks.  Hive Hive is designed to maximize scalability (scale out with more machines added dynamically to the Hadoop cluster), performance, extensibility, fault-tolerance, and loose-coupling with its input formats.

...

...

Apache Hive, Apache Hadoop, Apache HBase, Apache HDFS, Apache, the Apache feather logo, and the Apache Hive project logo are trademarks of The Apache Software Foundation.