Versions Compared

Comment: init-cap "hive" and "zookeeper" where appropriate; minor edits

...

The default number of reduce tasks per job. Typically set to a prime close to the number of available hosts. Ignored when mapred.job.tracker is "local". Hadoop sets this to 1 by default, whereas Hive uses -1 as its default value. When this property is set to -1, Hive automatically determines the number of reducers.
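
As a brief illustration, the two behaviors described above could be selected per session with `SET` statements. This is only a sketch: it assumes the property described here is `mapred.reduce.tasks` (its name is elided in this excerpt), and the value 31 is an arbitrary prime chosen for the example.

```sql
-- Assumption: the property described above is mapred.reduce.tasks.
-- Let Hive work out the number of reducers on its own (Hive's default).
SET mapred.reduce.tasks=-1;

-- Alternatively, pin an explicit reducer count for subsequent jobs.
-- 31 is an arbitrary illustrative value, not a recommendation.
SET mapred.reduce.tasks=31;
```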

...

  • Default Value: nonstrict
  • Added In:

The mode in which Hive operations are performed. In strict mode, some risky queries are not allowed to run.
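
A minimal sketch of switching modes for a session follows. It assumes the property described here is `hive.mapred.mode` (the name is elided in this excerpt); the table `page_views` and column `view_time` are hypothetical.

```sql
-- Assumption: the property described above is hive.mapred.mode.
SET hive.mapred.mode=strict;

-- In strict mode an ORDER BY is expected to carry a LIMIT,
-- so this query includes one. Table and column names are hypothetical.
SELECT * FROM page_views ORDER BY view_time LIMIT 100;

-- Return to the default behavior.
SET hive.mapred.mode=nonstrict;
```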

...

This controls whether the final outputs of a query (to a local/HDFS file or a Hive table) are compressed. The compression codec and other options are determined from Hadoop configuration variables mapred.output.compress*.

hive.exec.compress.intermediate

...

This controls whether intermediate files produced by Hive between multiple map-reduce jobs are compressed. The compression codec and other options are determined from Hadoop configuration variables mapred.output.compress*.
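
The sketch below shows how the compression settings might be combined in one session. It assumes the final-output property is named `hive.exec.compress.output` (its name is elided in this excerpt) and uses Hadoop's `GzipCodec` purely as an example codec; any codec available on the cluster could be substituted.

```sql
-- Compress files passed between the map-reduce jobs of a single query.
SET hive.exec.compress.intermediate=true;

-- Assumption: the final-output property is hive.exec.compress.output.
SET hive.exec.compress.output=true;

-- One of the Hadoop mapred.output.compress* variables; GzipCodec is
-- only an illustrative choice of codec.
SET mapred.output.compression.codec=org.apache.hadoop.io.compress.GzipCodec;
```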

hive.exec.parallel
  • Default Value: false
  • Added In:

...

  • Default Value: false
  • Added In:

Let Hive determine whether to run in local mode automatically.
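
For example, automatic local mode could be enabled for a session as sketched here, assuming the property described is `hive.exec.mode.local.auto` (the name is elided in this excerpt).

```sql
-- Assumption: the property described above is hive.exec.mode.local.auto.
-- With this enabled, Hive decides per query whether local mode is suitable.
SET hive.exec.mode.local.auto=true;
```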

hive.exec.drop.ignorenonexistent

...

Do not report an error if DROP TABLE/VIEW specifies a non-existent table/view.
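
A short sketch of the effect, using hypothetical table and view names that are assumed not to exist:

```sql
-- With this setting enabled, dropping a missing object is not reported
-- as an error.
SET hive.exec.drop.ignorenonexistent=true;

-- Hypothetical names; neither object is assumed to exist.
DROP TABLE no_such_table;
DROP VIEW no_such_view;
```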

hive.exec.show.job.failure.debug.info

...

  • Default Value: false
  • Added In:

To clean up the Hive scratch directory while starting the Hive server.

hive.output.file.extension

...

The dfs.umask value for the Hive-created folders.

MetaStore

...

Use a level 2 cache. Turn this off if metadata is changed independently of the Hive metastore server.

datanucleus.cache.level2.type

...

  • Default Value: true
  • Added In:

If this is set to true, Hive will throw an error when doing ALTER TABLE tbl_name [partSpec] CONCATENATE on a table/partition that has indexes on it. The reason a user would want to set this to true is that it helps the user avoid handling all the index drop, recreation, and rebuild work. This is very helpful for tables with thousands of partitions.
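
To make the statement form concrete, a sketch of a CONCATENATE call with the check enabled is shown below. It assumes the property described here is `hive.exec.concatenate.check.index` (the name is elided in this excerpt); the table `page_views` and its partition column `ds` are hypothetical.

```sql
-- Assumption: the property described above is
-- hive.exec.concatenate.check.index.
SET hive.exec.concatenate.check.index=true;

-- Hypothetical table and partition. If the table or partition has
-- indexes, this statement is expected to fail with the check enabled.
ALTER TABLE page_views PARTITION (ds='2012-01-01') CONCATENATE;
```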

Statistics

hive.stats.dbclass

...

The default database that stores temporary Hive statistics.

hive.stats.autogather

...

The JDBC driver for the database that stores temporary Hive statistics.

hive.stats.dbconnectionstring

...

The default connection string for the database that stores temporary Hive statistics.

hive.stats.default.publisher

...

In new Hadoop versions, the parent directory must be set while creating a HAR. Because this functionality is hard to detect with just version numbers, this configuration variable needs to be set manually.

...

  • Default Value: false
  • Added In:

Whether Hive supports concurrency or not. A Zookeeper instance must be up and running for the default Hive lock manager to support read-write locks.

...

  • Default Value: false
  • Added In:

This configuration property controls whether to lock only queries that need to execute at least one mapred job.

...

  • Default Value: (empty)
  • Added In:

The list of Zookeeper servers to talk to. This is only needed for read/write locks.

...

  • Default Value: 2181
  • Added In:

The port of Zookeeper servers to talk to. This is only needed for read/write locks.
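
The locking-related settings above might be combined as in the following sketch. It assumes the properties are named `hive.support.concurrency`, `hive.zookeeper.quorum`, and `hive.zookeeper.client.port` (the names are elided in this excerpt), and the host names are placeholders. In practice these values are often placed in the site configuration rather than set per session.

```sql
-- Assumption: these are the property names behind the descriptions above.
SET hive.support.concurrency=true;

-- Placeholder host names; replace with the actual Zookeeper servers.
SET hive.zookeeper.quorum=zk1.example.com,zk2.example.com,zk3.example.com;

-- 2181 is the documented default port.
SET hive.zookeeper.client.port=2181;
```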

...

The parent node under which all Zookeeper nodes are created.

hive.zookeeper.clean.extra.nodes

...