Page History

...

The default number of reduce tasks per job. Typically set to a prime close to the number of available hosts. Ignored when mapred.job.tracker is "local". Hadoop set this to 1 by default, whereas hive Hive uses -1 as its default value. By setting this property to -1, Hive will automatically figure out what should be the number of reducers.

...

Default Value: nonstrict
Added In:

The mode in which the hive Hive operations are being performed. In strict mode, some risky queries are not allowed to run.

...

This controls whether the final outputs of a query (to a local/hdfs file or a hive Hive table) is compressed. The compression codec and other options are determined from hadoop config Hadoop configuration variables mapred.output.compress* .

hive.exec.compress.intermediate

...

This controls whether intermediate files produced by hive Hive between multiple map-reduce jobs are compressed. The compression codec and other options are determined from hadoop config Hadoop configuration variables mapred.output.compress*.

hive.exec.parallel

Default Value: false
Added In:

...

Default Value: false
Added In:

Let hive Hive determine whether to run in local mode automatically.

hive.exec.drop.ignorenonexistent

...

Do not report an error if DROP TABLE/VIEW specifies a non-existent.
table/view.

hive.exec.show.job.failure.debug.info

...

Default Value: false
Added In:

To cleanup the hive Hive scratch dir directory while starting the hive Hive server.

hive.output.file.extension

...

The dfs.umask value for the hive Hive-created folders.

MetaStore

...

Use a level 2 cache. Turn this off if metadata is changed independently of hive Hive metastore server.

datanucleus.cache.level2.type

...

Default Value: true
Added In:

Wiki Markup
If this sets to true,

hive

 Hive will throw error when doing ALTER TABLE tbl_name \[partSpec\] CONCATENATE on a table/partition that has indexes on it. The reason the user want to set this to true is because it can help user to avoid handling all index drop, recreation, rebuild work. This is very helpful for tables with thousands of partitions.

Statistics

hive.stats.dbclass

...

The default database that stores temporary hive Hive statistics.

hive.stats.autogather

...

The JDBC driver for the database that stores temporary hive
Hive statistics.

hive.stats.dbconnectionstring

...

The default connection string for the database that stores temporary hive Hive statistics.

hive.stats.default.publisher

...

In new Hadoop versions, the parent directory must be set while creating a HAR. Because this functionality is hard to detect with just version numbers, this conf var configuration variable needs to be set manually.

...

Default Value: false
Added In:

Whether hive Hive supports concurrency or not. A zookeeper Zookeeper instance must be up and running for the default hive Hive lock manager to support read-write locks.

...

Default Value: false
Added In:

This param configuration property is to control whether or not only do lock on queries that need to execute at least one mapred job.

...

Default Value: (empty)
Added In:

The list of zookeeper Zookeeper servers to talk to. This is only needed for read/write locks.

...

Default Value: 2181
Added In:

The port of zookeeper Zookeeper servers to talk to. This is only needed for read/write locks.

...

The parent node under which all zookeeper Zookeeper nodes are created.

hive.zookeeper.clean.extra.nodes

...

Space shortcuts

Child pages

Versions Compared

Old Version 19

New Version 20

Key