Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: add version numbers, misc. edits, hive.mergejob.maponly removed in 0.11.0

...

  • Default Value: 2147483647
  • Added In: Hive 0.4.0 with HIVE-352
hive.io.rcfile.column.number.conf
  • Default Value: 0
  • Added In: Hive 0.4.0 with HIVE-352
hive.io.rcfile.tolerate.corruptions
  • Default Value: false
  • Added In: Hive 0.4.0 with HIVE-352
hive.io.rcfile.record.buffer.size
  • Default Value: 4194304
  • Added In: Hive 0.4.0 with HIVE-352
hive.orc.splits.include.file.footer

...

Whether Hive should periodically update task progress counters during execution. Enabling this allows task progress to be monitored more closely in the job tracker, but may impose a performance penalty. This flag is automatically set to true for jobs with hive.exec.dynamic.partition set to true. (This configuration property was removed in release 0.13.0.)

...

  • Default Value: true
  • Added In: Hive 0.4.0

Merge small files at the end of a map-only job.

...

  • Default Value: false
  • Added In: Hive 0.4.0

Merge small files at the end of a map-reduce job.

...

  • Default Value: true
  • Added In: Hive 0.6.0
  • Removed In: Hive 0.11.0

Try to generate a map-only job for merging files if CombineHiveInputFormat is supported. (This configuration property was removed in release 0.11.0.)

hive.merge.size.per.task
  • Default Value: 256000000
  • Added In: Hive 0.4.0

Size of merged files at the end of the job.

...

  • Default Value: 16000000
  • Added In: Hive 0.5.0

When the average output file size of a job is less than this number, Hive will start an additional map-reduce job to merge the output files into bigger files. This is only done for map-only jobs if hive.merge.mapfiles is true, and for map-reduce jobs if hive.merge.mapredfiles is true.

...

  • Default Value: 1000
  • Added In: Hive 0.4.0

Send a heartbeat after this interval – used by mapjoin and filter operators.

...

  • Default Value: false
  • Added In: Hive 0.4.0

Whether Hive Tranform/Map/Reduce Clause should automatically send progress information to TaskTracker to avoid the task getting killed because of inactivity. Hive sends progress information when the script is outputting to stderr. This option removes the need of periodically producing stderr messages, but users should be cautious because this may prevent infinite loops in the scripts to be killed by TaskTracker.

...

  • Default Value: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
  • Added In: Hive 0.4.0

The default SerDe for transmitting input data to and reading output data from the user scripts.

...

  • Default Value: org.apache.hadoop.hive.ql.exec.TextRecordReader
  • Added In: Hive 0.4.0

The default record reader for reading data from the user scripts.

...

  • Default Value: org.apache.hadoop.hive.ql.exec.TextRecordWriter
  • Added In: Hive 0.5.0

The default record writer for writing data to the user scripts.

...

  • Default Value: org.apache.hadoop.hive.ql.io.CombineHiveInputFormat
  • Added In: Hive 0.5.0

The default input format. Set this to HiveInputFormat if you encounter problems with CombineHiveInputFormat.

...

  • Default Value: false
  • Added In: Hive 0.5.0

Whether Hive should automatically send progress information to TaskTracker when using UDTF's to prevent the task getting killed because of inactivity. Users should be cautious because this may prevent TaskTracker from killing tasks with infinite loops.

...

  • Default Value: true
  • Added In: Hive 0.5.0

Whether speculative execution for reducers should be turned on.

...

  • Default Value: 1000
  • Added In: Hive 0.6.0

The interval with which to poll the JobTracker for the counters the running job. The smaller it is the more load there will be on the jobtracker, the higher it is the less granular the caught will be.

...

  • Default Value: false
  • Added In: Hive 0.6.0

Whether bucketing is enforced. If true, while inserting into the table, bucketing is enforced.

...

  • Default Value: false
  • Added In: Hive 0.6.0

Whether sorting is enforced. If true, while inserting into the table, sorting is enforced.

...

  • Default Value: false
  • Added In: Hive 0.6.0

Whether or not to allow dynamic partitions in DML/DDL.

...

  • Default Value: strict
  • Added In: Hive 0.6.0

In strict mode, the user must specify at least one static partition in case the user accidentally overwrites all partitions.

...

  • Default Value: 1000
  • Added In: Hive 0.6.0

Maximum number of dynamic partitions allowed to be created in total.

...

  • Default Value: 100
  • Added In: Hive 0.6.0

Maximum number of dynamic partitions allowed to be created in each mapper/reducer node.

...

  • Default Value: 100000
  • Added In: Hive 0.7.0

Maximum number of HDFS files created by all mappers/reducers in a MapReduce job.

...

  • Default Value: _HIVE_DEFAULT_PARTITION_
  • Added In: Hive 0.6.0

The default partition name in case the dynamic partition column value is null/empty string or any other values that cannot be escaped. This value must not contain any special character used in HDFS URI (e.g., ':', '%', '/' etc). The user has to be aware that the dynamic partition value should not contain this value to avoid confusions.

...

  • Default Value: org.apache.hadoop.hive.serde2.DelimitedJSONSerDe
  • Added In: Hive 0.7.0

The SerDe used by FetchTask to serialize the fetch output.

...

  • Default Value: false
  • Added In: Hive 0.7.0

Let Hive determine whether to run in local mode automatically.

...

  • Default Value: true
  • Added In: Hive 0.7.0

Do not report an error if DROP TABLE/VIEW specifies a non-existent table/view.

...

  • Default Value: true
  • Added In: Hive 0.7.0

If a job fails, whether to provide a link in the CLI to the task with the most failures, along with debugging hints if applicable.

...

  • Default Value: false
  • Added In: Hive 0.8.0

Whether to enable to optimization to trying a smaller subset of data for simple LIMIT first.

...

Controls whether to connect to remote metastore server or open a new metastore server in Hive Client JVM. As of Hive 0.10 this is no longer used. Instead if hive.metastore.uris is set then remote mode is assumed otherwise local.

...

  • Default Value: DBCP in Hive 0.7 to 0.11; BoneCP in 0.12 and later 
  • Added In: Hive 0.7.0

Uses a BoneCP connection pool for JDBC metastore in release 0.12 and later (HIVE-4807), or a DBCP connection pool in releases 0.7 to 0.11.

...

  • Default Value: false
  • Added In: Hive 0.7.0

Validates existing schema against code. Turn this on if you want to verify existing schema

...

  • Default Value: false
  • Added In: Hive 0.7.0

Validates existing schema against code. Turn this on if you want to verify existing schema.

...

  • Default Value: false
  • Added In: Hive 0.7.0

Validates existing schema against code. Turn this on if you want to verify existing schema.

...

  • Default Value: rdbms
  • Added In: Hive 0.7.0

Metadata store type.

datanucleus.autoCreateSchema

Creates necessary schema on a startup if one doesn't exist. Set this to false, after creating it once.

...

  • Default Value: checked
  • Added In: Hive 0.7.0

Throw exception if metadata tables are incorrect.

...

  • Default Value: read-committed
  • Added In: Hive 0.7.0

Default transaction isolation level for identity generation.

...

  • Default Value: false
  • Added In: Hive 0.7.0

This parameter does nothing.
Warning note: For most installations, Hive should not enable the DataNucleus L2 cache, since this can cause correctness issues. Thus, some people set this parameter to false assuming that this disables the cache – unfortunately, it does not. To actually disable the cache, set datanucleus.cache.level2.type to "none".

...

  • Default Value: none in Hive 0.9 and later; SOFT in Hive 0.7 to 0.8.1
  • Added In: Hive 0.7.0

NONE = disable the datanucleus level 2 cache, SOFT = soft reference based cache, WEAK = weak reference based cache.
Warning note: For most Hive installations, enabling the datanucleus cache can lead to correctness issues, and is dangerous. This should be left  as "none".

...

  • Default Value: datanucleus
  • Added In: Hive 0.7.0

Name of the identifier factory to use when generating table/column names etc. 'datanucleus' is used for backward compatibility.

...

  • Default Value: LOG
  • Added In: Hive 0.7.0

Defines what happens when plugin bundles are found and are duplicated: EXCEPTION, LOG, or NONE.

...

  • Default Value: /user/hive/warehouse
  • Added In: Hive 0.2.0

Location of default database for the warehouse.

...

  • Default Value: (empty)
  • Added In: Hive 0.7.0

The path to the Kerberos Keytab file containing the metastore thrift server's service principal.

...

  • Default Value: hive-metastore/_HOST@EXAMPLE.COM
  • Added In: Hive 0.7.0

The service principal for the metastore thrift server. The special string _HOST will be replaced automatically with the correct host name.

...

  • Default Value: false
  • Added In: Hive 0.7.0

Enable or disable the Hive client authorization.

...

  • Default Value: org.apache.hadoop.hive.ql.security.authorization.DefaultHiveAuthorizationProvider
  • Added In: Hive 0.7.0

The Hive client authorization manager class name. The user defined authorization class should implement interface org.apache.hadoop.hive.ql.security.authorization.HiveAuthorizationProvider.

...

  • Default Value: org.apache.hadoop.hive.ql.security.HadoopDefaultAuthenticator
  • Added In: Hive 0.7.0

Hive client authenticator manager class name. The user-defined authenticator should implement interface org.apache.hadoop.hive.ql.security.HiveAuthenticationProvider.

...

  • Default Value: (empty)
  • Added In: Hive 0.7.0

The privileges automatically granted to some users whenever a table gets created. An example like "userX,userY:select;userZ:create" will grant select privilege to userX and userY, and grant create privilege to userZ whenever a new table created.

...

  • Default Value: (empty)
  • Added In: Hive 0.7.0

The privileges automatically granted to some groups whenever a table gets created. An example like "groupX,groupY:select;groupZ:create" will grant select privilege to groupX and groupY, and grant create privilege to groupZ whenever a new table created.

...

  • Default Value: (empty)
  • Added In: Hive 0.7.0

The privileges automatically granted to some roles whenever a table gets created. An example like "roleX,roleY:select;roleZ:create" will grant select privilege to roleX and roleY, and grant create privilege to roleZ whenever a new table created.

...

  • Default Value: (empty)
  • Added In: Hive 0.7.0

The privileges automatically granted to the owner whenever a table gets created. An example like "select,drop" will grant select and drop privilege to the owner of the table.

...

  • Default Value: org.apache.hadoop.hive.shims.HiveHarFileSystem
  • Added In: Hive 0.8.1

The implementation for accessing Hadoop Archives. Note that this won't be applicable to Hadoop versions less than 0.20.

...

  • Default Value: false
  • Added In: Hive 0.6.0

Whether archiving operations are permitted.

...

  • Default Value: false
  • Added In: Hive 0.7.0

Whether Hive supports concurrency or not. A Zookeeper instance must be up and running for the default Hive lock manager to support read-write locks.

...

  • Default Value: false
  • Added In: Hive 0.8.0

This configuration property is to control whether or not only do lock on queries that need to execute at least one mapred job.

...

  • Default Value: 100
  • Added In: Hive 0.7.0

The number of times you want to try to get all the locks.

...

  • Default Value: 10
  • Added In: Hive 0.8.1

The number of times you want to retry to do one unlock.

...

  • Default Value: 60
  • Added In: Hive 0.7.0

The sleep time (in seconds) between various retries.

...

  • Default Value: (empty)
  • Added In: Hive 0.7.0

The list of Zookeeper servers to talk to. This is only needed for read/write locks.

...

  • Default Value: 2181
  • Added In: Hive 0.7.0

The port of Zookeeper servers to talk to. This is only needed for read/write locks.

...

  • Default Value: 600000
  • Added In: Hive 0.7.0

Zookeeper client's session timeout. The client is disconnected, and as a result, all locks released, if a heartbeat is not sent in the timeout.

...

  • Default Value: hive_zookeeper_namespace
  • Added In: Hive 0.7.0

The parent node under which all Zookeeper nodes are created.

...

  • Default Value: false
  • Added In: Hive 0.7.0

Clean extra nodes at the end of the session.

...

  • Default Value: org.apache.hadoop.hive.thrift.MemoryTokenStore
  • Added In: Hive 0.9.0

The delegation token store implementation. Set to org.apache.hadoop.hive.thrift.ZooKeeperTokenStore for load-balanced cluster.

...

  • Default Value: localhost:2181
  • Added In: Hive 0.9.0

The ZooKeeper token store connect string.

...

  • Default Value: /hive/cluster/delegation
  • Added In: Hive 0.9.0

The root path for token store data.

...

  • Default Value: sasl:hive/host1@EXAMPLE.COM:cdrwa,sasl:hive/host2@EXAMPLE.COM:cdrwa
  • Added In: Hive 0.9.0

ACL for token store entries. List comma separated all server principals for the cluster.

...

  • Default Value: false
  • Added In: Hive 0.7.0

Whether to print the names of the columns in query output.

...

  • Default Value: false
  • Added In: Hive 0.8.1

Whether to include the current database in the Hive prompt.

...

  • Default Value: lib/hive-hwi-<version>.war
  • Added In: Hive 0.3.0 with default lib/hive_hwi.war, default changed to lib/hive-hwi-<version>.war in Hive 0.5 (HIVE-978 and HIVE-1183)

...

  • Default Value: 0.0.0.0
  • Added In: Hive 0.3.0

This is the host address the Hive Web Interface will listen on.

...

  • Default Value: 9999
  • Added In: Hive 0.3.0

This is the port the Hive Web Interface will listen on.

...

  • Default Value: false
  • Added In: Hive 0.4.0

Whether Hive is running in test mode. If yes, it turns on sampling and prefixes the output tablename.

...

  • Default Value: test_
  • Added In: Hive 0.4.0

If Hive is running in test mode, prefixes the output table by this string.

...

  • Default Value: 32
  • Added In: Hive 0.4.0

If Hive is running in test mode and table is not bucketed, sampling frequency.

...

  • Default Value: (empty)
  • Added In: Hive 0.4.0

If Hive is running in test mode, don't sample the above comma separated list of tables.

...