Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: add new & missing statistics configs

...

If this sets to true, Hive will throw error when doing ALTER TABLE tbl_name [partSpec] CONCATENATE on a table/partition that has indexes on it. The reason the user want to set this to true is because it can help user to avoid handling all index drop, recreation, rebuild work. This is very helpful for tables with thousands of partitions.

Statistics

See Statistics in Hive for information about how to collect and use Hive table statistics. 

hive.stats.dbclass
  • Default Value: jdbc:derby (Hive 0.7 to 0.12) or counter (Hive 0.13 and later)
  • Added In: Hive 0.7 with HIVE-1361

Hive 0.7 to 0.12:  The default database that stores temporary Hive statistics.  Other options are jdbc:mysql and hbase as defined in StatsSetupConst.java.

Hive 0.13 and later:  The storage that stores temporary Hive statistics. Supported values are jdbc, hbase, counter and custom (HIVE-4632).

...

Timeout value (number of seconds) used by JDBC connection and statements.

hive.stats.atomic
  • Default Value: false
  • Added In: Hive 0.7 with HIVE-1961

If this is set to true then the metastore statistics will be updated only if all types of statistics (number of rows, number of files, number of bytes, etc.) are available. Otherwise metastore statistics are updated in a best effort fashion with whatever are available.

hive.stats.retries.max
  • Default Value: 0
  • Added In: Hive 0.8 with HIVE-2127

...

The base waiting window (in milliseconds) before the next retry. The actual wait time is calculated by baseWindow * failues + baseWindow * (failure + 1) * (random number between 0.0,1.0).

hive.stats.

...

collect.rawdatasize
  • Default Value: false true
  • Added In: Hive 0.8 with HIVE-2185

If true, the raw data size is collected when analyzing tables.

hive.stats.reliable
  • Default Value: false
  • Added In: Hive 0.10.0 with HIVE-1653
  • New Behavior In:  Hive 0.13.0 with HIVE-3777

...

Subset of counters that should be of interest for hive.client.stats.publishers (when one wants to limit their publishing). Non-display names should be used.

 

 

hive.compute.query.using.stats
  • Default Value: false
  • Added In: Hive 0.13.0 with HIVE-5483

When set to true Hive will answer a few queries like count(1) purely using statistics stored in the metastore. For basic statistics collection, set the configuration property hive.stats.autogather to true. For more advanced statistics collection, run ANALYZE TABLE queries.

 

 

hive.stats.xxx
  • Default Value: xxx
  • Added In: Hive 0.# with HIVE-####

xxx

 

Authentication/Authorization

...