...
This flag should be set to true to enable vectorized mode of the reduce-side GROUP BY query execution. The default value is true.
hive.vectorized.execution.mapjoin.native.enabled
- Default Value:
true
- Added In: Hive 1.2.0 with HIVE-9824
This flag should be set to true to enable native (i.e. non-pass through) vectorization of queries using MapJoin.
hive.vectorized.execution.mapjoin.native.multikey.only.enabled
- Default Value:
false
- Added In: Hive 1.2.0 with HIVE-9824
This flag should be set to true to restrict use of native vector map join hash tables to the MultiKey in queries using MapJoin.
hive.vectorized.execution.mapjoin.minmax.enabled
- Default Value:
false
- Added In: Hive 1.2.0 with HIVE-9824
This flag should be set to true to enable vector map join hash tables to use max / max filtering for integer join queries using MapJoin.
hive.vectorized.execution.mapjoin.overflow.repeated.threshold
- Default Value:
-1
- Added In: Hive 1.2.0 with HIVE-9824
The number of small table rows for a match in vector map join hash tables where we use the repeated field optimization in overflow vectorized row batch for join queries using MapJoin. A value of -1
means do use the join result optimization. Otherwise, threshold value can be 0 to maximum integer.
hive.vectorized.execution.mapjoin.native.fast.hashtable.enabled
- Default Value:
false
- Added In: Hive 1.2.0 with HIVE-9824
This flag should be set to true to enable use of native fast vector map join hash tables in queries using MapJoin.
hive.vectorized.groupby.checkinterval
- Default Value:
100000
- Added In: Hive 0.13.0 with HIVE-5692
...
For WebHCat configuration, see Configuration Variables in the WebHCat manual.