Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: add more ORC parameters

...

  • Default Value: 256*1024*1024 (268,435,456)
  • Added In: Hive 0.14.0

xxxDefine the default file system block size for ORC files.

hive.exec.orc.dictionary.key.size.threshold
  • Default Value: xxx0.8
  • Added In: Hive 0.1312.0

xxxIf the number of keys in a dictionary is greater than this fraction of the total number of non-null rows, turn off dictionary encoding.  Use 1 to always use dictionary encoding.

hive.exec.orc.default.row.index.stride
  • Default Value: xxx10000
  • Added In: Hive 0.13.0

xxxDefine the default ORC index stride.

hive.exec.orc.default.buffer.size
  • Default Value: xxx256*1024 (262,144)
  • Added In: Hive 0.13.0

Define the default ORC buffer size.xxx

hive.exec.orc.default.block.padding
  • Default Value: xxxtrue
  • Added In: Hive 0.13.0

Define the default block padding.xxx

hive.exec.orc.block.padding.tolerance
  • Default Value: xxx0.05
  • Added In: Hive 0.

xxx

hive.exec.orc.
  • 14.0

Define the tolerance for block padding as a decimal fraction of stripe size (for example, the default value 0.05 is 5% of the stripe size). For the defaults of 64Mb ORC stripe and 256Mb HDFS blocks, a maximum of 3.2Mb will be reserved for padding within the 256Mb block with the default hive.exec.orc.block.padding.tolerance. In that case, if the available size within the block is more than 3.2Mb, a new smaller stripe will be inserted to fit within that space. This will make sure that no stripe written will cross block boundaries and cause remote reads within a node local task.

hive.exec.orc.default.compress
  • Default Value: ZLIBDefault Value: xxx
  • Added In: Hive 0.13.

xxx

hive.exec.orc.
  • Default Value: xxx
  • Added In: Hive 0.

xxx

 

 

  • 0

Define the default compression codec for ORC file.

hive.exec.orc.encoding.strategy
  • Default Value: SPEED
  • Added In: Hive 0.14.0

Define the encoding strategy to use while writing data. Changing this will only affect the light weight encoding for integers. This flag will not change the compression level of higher level compression codec (like ZLIB). Possible options are SPEED and COMPRESSION.

hive.orc.splits.include.file.footer

...

How many threads ORC should use to create splits in parallel.

hive.exec.orc

...

.

...

Use zerocopy reads with ORC. (This requires Hadoop 2.3 or later.)

hive.exec.orc.skip.corrupt.data
  • Default Value: false
  • Added In: Hive 0.13.0 with HIVE-6382

If ORC reader encounters corrupt data, this value will be used to determine whether to skip the corrupt data or throw an exception. The default behavior is to throw an exception.

hive.exec.orc.zerocopy

Use zerocopy reads with ORC. (This requires Hadoop 2.3 or later.)

MetaStore

hive.metastore.local

...