...

This is the HDFS root directory under which Hive's REPL DUMP command operates, creating dumps that are replicated to other warehouses.

Blobstore (e.g. Amazon S3)

Starting in release 2.2.0, a set of configurations was added to enable read/write performance improvements when working with tables stored on blobstore systems, such as Amazon S3.

hive.blobstore.supported.schemes

Default value: s3,s3a,s3n
Added In: Hive 2.2.0 with HIVE-14270

List of supported blobstore schemes that Hive uses to apply special read/write performance improvements.
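As a rough illustration of how a scheme list like this can be applied, the sketch below checks whether a path's URI scheme appears in the configured comma-separated list. The helper name is_blobstore_path and the case-insensitive comparison are assumptions made for this example; this is not Hive's actual implementation.

```python
from urllib.parse import urlparse

# Default value of hive.blobstore.supported.schemes, as documented above.
DEFAULT_SCHEMES = "s3,s3a,s3n"


def is_blobstore_path(path, supported_schemes=DEFAULT_SCHEMES):
    """Return True if the path's URI scheme is in the comma-separated list.

    Hypothetical helper: the name and the case-insensitive match are
    assumptions for illustration only.
    """
    schemes = {s.strip().lower() for s in supported_schemes.split(",") if s.strip()}
    return urlparse(path).scheme.lower() in schemes
```

With the default list, a path such as s3a://bucket/warehouse/t1 would be treated as a blobstore path, while an hdfs:// path or a bare local path would not.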

hive.blobstore.optimizations.enabled

Default value: true
Added In: Hive 2.2.0 with HIVE-15121

This parameter is a global switch that enables a number of optimizations when running on blobstores.
If it is set to false, some of the optimizations, such as hive.blobstore.use.blobstore.as.scratchdir, are not used.
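The interaction between the global switch and an individual optimization can be sketched as a simple AND of the two settings. This is a minimal sketch, assuming the configuration is available as a plain dictionary of strings; the function name blobstore_scratchdir_in_effect is hypothetical, and the defaults are the documented ones (true for the global switch, false for the scratch directory option).

```python
def _as_bool(value):
    """Interpret a configuration string such as "true" or "false"."""
    return str(value).strip().lower() == "true"


def blobstore_scratchdir_in_effect(conf):
    """Return True only if both the global switch and the option are on.

    Hypothetical helper; conf is a plain dict standing in for the Hive
    configuration. Missing keys fall back to the documented defaults.
    """
    enabled = _as_bool(conf.get("hive.blobstore.optimizations.enabled", "true"))
    requested = _as_bool(conf.get("hive.blobstore.use.blobstore.as.scratchdir", "false"))
    return enabled and requested
```

Under this reading, turning on the scratch directory option has no effect while the global switch is set to false.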

hive.blobstore.use.blobstore.as.scratchdir

Default value: false
Added In: Hive 2.2.0 with HIVE-14270

Set this to true to enable the use of scratch directories directly on blob storage systems. Note that this may cause performance penalties.

hive.exec.input.listing.max.threads

Default value: 0 (disabled)
Added In: Hive 2.2.0 with HIVE-15881

Set this to the maximum number of threads that Hive will use to list file information from file systems, such as file size and number of files per table (recommended > 1 for blobstore).
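The idea behind a bounded listing thread pool can be sketched as follows. This is an illustration only, not Hive's code: the local file system stands in for a blobstore, the function names summarize_dir and list_inputs are hypothetical, and treating 0 as "list sequentially" is an assumption based on the documented default of 0 (disabled).

```python
import os
from concurrent.futures import ThreadPoolExecutor


def summarize_dir(path):
    """Return (file_count, total_bytes) for regular files directly under path."""
    count = total = 0
    with os.scandir(path) as entries:
        for entry in entries:
            if entry.is_file():
                count += 1
                total += entry.stat().st_size
    return count, total


def list_inputs(paths, max_threads=0):
    """Summarize each path, using a thread pool only when max_threads > 0."""
    paths = list(paths)
    if max_threads <= 0:
        # Assumed meaning of "disabled": list one directory at a time.
        return {p: summarize_dir(p) for p in paths}
    with ThreadPoolExecutor(max_workers=max_threads) as pool:
        # pool.map preserves input order, so results line up with paths.
        return dict(zip(paths, pool.map(summarize_dir, paths)))
```

Listing calls against a blobstore are typically dominated by request latency, which is why issuing several of them concurrently tends to help more there than on a local or HDFS file system.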

...

