Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: add some headings

Hive Metastore Administration

Table of Contents

Introduction

...

Basic Configuration Parameters

The relevant configuration parameters are shown here. (Non-metastore parameters are described in Configuring Hive. Also see the Language Manual's Hive Configuration Properties, including Metastore and Hive Metastore Security.)

Config ParamConfiguration Parameter

Description

javax.jdo.option.ConnectionURL

JDBC connection string for the data store which contains metadata

javax.jdo.option.ConnectionDriverName

JDBC Driver class name for the data store which contains metadata

hive.metastore.uris

Hive connects to one of these URIs to make metadata requests to a remote Metastore (comma separated list of URIs)

hive.metastore.local

local or remote metastore (Removed as of Hive 0.10: If hive.metastore.uris is empty local mode is assumed, remote otherwise)

hive.metastore.warehouse.dir

URI of the default location for native tables

The Hive Metastore metastore is stateless and thus there can be multiple instances to achieve High Availability. Using hive.metastore.uris it is possible to specify multiple remote Metastoresmetastores. Hive will use the first one from the list by default but will pick a random one on connection failure and will try to reconnect.

Additional Configuration Parameters

These variables metastore configuration parameters were carried over from old documentation without a guarantee that they all still exist (see the HiveConf java class for current Hive configuration options):

Variable NameConfiguration Parameter

Description

Default Value

hive.metastore.metadb.dir

The location of filestore metadata base directory.

 

hive.metastore.rawstore.impl

Name of the class that implements the org.apache.hadoop.hive.metastore.rawstore interface. This class is used to store and retrieval of raw metadata objects such as table, database. (Hive 0.8.1 and later.)

 

org.jpox.autoCreateSchema

Creates necessary schema on startup if one doesn't exist. (The schema includes tables, columns, and so on.) Set to false after creating it once.

 

org.jpox.fixedDatastore

Whether the datastore schema is fixed.

 

datanucleus.autoStartMechanism

Whether to initialize on startup.

 

hive.metastore.ds.connection.url.hook

Name of the hook to use for retriving the JDO connection URL. If empty, the value in javax.jdo.option.ConnectionURL is used as the connection URL. (Hive 0.6 and later.)

 

hive.metastore.ds.retry.attempts

The number of times to retry a call to the backing datastore if there were a connection error.
(Hive 0.6 through 0.12; removed in 0.13.0 – use hive.hmshandler.retry.attempts instead.)

1

hive.metastore.ds.retry.interval

The number of miliseconds between datastore retry attempts.
(Hive 0.6 through 0.12; removed in 0.13.0 – use hive.hmshandler.retry.interval instead.)

1000

hive.metastore.server.min.threads

Minimum number of worker threads in the Thrift server's pool.
(Hive 0.6 and later.)

200

hive.metastore.server.max.threads

Maximum number of worker threads in the Thrift server's pool.
(Hive 0.6 and later.)

100000 since Hive 0.8.1

...

Data Nucleus Auto Start

Warning
titleConfiguring datanucleus.autoStartMechanism is highly recommended

Configuring auto start for data nucleus is highly recommended. See HIVE-4762 for more details.

Code Block
 <property>
    <name>datanucleus.autoStartMechanism</name>
    <value>SchemaTable</value>
  </property>

...

Default Configuration

Default The default configuration sets up an embedded metastore which is used in unit tests and is described in the next section. More practical options are described in the subsequent sections.

...