Hive Metastore Administration
Table of Contents |
---|
Introduction
...
Basic Configuration Parameters
The relevant configuration parameters are shown here. (Non-metastore parameters are described in Configuring Hive. Also see the Language Manual's Hive Configuration Properties, including Metastore and Hive Metastore Security.)
Config ParamConfiguration Parameter | Description |
---|---|
javax.jdo.option.ConnectionURL | JDBC connection string for the data store which contains metadata |
javax.jdo.option.ConnectionDriverName | JDBC Driver class name for the data store which contains metadata |
hive.metastore.uris | Hive connects to one of these URIs to make metadata requests to a remote Metastore (comma separated list of URIs) |
hive.metastore.local | local or remote metastore (Removed as of Hive 0.10: If |
hive.metastore.warehouse.dir | URI of the default location for native tables |
The Hive Metastore metastore is stateless and thus there can be multiple instances to achieve High Availability. Using hive.metastore.uris
it is possible to specify multiple remote Metastoresmetastores. Hive will use the first one from the list by default but will pick a random one on connection failure and will try to reconnect.
Additional Configuration Parameters
These variables metastore configuration parameters were carried over from old documentation without a guarantee that they all still exist (see the HiveConf
java class for current Hive configuration options):
Variable NameConfiguration Parameter | Description | Default Value |
---|---|---|
hive.metastore.metadb.dir | The location of filestore metadata base directory. |
|
hive.metastore.rawstore.impl | Name of the class that implements the org.apache.hadoop.hive.metastore.rawstore interface. This class is used to store and retrieval of raw metadata objects such as table, database. (Hive 0.8.1 and later.) |
|
org.jpox.autoCreateSchema | Creates necessary schema on startup if one doesn't exist. (The schema includes tables, columns, and so on.) Set to false after creating it once. |
|
org.jpox.fixedDatastore | Whether the datastore schema is fixed. |
|
datanucleus.autoStartMechanism | Whether to initialize on startup. |
|
hive.metastore.ds.connection.url.hook | Name of the hook to use for retriving the JDO connection URL. If empty, the value in javax.jdo.option.ConnectionURL is used as the connection URL. (Hive 0.6 and later.) |
|
hive.metastore.ds.retry.attempts | The number of times to retry a call to the backing datastore if there were a connection error. | 1 |
hive.metastore.ds.retry.interval | The number of miliseconds between datastore retry attempts. | 1000 |
hive.metastore.server.min.threads | Minimum number of worker threads in the Thrift server's pool. | 200 |
hive.metastore.server.max.threads | Maximum number of worker threads in the Thrift server's pool. | 100000 since Hive 0.8.1 |
...
Data Nucleus Auto Start
Warning | ||
---|---|---|
| ||
Configuring auto start for data nucleus is highly recommended. See HIVE-4762 for more details.
|
...
Default Configuration
Default The default configuration sets up an embedded metastore which is used in unit tests and is described in the next section. More practical options are described in the subsequent sections.
...