Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: 3.0.0 supported features note

...

This document describes the second version of Hive Replication. Please refer to the first version of Hive Replication for details on prior implementation.

This work is under development and interfaces are subject to change. As of Hive 3.0.0 release : only managed table replication where Hive user owns the table contents is supported. External tables, ACID tables, statistics and constraint replication is not supported.

Issues with the Current Replication System

...

The REPL DUMP command has an optional WITH clause to set command-specific configurations to be used when trying to copy from the source cluster. These configurations are only used by the corresponding REPL DUMP command and won't be used for other queries running in the same session.

If HiveConf parameter hive.in.test is  false, REPL DUMP will not use a new dump location, thus it will garble an existing dump. Hence before taking an incremental dump, clear the bootstrap dump location if hive.in.test is false.

Return values:

  1. Error codes returned as return error codes (and over jdbc if with HS2)
  2. Returns 2 columns in the ResultSet:
    1. <dir-name> - the directory to which it has dumped info.
    2. <last-evid> - the last event-id associated with this dump, which might be the end-evid, or the curr-evid, as the case may be.

...