...

When the active controller decides that a standby controller should start a snapshot, it will communicate that decision in its response to the periodic heartbeat sent by that node. When the active controller decides that it should create a snapshot itself, it will first try to give up leadership of the Raft quorum, in order to avoid unnecessary delays while writing the snapshot. Each snapshot will reflect a consistent point in time. Because the snapshots are centrally coordinated by the active controller, we can avoid initiating more than one snapshot at once. The controller will also snapshot less frequently when too many members of the quorum have fallen behind. Specifically, if losing a node would probably impact availability, we will use a separate set of configurations for determining when to snapshot.
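
The coordination rules above can be illustrated with a minimal sketch. It is not the controller's actual implementation: the class and method names, the record-count threshold, and the rule used to decide that "losing a node would impact availability" (taking one caught-up member away would leave fewer than a majority) are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class SnapshotThresholds:
    # Hypothetical knob: snapshot once this many records have
    # accumulated since the node's last snapshot.
    max_records_since_snapshot: int


@dataclass
class SnapshotCoordinator:
    """Hypothetical model of the active controller's snapshot decisions."""
    active_id: int
    quorum_size: int
    normal: SnapshotThresholds
    degraded: SnapshotThresholds  # separate settings used when the quorum is fragile
    # Maps every quorum member (including the active controller) to
    # whether it is caught up with the metadata log.
    caught_up: Dict[int, bool] = field(default_factory=dict)
    in_progress: Optional[int] = None  # node currently snapshotting, if any

    def _thresholds(self) -> SnapshotThresholds:
        majority = self.quorum_size // 2 + 1
        healthy = sum(1 for ok in self.caught_up.values() if ok)
        # Assumed rule: if removing one caught-up node would leave fewer
        # than a majority, switch to the less aggressive settings.
        return self.normal if healthy - 1 >= majority else self.degraded

    def _due(self, records_since_snapshot: int) -> bool:
        limit = self._thresholds().max_records_since_snapshot
        return self.in_progress is None and records_since_snapshot >= limit

    def on_heartbeat(self, node_id: int, records_since_snapshot: int) -> bool:
        """Return True if the heartbeat response should ask this standby to snapshot."""
        if not self._due(records_since_snapshot):
            return False
        self.in_progress = node_id  # only one snapshot at a time
        return True

    def plan_self_snapshot(self, records_since_snapshot: int) -> List[str]:
        """Steps the active controller would take to snapshot itself."""
        if not self._due(records_since_snapshot):
            return []
        self.in_progress = self.active_id
        # Give up leadership first so the snapshot does not delay the quorum.
        return ["resign_leadership", "write_snapshot"]

    def on_snapshot_complete(self, node_id: int) -> None:
        if self.in_progress == node_id:
            self.in_progress = None
```

In this sketch, a second heartbeat that arrives while a snapshot is in progress is always told not to snapshot, and marking a quorum member as behind makes the coordinator fall back to the `degraded` thresholds.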

Broker Registration and State Management

...