Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Abstractly, we think of the metadata store as a KV store and Startpoints are stored as:SSP→StartpointEntry(Startpoint, TaskName)

SSP→Startpoint or SSP+TaskName→Startpoint

TaskName is optional and is primarily for broadcast inputs where the same SSP spans across multiple tasks. However, when a job starts, all Startpoints keyed only by SSP will be remapped to SSP+TaskName. See General Workflow above for details.

StartpointManager

StartpointManager is the main API to read and write Startpoints and is composed alongside the CheckpointManager in the OffsetManager. The StartpointManager is system implementation agnostic and handles the serialization and deserialization of Startpoints into the metadata store.

...