Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

We propose a unified cache for both state store as well as sending downstream to the next processor, as illustrated in the Figure below:


Image AddedImage Removed

The cache has two functions. First, it continues to serve as a read buffer for data that is sent to the state store, just like today. Second, it serves as a write deduplicator for the state store (just like today) as well as for the downstream processor node(s). So the tradeoff is "getting each update, i.e., a low update delay -- but a large CPU and storage load" vs. "skipping some updates, i.e., larger update delay -- but a reduced resource load". Note that this optimization does not affect correctness. The optimization is applicable to aggregations and to operators.

...