...

  • Side Input Caching using tokens (Jira: BEAM-11097).
    • Side input data is valid per window. Multiple elements and bundles processed in the same window can reuse the same side input data at the runner’s discretion. This is implemented with a key token.
    • This is critical for streaming performance, since it reduces data interchange with the Runner half of a worker.
  • Map Side Inputs / Random Access Side Inputs (Jira: BEAM-3293).
    • KV PCollections as side inputs currently support only an iterable approach. However, the Beam protocol also supports a key lookup approach.
    • This enables having significantly larger side inputs, as data can be looked up from the runner as needed, rather than possibly iterating over the entire PCollection for every element.
    • Similarly, an advanced feature would be to support custom ViewFns for side inputs: user-side processing of side inputs that is orchestrated and garbage collected by the framework, rather than requiring users to do so manually (Jira: BEAM-110993305).
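
The token-based caching described above can be sketched as follows. This is a minimal illustration and not the SDK's implementation: `SideInputCache`, `cacheKey`, the string-typed token and window, and the `fetch` callback are all hypothetical names chosen for the example. The idea is only that data fetched once for a given (token, window) pair is reused by later elements and bundles, and that a new token from the runner forces a fresh fetch.

```go
package main

import "fmt"

// cacheKey identifies one side input's data for one window under one token.
// Both fields are hypothetical stand-ins for runner-provided identifiers.
type cacheKey struct {
	token  string
	window string
}

// SideInputCache keeps materialized side input data between bundles.
type SideInputCache struct {
	data    map[cacheKey][]string
	Fetches int // number of times data had to be requested from the runner
}

// NewSideInputCache returns an empty cache.
func NewSideInputCache() *SideInputCache {
	return &SideInputCache{data: make(map[cacheKey][]string)}
}

// Get returns the cached data for (token, window) and calls fetch only on a miss.
func (c *SideInputCache) Get(token, window string, fetch func() []string) []string {
	k := cacheKey{token: token, window: window}
	if v, ok := c.data[k]; ok {
		return v
	}
	c.Fetches++
	v := fetch()
	c.data[k] = v
	return v
}

func main() {
	c := NewSideInputCache()
	fetch := func() []string { return []string{"a", "b"} }

	c.Get("tok1", "w0", fetch) // miss: data is fetched
	c.Get("tok1", "w0", fetch) // hit: same token and window
	c.Get("tok2", "w0", fetch) // miss: the token changed

	fmt.Println(c.Fetches) // prints 2
}
```

A real cache would also need eviction once a window expires or a token is no longer valid; that is omitted here.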


Windowing [Basic]

FlumeGo supports simple fixed-interval windowing and global windows. The validation mentioned above needs to be handled before sliding windows can be declared correctly supported.
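
As a rough illustration of what fixed-interval windowing computes, the sketch below assigns a timestamp to its window. It is not taken from the SDK: `FixedWindow` is a hypothetical helper, and timestamps and window sizes are assumed to be plain `int64` milliseconds with windows aligned to zero.

```go
package main

import "fmt"

// FixedWindow returns the half-open interval [start, end) of the fixed
// window of the given size that contains ts. size must be positive.
func FixedWindow(ts, size int64) (start, end int64) {
	rem := ts % size
	if rem < 0 {
		// Go's % keeps the sign of the dividend, so shift negative
		// remainders to keep windows aligned before the epoch.
		rem += size
	}
	start = ts - rem
	return start, start + size
}

func main() {
	start, end := FixedWindow(125, 60)
	fmt.Println(start, end) // prints 120 180
}
```

Each element lands in exactly one fixed window, which is why this case is simpler than sliding windows, where one element belongs to several overlapping windows.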

...