Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • Portability Framework
    • The model protos contain all aspects of the portability API and is the truth on the ground. The proto definitions supercede any design documents. The main design documents are the following:
    • Runner API. Pipeline representation and discussion on primitive/composite transforms and optimizations.

    • Job API. Job submission and management protocol.

    • Fn API. Execution-side control and data protocols and overview.

    • Container contract. Execution-side docker container invocation and provisioning protocols. See CONTAINERS.md for how to build container images.

    • Cross language. Options and tradeoffs for how to handle various kinds of multi-language/multi-SDK pipelines.
  • Fn API
    • Apache Beam Fn API Overview [doc]
    • Processing a Bundle [doc]
    • Progress [doc]
    • Graphical view of progress [doc]
    • Fn State API and Bundle Processing [doc]
    • Checkpointing and splitting of Beam bundles over the Fn API, with application to SDF [doc]
    • How to send and receive data [doc]
    • Defining and adding SDK Metrics [doc]
    • SDK harness container contract [doc]
    • Structure and Lifting of Combines [doc]
  • Cross-language Beam Pipelines [doc]
  • SDK X with Runner Y using Runner API [doc]
  • Flink Portable Runner Overview [doc]
  • Launching portable pipeline on Flink Runner [doc]
  • Portability support [table]
  • Portability Prototype [doc]
  • Portable Artifact Staging [doc]
  • Portable Beam on Flink [doc]
  • Portability API: How to Checkpoint and Split Bundles [doc]
  • Portability API: How to Finalize Bundles [doc]
  • Side Input in Universal Reference Runner [doc]
  • Spark Portable Runner Overview [doc]
  • Cross-Language

Build / Testing

  • More Expressive PAsserts [doc]
  • Mergebot design document [doc]
  • Performance tests for commonly used file-based I/O PTransforms [doc]
  • Performance tests results analysis and basic regression detection [doc]
  • Eventual PAssert [doc]
  • Testing I/O Transforms in Apache Beam [doc]
  • Reproducible Environment for Jenkins Tests By Using Container [doc]
  • Keeping precommit times fast [doc]
  • Increase Beam post-commit tests stability [doc]
  • Beam-Site Automation Reliability [doc]
  • Managing outdated dependencies [doc]
  • Automation For Beam Dependency Check [doc]
  • Test performance of core Apache Beam operations [doc]
  • Add static code analysis quality gates to Beam [doc]
  • Portable batch & streaming load tests in all sdks [doc]
  • Storing, displaying and detecting anomalies in test results using Prometheus and Grafana [doc]
  • Storing, displaying and detecting anomalies in test results (corrected version of the previous proposal) [doc]

...