Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Tip

This is where `Hudi` comes into the picture by allowing data to be kept in files, not just input data but also output data.
Hypothesis : the ability to keep data in Parquet files throughout the basic files analysis is key to building the above vision.


Activity

This page about Uber #Michelangelo https://eng.uber.com/michelangelo/ suggests there is still a distinction in the architecture and implementation (and programming model) between "batch" and "streaming" and the data is placed in distinct kinds of physical repositories between batch and continuous analyses.

...