...

Copy On Write (COW)






Merge On Read (MOR)





Hudi writing is implemented as a Spark library, which makes it easy to integrate into existing data pipelines or ingestion libraries (which we will refer to as `Hudi clients`). A Hudi client prepares an `RDD[HoodieRecord]` containing the data to be upserted. The Hudi upsert/insert itself is simply a Spark DAG that can be broken into two big pieces.
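As a rough illustration of the client contract described above, the following is a minimal, Spark-free sketch in plain Python. It does not use the Hudi or Spark APIs and does not model the DAG itself. `Record`, `prepare_records`, and `upsert` are hypothetical names invented for this example, with a list standing in for the `RDD[HoodieRecord]` and a dictionary standing in for the table.

```python
from dataclasses import dataclass


@dataclass
class Record:
    """Hypothetical stand-in for a HoodieRecord: a key, a partition path, a payload."""
    key: str
    partition_path: str
    payload: dict


def prepare_records(rows):
    """Client side: wrap raw rows into keyed records (stand-in for an RDD[HoodieRecord])."""
    return [Record(key=row["id"], partition_path=row["date"], payload=row) for row in rows]


def upsert(table, records):
    """Toy upsert: replace the record if its key exists, insert it otherwise."""
    merged = dict(table)  # copy so the input table is left untouched
    for rec in records:
        merged[rec.key] = rec
    return merged


# First batch inserts key "a"; second batch updates "a" and inserts "b".
table = upsert({}, prepare_records([{"id": "a", "date": "2020-01-01", "value": 1}]))
table = upsert(table, prepare_records([
    {"id": "a", "date": "2020-01-01", "value": 2},
    {"id": "b", "date": "2020-01-02", "value": 3},
]))
```

The point of the sketch is only the division of labor: the client is responsible for turning its input into keyed records, and the write path decides per key whether each record is an update or an insert.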

...