...

...




def~Merge-On-Read



Hudi writing is implemented as a Spark library, which makes it easy to integrate into existing data pipelines or ingestion libraries (which we will refer to as `Hudi clients`). Hudi clients prepare an `RDD[HoodieRecord]` containing the data to be upserted. A Hudi upsert/insert is then simply a Spark DAG that can be broken into two big pieces.

...