You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 4 Next »

Definition

Logically: represents a `unit of work` of external data ingestion into a target `Hudi` dataset;  a set of `hudi` records.

Physically: represents a list of `delta file`s (in `Avro` format) representing the external data deltas not yet merged in the target `Hudi` dataset.

Related concepts

  1. storage type
  2. Merge On Read (MOR)
  3. Copy On Write (COW)

Status (draft)

  • No labels