Definition

Logically: represents a `unit of work` of external data ingestion into a given target `Hudi` dataset, i.e. a set of `Hudi` records.

Physically: represents a list of `delta file`s (in `Avro` format) that hold the external data deltas not yet merged into the target `Hudi` dataset.
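
To make the two views concrete, here is a minimal sketch in Python. It is not Hudi code: `DeltaFile`, `IngestionUnit`, the in-memory record dictionaries and the file names are all hypothetical stand-ins, and the rule that later deltas override earlier ones for the same record key is an assumption made for illustration only.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class DeltaFile:
    """Hypothetical stand-in for one Avro delta file (no real Avro I/O here)."""
    path: str
    records: Dict[str, dict]  # record key -> record payload carried by this delta


@dataclass
class IngestionUnit:
    """Hypothetical model of the concept: one unit of ingestion work."""
    # Physical view: the list of delta files not yet merged into the dataset.
    delta_files: List[DeltaFile] = field(default_factory=list)

    def records(self) -> Dict[str, dict]:
        """Logical view: the set of records carried by this unit of work.

        Assumption: when a key appears in several deltas, the later file wins.
        """
        merged: Dict[str, dict] = {}
        for delta in self.delta_files:
            merged.update(delta.records)
        return merged

    def merge_into(self, dataset: Dict[str, dict]) -> Dict[str, dict]:
        """Return a new dataset with the pending deltas applied as upserts.

        Assumption: merging is a key-based upsert; the input is left unchanged.
        """
        result = dict(dataset)
        result.update(self.records())
        return result
```

For example, an `IngestionUnit` built from two `DeltaFile` objects that both carry key `k1` exposes a single logical record for `k1` (the one from the second file), and `merge_into` upserts it into a copy of the target dataset.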

Related concepts

  1. storage type
  2. Merge On Read (MOR)
  3. Copy On Write (COW)

Status (draft)
