You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 7 Current »

Definition

Logically: represents a `unit of work` of external data ingestion into a target def~dataset;  a set of `hudi` records.

Physically: represents a list of delta/log files (in `Avro` format) representing the external data differences / deltas not yet merged in the target def~dataset.

Related concepts

  1. def~dataset
  2. timeline instant
  3. commit time
  4. storage type
  5. def~Merge-On-Read
  6. Copy-On-Write

Status (draft)

  • No labels