Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Logically: represents a `unit of work` of external data ingestion into a target dataset def~dataset;  a set of `hudi` records.

Physically: represents a list of `delta file`s delta/log files (in `Avro` format) representing the external data differences / deltas not yet merged in the target dataset def~dataset.

Related concepts

  1. datasetdef~dataset
  2. timeline instant
  3. commit time
  4. storage type
  5. Merge def~Merge-On-Read (MOR)
  6. Copy-On-Write (COW)

Status (draft)