Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Type that determines how a commit def~commit will be stored and handled relative to its application to the target dataset def~dataset.

#todo verify

Following table summarizes the trade-offs between these two storage types

Trade-offCopy On Write (COW)def~copy-on-writedef~merge-on-readMerge On Read (MOR)
Data LatencyHigherLower
Update cost (I/O)Higher (rewrite entire dataset def~dataset parquet)Lower (append to `delta log`)
Parquet File SizeSmaller (high update(I/0) cost)Larger (low update cost)
Write AmplificationHigherLower (depending on compaction strategy to the dataset def~dataset parquet)

Related concepts

  1. commitdef~commit
  2. Commit List
  3. Merge On Read (MOR)
  4. def~merge-on-read
  5. def~copy-on-write
  6. timeline instantCopy On Write (COW)

Status (draft)