Definition

The storage type determines how a def~commit is stored and how it is applied to the target def~dataset.

#todo verify

The following table summarizes the trade-offs between the two storage types, def~copy-on-write and def~merge-on-read:

| Trade-off | def~copy-on-write | def~merge-on-read |
|---|---|---|
| Data Latency | Higher | Lower |
| Update cost (I/O) | Higher (rewrite entire def~dataset parquet) | Lower (append to `delta log`) |
| Parquet File Size | Smaller (high update (I/O) cost) | Larger (low update cost) |
| Write Amplification | Higher | Lower (depending on compaction strategy to the def~dataset parquet) |
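
The update-cost and write-amplification rows can be illustrated with a toy model. The sketch below is not the real implementation or API: the class names `ToyCopyOnWrite` and `ToyMergeOnRead` are hypothetical, a "file" is modelled as an in-memory dict, and cost is counted in records written rather than bytes. It only assumes what the table states: copy-on-write rewrites the whole base file on every update, while merge-on-read appends to a delta log and pays the rewrite later at compaction.

```python
from dataclasses import dataclass, field


@dataclass
class ToyCopyOnWrite:
    """Hypothetical toy model: every upsert rewrites the whole base file."""
    base: dict = field(default_factory=dict)
    records_written: int = 0

    def upsert(self, updates: dict) -> None:
        merged = {**self.base, **updates}
        self.base = merged
        # The entire merged file is rewritten, not only the changed records.
        self.records_written += len(merged)

    def read(self) -> dict:
        # Readers only ever see the base file; no merge work at read time.
        return dict(self.base)


@dataclass
class ToyMergeOnRead:
    """Hypothetical toy model: upserts are appended to a delta log."""
    base: dict = field(default_factory=dict)
    delta_log: list = field(default_factory=list)
    records_written: int = 0

    def upsert(self, updates: dict) -> None:
        # Only the changed records are written.
        self.delta_log.append(dict(updates))
        self.records_written += len(updates)

    def read(self) -> dict:
        # Readers merge the base file with the delta log on the fly.
        view = dict(self.base)
        for batch in self.delta_log:
            view.update(batch)
        return view

    def compact(self) -> None:
        # Compaction folds the delta log into a rewritten base file.
        self.base = self.read()
        self.delta_log.clear()
        self.records_written += len(self.base)


# 1000 existing records, then three small batches touching 10 records each.
initial = {f"key{i}": 0 for i in range(1000)}
cow = ToyCopyOnWrite(base=dict(initial))
mor = ToyMergeOnRead(base=dict(initial))

for version in range(1, 4):
    batch = {f"key{i}": version for i in range(10)}
    cow.upsert(batch)
    mor.upsert(batch)

print(cow.records_written, mor.records_written)
```

In this toy run both models return the same data on read, but copy-on-write has written far more records for the same 30 changed values. Calling `mor.compact()` adds one full rewrite to the merge-on-read total, which is why its write amplification depends on how often compaction runs.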

Related concepts

  1. def~commit
  2. Commit List
  3. def~merge-on-read
  4. def~copy-on-write
  5. def~timeline-instant

Status (draft)


