You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 11 Current »

Definition

Type that determines how data will be laid out as file and stored, inside a def~table.

#todo verify

Following table summarizes the trade-offs between these two dataset types

Trade-offdef~copy-on-writedef~merge-on-read
Data LatencyHigherLower
Update cost (I/O)Higher (rewrite entire def~table parquet)Lower (append to `delta log`)
Write AmplificationHigherLower (depending on compaction strategy to the def~table parquet)

Related concepts

  1. def~commit
  2. Commit List
  3. def~merge-on-read
  4. def~copy-on-write
  5. def~timeline-instant

Status (draft)



  • No labels