Definition

A set of records in tabular format (a table) ingested into `Hudi`. It represents data that is internal to and managed by `Hudi`, as opposed to external data that `Hudi` does not manage.

Design decisions

  1. def~tables are always in `parquet` file format
  2. each def~table has a single `parquet` file and one or more def~timelines (with `delta file`s / `log file`s)
  3. external data is ingested into `Hudi` by one or more def~commits
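
The three decisions above can be read as a small data model. The following is only an illustrative sketch, not `Hudi`'s actual API: the class names `Commit`, `Timeline`, and `Table`, the `ingest` and `record_count` methods, and the `instant` field are all hypothetical names chosen for this example.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Commit:
    """One ingestion of external records into the table (hypothetical model)."""
    instant: str          # ordering key for the commit, e.g. a timestamp string
    records: List[Dict]   # rows taken from the external source


@dataclass
class Timeline:
    """Ordered list of commits; stands in for the delta/log files (illustrative)."""
    commits: List[Commit] = field(default_factory=list)


@dataclass
class Table:
    """A managed table: a single parquet file plus one or more timelines."""
    name: str
    base_file: str
    timelines: List[Timeline] = field(default_factory=lambda: [Timeline()])

    def __post_init__(self) -> None:
        # Decision 1: tables are always stored in parquet file format.
        if not self.base_file.endswith(".parquet"):
            raise ValueError("base file must be a parquet file")
        # Decision 2: a table has at least one timeline.
        if not self.timelines:
            raise ValueError("a table needs at least one timeline")

    def ingest(self, instant: str, records: List[Dict]) -> Commit:
        """Decision 3: external data enters the table only through a commit."""
        commit = Commit(instant, records)
        self.timelines[0].commits.append(commit)
        return commit

    def record_count(self) -> int:
        """Total number of records ingested across all timelines."""
        return sum(len(c.records) for t in self.timelines for c in t.commits)
```

In this sketch, ingesting two batches of external rows produces two commits on the table's first timeline, and the table rejects any base file that is not `parquet`.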

Related concepts

  1. file format
  2. def~commit

Status (draft)

