Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Definition

A dataset ingested in `Hudi`.set of records in tabular format (a table) ingested in `Hudi` - represents data internal to `Hudi` as opposed to external data, un-managed by `Hudi`.

Design decisions

  1. each 135861650 has a single `parquet` file and one or more def~timelines (with `delta file`s / `log file`s)
  2. external data is ingested in `Hudi` by one or more def~commits

Related concepts

  1. file format
  2. def~commit
  3. def~query-type
  4. def~query-scan-mode

Status (draft)


Definition

A set of records in tabular format (a table) ingested in `Hudi` - represents data This is internal to `Hudi` as opposed to external data, unmanaged un-managed by `Hudi`.External

Design decisions

  1. each 135861650 has a single `parquet` file and one or more def~timelines (with `delta file`s / `log file`s)
  2. external data is ingested in `Hudi` by one or more 

...

  1. def~commits

...

Related concepts

  1. file format
  2. def~commit
  3. def~query-typecommit
  4. def~query-scan-mode

Status (draft)