You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 2 Next »

Definition

A dataset ingested in `Hudi` - represents data internal to `Hudi` as opposed to external data, unmanaged by `Hudi`.

Design decisions

  1. datasets are always in `parquet` format
  2. external data is ingested in `Hudi` by one or more commits

Related concepts

  1. file type
  2. commit

Status (draft)


  • No labels