Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

With Apache Hudi growing in popularity, one of the fundamental challenges for users has been about how to efficiently migrate migrating their historical datasets to Apache Hudi. Apache Hudi maintains per record metadata to perform core operations such as upserts and incremental pull. To take advantage of Hudi’s upsert and incremental processing support, users would need to rewrite their whole dataset to make it an Apache Hudi table. This RFC provides a mechanism to efficiently migrate their datasets without the need to rewrite the entire dataset.

...