Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Table of Contents
outlinetrue
stylecircle

General 

When is Hudi a useful for me or my organization? 

What are some non-goals for Hudi? 

What is incremental processing? Why does Hudi docs/talks keep talking about it? 

What is the difference between COW (copy on write) vs MOR (merge on read) storage types ?

...

Find more details on trade offs between cow & mor storage types here.

Using Hudi

What are some ways to write a Hudi dataset? 

How can I now query the Hudi dataset I just wrote?

How do I control how the input is merged against 

How

...

is a Hudi job deployed? 

What are different ways of running compaction for a MOR dataset?

Performance 

What performance can I expect for Hudi writing?

What performance can I expect for Hudi reading/queries? 

How do I improve the Hudi writing performance?

How do I to avoid creating tons of small files?

HoodieWriteConfig exposes knobs to allow for such flexibility. 

...

HoodieDeltaStreamer users

HoodieWriteClient users

Performance 

How do I improve the Hudi writing performance?

...