Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

When is Hudi a useful for me or my organization? 

<Answer WIP>

What are some non-goals for Hudi? 

<Answer WIP>

What is incremental processing? Why does Hudi docs/talks keep talking about it? 

<Answer WIP>

What is the difference between COW (copy on write) vs MOR (merge on read) storage types ?

...

Find more details on trade offs between cow & mor storage types here.

Is Hudi an analytical database? 

<Answer WIP>

How do I model the data stored in Hudi? 

<Answer WIP>

Using Hudi

What are some ways to write a Hudi dataset? 

<Answer WIP>

How can I now query the Hudi dataset I just wrote?

<Answer WIP>

How

...

does Hudi handle duplicate record keys in an input? 

<Answer WIP>

Can I implement my own logic for how input records are merged with record on storage? 

<Answer WIP>

How is a Hudi job deployed? 

<Answer WIP>

What are different ways of running compaction for a MOR dataset?

<Answer WIP>

Performance 

What performance can I expect for Hudi writing?

<Answer WIP>

What performance can I expect for Hudi reading/queries? 

<Answer WIP>

How do I improve the Hudi writing performance?

<Answer WIP>

How do I to avoid creating tons of small files?

...