Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

                                  Concurrrency

...

We use concurrrency to achive the batch operation.

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

...

 

Hudi provide two standard command lines which hudiimport and hudiexport to realize data batch operation.

...

Hudiimport -h master:1990 -l lake -t table -e target.txt

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Limitation and Solution

Independent Operation 

  Hudi provide the serializable operation.However they have the own operation log.Sharing the operaion log across multiple tables would remove the limitation.

Low latency

   Hudi is limited by the latency of the underlying object storage.It is difficult to achive millisecond streaming latency using batch ,hudi run the parallel jobs.

...

 

 

 

 

 

...

 

Performance Evaluation

Todo: performance comparison

...