THIS IS A TEST INSTANCE. ALL YOUR CHANGES WILL BE LOST!!!!
Definition
Controls how datasets are exposed to queries
Hudi supports the following views of stored data
Excerpt Include read optimized view read optimized view - incremental view : Queries on this view only see new data written to the dataset, since a given commit/compaction. This view effectively provides change streams to enable incremental data pipelines.
- realtime view : Queries on this view see the latest snapshot of dataset as of a given delta commit action. This view provides near-real time datasets (few mins) by merging the base and delta files of the latest file slice on-the-fly.
Following table summarizes the trade-offs between the different views.
Trade-off | ReadOptimized | RealTime |
---|---|---|
Data Latency | Higher | Lower |
Query Latency | Lower (raw columnar performance) | Higher (merge columnar + row based delta) |
Excerpt |
---|