...
- APIs: (https://issues.apache.org/jira/browse/HUDI-4141)
- FileGroup APIs in Java
- Rust/C++ APIs for Timeline, Metadata, FileGroup Read/Write (https://issues.apache.org/jira/browse/HUDI-6486)
- Internal APIs/Abstractions/Code Refactoring (https://issues.apache.org/jira/browse/HUDI-6243)
- Take the HoodieData abstraction to completion, plus end-to-end row writing for Spark? All write operations work with rows end-to-end (HUDI-4857)
- HUDI-43
- HoodieSchema? (https://issues.apache.org/jira/browse/HUDI-6499)
- Design
- General purpose, global timeline (no active vs archived distinction) (HUDI-309)
- Non-blocking concurrency control: clustering + updates, inserts + inserts, for Spark and Flink.
- Spark SQL statements to complete the DB vision. (Vinoth has a list. ???)
- Implementation
- Multi-table transaction
- Implement non-blocking concurrency control for Spark...
- Secondary indexes (Bloom, RLI, VectorIndex, ..) on Spark read/write path. (HUDI-3907, HUDI-4128)
- Metadata table (MT) integration across Presto and Trino (HUDI-4552, HUDI-4394)
- Presto: Snapshot, Incremental, Time Travel, CDC queries (on MT) (https://issues.apache.org/jira/browse/HUDI-3210)
- Trino: same query types as for Presto (https://issues.apache.org/jira/browse/HUDI-2687)
- Minimize configs and cleanup defaults (https://issues.apache.org/jira/browse/HUDI-1239)
- Meta Sync to Glue/HMS with reduced storage/API overhead (HUDI-2519, HUDI-5108, HUDI-6488); seamless incremental query, CDC query, and ro/rt experience
- Broader performance improvements (HUDI-3249)
- Encoding updates as deletes + inserts. (HUDI-6490)
- SQL experience for timeline, metadata. (HUDI-6498)
- Introduce TrueTime API or equivalent, to explain the foundations more clearly. (reuse HUDI-3057)
- Introduce HudiStorage APIs to abstract out Hadoop FileSystem. (HUDI-6497)
...