...
Description: Enable some variant of SQL (likely HiveQL) to run on top of Flink, both in standalone and in embedded /mixed mode and by submitting queries from a client.
Associated JIRA:
Expected: Q3/Q4 2015
Integrate with Tachyon
Description: Integrate with Tachyon storage and lineage-based recovery
Associated JIRA:
Expected:
...
Integrate with Zeppelin
Description:
Associated JIRA:
Expected:
Integrate with Tez
Description: Enable Flink programs to run on Tez rather than using Flink's network stack. For certain use cases, this will give the option of running Flink programs with the resource elasticity that Tez provides.
Associated JIRA:
Expected: First version supporting a subset of Flink API in Q1 2015
Integrate with Samoa
Description: Create a Samoa adaptor
Associated JIRA:
Expected: Q1 2015
Semantic annotations for optimization
Description: A lot of optimizations are not possible in Flink, because the optimizer does not know what is happening inside user-defined functions. By adding semantic information for user functions which tells the optimizer how a function behaves, some of these limitations can be overcome.
Associated JIRA:
Expected: Q1 2015
Plan choice hints
Description: Query optimizers are kind of black boxes and usually do a good job in finding efficient executions. However, in some cases the user/developer knows better and wants guide the optimizer or help to find a better plan. Flink’s optimizer offers several hints which are not well exposed in the API. Also documentation for how write well optimizable programs need to be improved.
Associated JIRA:
Expected: Q2 2015
Improved statistics for the optimizer
Description: Improve data source statistics, integrate with data sources that already provide statistics (HCatalog)
Associated JIRA:
Expected: Q2 2015
Use off-heap memory
Description: Use off-heap memory for intermediate results, sorting and hashing. Reduces number of objects and size of JVM heap to make garbage collection more efficient.
Associated JIRA:
Expected: Q1 2015
Dynamic memory allocation
Description: Allocate memory to operators based on a need/benefit scheme. Improves memory utilization for pipelined operators.
Associated JIRA:
Expected: Q2/Q3 2015
Incremental ML Library
Description:
Associated JIRA:
Expected:
Unify batch and streaming APIs
Description:
Associated JIRA:
Expected:
...