Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

# Feature freeze (branch forking):  Estimated for early 31th July 2022

# Code freeze (first voting RC):  Estimated for the end 15th of July Aug 2022

# Release date: Estimated for the 31th of Aug 2022

# Release manager: yiguolei

# New Features

## 1. Support using S3 as cold data storage [dataroaring]

* [DSIP-010](https://cwiki.apache.org/confluence/display/DORIS/DSIP-010%3A+Cooldown+Data+to+S3)
* [#10280](https://github.com/apache/doris/pull/10280)
* [#10405](https://github.com/apache/doris/pull/10405)

## 2. JAVA UDF and JAVA UDAF [happenlee]

* [DSIP-001](https://cwiki.apache.org/confluence/display/DORIS/DSIP-001%3A+Java+UDF)

## 3. New Realtime Update Storage

...

Engine [dataroaring, wangbo]

* [DSIP-018](https://cwiki.apache.org/confluence/display/DORIS/DSIP-018%3A+Support+Merge-On-Write+implementation+for+UNIQUE+KEY+data+model)

## 4. Multi-Catalog support [morningman]

* [DSIP-014](https://cwiki.apache.org/confluence/display/DORIS/DSIP-014%3A+Multi+Catalog+Support)

## 5. Light weight schema change [dataroaring, yiguolei]

* [#10136](https://github.com/apache/doris/pull/10136)

...

1. Array Type [DSIP-013](https://cwiki.apache.org/confluence/display/DORIS/DSIP-013%3A+Support+ARRAY+type) [zhengguoyang, lihangyu]
2. Jsonb Type [#10322](https://github.com/apache/doris/pull/10322) [xiaokang]
3. Decimal v3 [#9582](https://github.com/apache/doris/pull/9582) [yiguolei]
4. Data Datetime v2 [#9916](https://github.com/apache/doris/pull/9916)

## 7. New Flink CDC Connector

Support 2PC stream load and Exactly-Once sematice

* [](https://github.com/apache/doris-flink-connector/pull/12)

...

[yiguolei]

## 7. Support for deployment in k8s [caiconghui]

# Performance Improvement

## 1. Global Dictionary for low cardinality data for group by. [yiguolei]

* [#9842](https://github.com/apache/doris/pull/9842)

## 2. Store decoded bitshuffle data in page cache [yiguolei]

* [#10036](https://github.com/apache/doris/pull/10036)

## 3. Compaction and Load optimization [morningman, yixiudt]

Support load and compaction for single replica

* [DSIP-015](https://cwiki.apache.org/confluence/display/DORIS/DSIP-015%3A+Support+single+replica+load+for+load)
* [#10298](https://github.com/apache/doris/pull/10298)
* [#10301](https://github.com/apache/doris/pull/10301)

## 4. Improvements in Runtime Filter [yiguolei]

The Runtime Filter feature with vectorized engine still need to be improved.

## 5. Full vectorization support [happenlee]

In version 1.2, all features should be supported running with vectorized engine, including:

1. Stream Load
2. Insert into values
3. Insert into select
4. Broker load
5. Spark dpp
6. Routine load
7. Query for olap, hive, iceberg, mysql, odbc, es
8. File format support: parquet, json, csv, orc file, plain text
9. Schema Change
10. Compaction
11. Export
12. Select outfile 

# Stability Improvement

## 1. New memory tracker [xinyi]

* [DSIP-002](https://cwiki.apache.org/confluence/display/DORIS/DSIP-002%3A+Refactor+memory+tracker+on+BE)

# Test

## 1. Add more regression tests [dataroaring]

Add more regression tests

...

## Remove and Refactor old codescodes 

1. Remove Mini load related code [yiguolei]
2. Remove Alpha Rowset (Segment v1) code [yiguolei]
3. Compatibility for C++ UDF Framework running in vectorized query engine. [weizuo, Garibel]

4. String type: Provide a tools to shorten the data in string type, because we no longer support string type longer than 1MB [morningman]