Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Welcomes Welcome to the Apache CarbonData wiki. If you are interested in contributing to CarbonData, visit the contributing to CarbonData page to learn more.

...

DateVersion number
Aug 2016Apache CarbonData 0.1.0-incubating
Sep 2016Apache CarbonData 0.1.1-incubating
Nov 2016Apache CarbonData 0.2.0-incubating
Jan 2017Apache CarbonData 1.0.0-incubating
May 2017Apache CarbonData 1.1.0
Aug-Sep 2017Apache CarbonData 1.2.0
Jan-Feb 2018Apache CarbonData 1.3.0
Mar 2018Apache CarbonData 1.3.1
May 2018Apache CarbonData 1.4.0
Aug 2018Apache CarbonData 1.4.1
Oct 2018Apache CarbonData 1.5.0
Dec 2018Apache CarbonData 1.5.1
Jan 2019Apache CarbonData 1.5.2
Mar 2019Apache CarbonData 1.5.3
May 2019Apache CarbonData 1.5.4
Aug 2019Apache CarbonData 1.6.0
Oct 2019Apache CarbonData 1.6.1
May 2020

Apache CarbonData 2.0.0

Jun 2020Apache CarbonData 2.0.1
Nov 2020Apache CarbonData 2.1.0
Mar 2021Apache CarbonData 2.1.1
Aug 2021Apache CarbonData 2.2.0
Jan 2022Apache CarbonData 2.3.0

Road map plan:

1.0.x:

  • Support 2.1 integration in CarbonData
  • Remove kettle, support new data load solution
  • Support data update and delete SQL in Spark 1.6

...

  • Support Write into hive
  • Load performance improvements
  • TPCDS [Query, load] performance improvements
  • Carbon Advisor for auto suggestion of ideal table schema including MV, index, sort col, range col, compression ...
  • Delete and update support in CarbonData SDK
  • Support C engine reader for CarbonData SDK
  • ES based datamap management
  • Support Spark DataSource API V2
  • Support CarbonData metadata management using DB or other external OLTP system
  • Support MV on Streaming tables, partition tables, Time Series
  • Support MV creation from another MV

2.1.x:

  • Presto read support for complex columns
  • Make GeoID visible to the user
  • Support Carbondata SDK to load data from parquet, ORC, CSV, Avro and JSON.
  • Implement delete and update feature in carbondata SDK.
  • Support array<string> with SI
  • Support IndexServer with Presto Engine
  • Implementing a new Reindex command to repair the missing SI Segments
  • Support Change Column Comment
  • Support Local dictionary for presto complex datatypes
  • Block Pruning for geospatial polygon expression
  • Improve concurrent query performance
  • Support global sort for Secondary index table
  • Filter reordering
  • Geospatial index algorithm improvement and UDFs enhancement
  • CarbonData Trash support
  • Support Writing Flink Stage data into Hdfs file system
  • Support MERGE INTO SQL Command
  • Support Complex DataType when Save DataFrame
  • Adding global sort support for SI segments data files merge operation.

2.2.x:

  • Support Add, Drop and rename column support for the complex column
  • Spark-3.1 support
  • Secondary Index Support for Presto
  • CDC Performance improvement
  • Local sort Partition Load and Compaction improvement
  • Geo Spatial Query enhancements
  • Improve table status and metadata writing

2.3.x:

  • Support spatial index creation using data frame
  •  Introduce Streamer tool for Carbondata
  • Upgrade prestosql to 333 version
  • Multi-level complex schema support
  • Support for Dynamic Partition Pruning

Pages Link

Committers
Releases

...