https://github.com/apache/iotdb/pull/1890

This experiment shows the compression ratio and time cost of DIFF encoding. The test data is generated by the same test classes used for the TS_2DIFF encoding algorithm (in tsfile's test package) and includes quadratic function data (INT and LONG), regular date (LONG), and regular date with missing points (LONG).


Compression ratio


| Data set | Size before encoding / byte | Size after encoding / byte | Compression ratio |
|---|---|---|---|
| quadratic function (INT) | 40,000,000 | 29,857,238 | 1.340 |
| quadratic function (LONG) | 80,000,000 | 69,857,238 | 1.145 |
| regular date (LONG) | 100,000 | 43,413 | 2.303 |
| regular date with missing points (LONG) | 100,000 | 43,811 | 2.283 |

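DIFF encoding, like TS_2DIFF, is based on the delta encoding algorithm: it encodes each value as the difference between the current value and the previous one. As a result, the more regular the data and the smaller the differences, the better the compression ratio. This is consistent with the results above, where the regular date sets reach a ratio of about 2.3 while the quadratic function sets stay between 1.1 and 1.4.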
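The idea can be illustrated with a minimal Java sketch. It is not the encoder from the pull request: the class name DeltaSketch and its methods are hypothetical, and it only computes the differences, without the bit packing a real encoder would apply to them. It shows why regularly spaced values produce small, constant deltas while quadratic values produce deltas that keep growing.

```java
import java.util.Arrays;

// Hypothetical illustration of plain delta encoding; not the IoTDB DIFF encoder.
final class DeltaSketch {

    // Keep the first value as-is, then store each value minus its predecessor.
    static long[] encode(long[] values) {
        long[] out = new long[values.length];
        if (values.length == 0) {
            return out;
        }
        out[0] = values[0];
        for (int i = 1; i < values.length; i++) {
            out[i] = values[i] - values[i - 1];
        }
        return out;
    }

    // Rebuild the original values with a running sum over the deltas.
    static long[] decode(long[] deltas) {
        long[] out = new long[deltas.length];
        if (deltas.length == 0) {
            return out;
        }
        out[0] = deltas[0];
        for (int i = 1; i < deltas.length; i++) {
            out[i] = out[i - 1] + deltas[i];
        }
        return out;
    }

    public static void main(String[] args) {
        long[] regular = {1000, 2000, 3000, 4000, 5000}; // evenly spaced values
        long[] quadratic = {0, 1, 4, 9, 16};             // i * i

        // Every delta after the first value is the same: 1000.
        System.out.println(Arrays.toString(encode(regular)));
        // Deltas grow with i (1, 3, 5, 7), so they need more bits over time.
        System.out.println(Arrays.toString(encode(quadratic)));
        // Decoding the deltas restores the input.
        System.out.println(Arrays.equals(decode(encode(quadratic)), quadratic));
    }
}
```

In this sketch the regular series collapses to a single repeated delta, which a real encoder can store in very few bits, whereas the quadratic series yields ever larger deltas. That is one plausible reading of the gap between the ratios measured above.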

Time cost


| Operation | quadratic function (INT) / ns | quadratic function (LONG) / ns | regular date (LONG) / ns | regular date with missing points (LONG) / ns |
|---|---|---|---|---|
| Encode | 1,909,067,200 | 3,043,108,920 | 25,280,460 | 26,435,600 |
| Decode | 1,433,093,540 | 3,072,575,720 | 12,295,180 | 12,961,360 |

