...

| Encoding | GORILLA | PLAIN | RLE | TS_2DIFF |
|---|---|---|---|---|
| dataset | synthetic | synthetic | synthetic | synthetic |
| pagePointNum(ppn) | 10000 | 10000 | 10000 | 10000 |
| numOfPagesInChunk(pic) | 1000 | 1000 | 1000 | 1000 |
| chunksWritten(cw) | 10 | 10 | 10 | 10 |
| timeEncoding(te) | TS_2DIFF | TS_2DIFF | TS_2DIFF | TS_2DIFF |
| valueDataType(vt) | INT64 | INT64 | INT64 | INT64 |
| valueEncoding(ve) | GORILLA | PLAIN | RLE | TS_2DIFF |
| compression(co) | UNCOMPRESSED | UNCOMPRESSED | UNCOMPRESSED | UNCOMPRESSED |
| totalPointNum | 100000000 | 100000000 | 100000000 | 100000000 |
| tsfileSize(MB) | 805.3812895 | 781.4226151 | 781.8422318 | 793.3244705 |
| chunkDataSize_stats_mean(MB) | 80.53803624 | 78.14216614 | 78.18412781 | 79.33235168 |
| compressedPageSize_stats_mean(B) | 84386.25189 | 81874 | 81918 | 83122 |
| uncompressedPageSize_stats_mean(B) | 84386.25189 | 81874 | 81918 | 83122 |
| timeBufferSize_stats_mean(B) | 1872 | 1872 | 1872 | 1872 |
| valueBufferSize_stats_mean(B) | 82512.25189 | 80000 | 80044 | 81248 |
| [2] category: (A)get ChunkStatistic->(B)load on-disk Chunk->(C)get PageStatistics->(D)load in-memory PageData | | | | |
| [Avg&Per] (A)get_chunkMetadatas | 91331.98490000001 us - 0.8580518676474486% | 100944.7581 us - 1.2939556377951902% | 88098.20449999999 us - 0.9671805828234409% | 88231.2157 us - 0.9257222461823116% |
| [Avg&Per] (B)load_on_disk_chunk | 5552645.935400001 us - 52.16637107440095% | 5170158.3812 us - 66.27343223726832% | 5270914.364100001 us - 57.866400973957255% | 5526099.6186 us - 57.97985793317826% |
| [Avg&Per] (C)get_pageHeader | 8185.805399999992 us - 0.07690455451459878% | 7712.402700000001 us - 0.09886107156476356% | 7813.90999999998 us - 0.08578451820695045% | 7725.986500000003 us - 0.08106107934716146% |
| [Avg&Per] (D_1)decompress_pageData | 548160.3352000009 us - 5.149893543905802% | 525441.0348000005 us - 6.7353412114264035% | 585036.2351000007 us - 6.422783415941812% | 632154.6457000006 us - 6.632568914631739% |
| [Avg&Per] (D_2)decode_pageData | 4443785.968300003 us - 41.748778959531215% | 1996996.816400002 us - 25.598409841945312% | 3156902.088299994 us - 34.65785050907054% | 3276856.417400001 us - 34.38078982666053% |
| [3] D_1 compare each step inside | | | | |
| [Avg&Per] (D-1)7_1_data_ByteBuffer_to_ByteArray(us) | 110421.72189999989 us - 64.93156378407389% | 109687.91739999971 us - 63.92643734398307% | 113658.35759999987 us - 62.901942587101885% | 109187.47799999996 us - 63.08420663361515% |
| [Avg&Per] (D-1)7_2_data_decompress_PageDataByteArray(us) | 54624.26410000002 us - 32.12084386602246% | 57095.555900000094 us - 33.27545607007074% | 62164.880499999985 us - 34.403908579311135% | 59072.25890000005 us - 34.12961499817788% |
| [Avg&Per] (D-1)7_3_data_ByteArray_to_ByteBuffer(us) | 1179.3427000000347 us - 0.6934918640164246% | 1234.10400000004 us - 0.7192394012210651% | 1032.115900000022 us - 0.5712038820191121% | 1193.2719000000652 us - 0.6894253122110825% |
| [Avg&Per] (D-1)7_4_data_split_time_value_Buffer(us) | 3833.2921999999994 us - 2.254100485887217% | 3567.0158000000133 us - 2.0788671847251163% | 3835.9775000000177 us - 2.122944951567873% | 3629.1045000000054 us - 2.0967530559958814% |
| [3] D_2 compare each step inside | | | | |
| [Avg&Per] (D-2)8_1_createBatchData(us) | 6008.9294 us - 0.04720551599260821% | 6005.094 us - 0.058953749593284185% | 9136.1988 us - 0.07959246314858166% | 6219.250599999999 us - 0.053213386787106826% |
| [Avg&Per] (D-2)8_2_timeDecoder_hasNext(us) | 1795067.4479 us - 14.10186066084482% | 1862631.8749 us - 18.285997377780266% | 1805100.2815999999 us - 15.725618584694368% | 1838661.8702 us - 15.732028111177547% |
| [Avg&Per] (D-2)8_3_timeDecoder_readLong(us) | 2073615.2138 us - 16.290102549307967% | 2089493.8765 us - 20.513167449482335% | 2063469.3846 us - 17.976470800088325% | 2172029.5514 us - 18.58440125112089% |
| [Avg&Per] (D-2)8_4_valueDecoder_read(us) | 4636195.4124 us - 36.42146247963989% | 1880348.902 us - 18.459930571697104% | 3352494.9242 us - 29.206164899804453% | 3364558.2458 us - 28.787960289219587% |
| [Avg&Per] (D-2)8_5_checkValueSatisfyOrNot(us) | 1724239.435 us - 13.545443257159627% | 1784205.2864 us - 17.516060810611705% | 1723128.802 us - 15.01150190311584% | 1780807.3196 us - 15.237010821076382% |
| [Avg&Per] (D-2)8_6_putIntoBatchData(us) | 2494168.5891 us - 19.59392553705509% | 2563425.3348 us - 25.165890040835308% | 2525393.9444 us - 22.000651349148434% | 2525103.5280999998 us - 21.605386140618513% |
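
The percentage next to each timing in the step breakdowns appears to be that step's share of the summed step times within its group. The sketch below recomputes the shares for the GORILLA column of the D_2 group from the values in the table. The helper name `percent_shares` is made up for illustration and is not part of the benchmark scripts.

```python
# Average per-step timings (us) for the GORILLA column of the D_2 breakdown,
# copied from the table above.
d2_gorilla_us = {
    "8_1_createBatchData": 6008.9294,
    "8_2_timeDecoder_hasNext": 1795067.4479,
    "8_3_timeDecoder_readLong": 2073615.2138,
    "8_4_valueDecoder_read": 4636195.4124,
    "8_5_checkValueSatisfyOrNot": 1724239.435,
    "8_6_putIntoBatchData": 2494168.5891,
}


def percent_shares(step_us):
    """Return each step's share (in percent) of the summed step time.

    Hypothetical helper: assumes the reported percentage is step / sum(steps).
    """
    total = sum(step_us.values())
    return {name: 100.0 * t / total for name, t in step_us.items()}


shares = percent_shares(d2_gorilla_us)
for name, pct in shares.items():
    print(f"{name}: {pct:.2f}%")
```

Under this assumption the recomputed share for `8_4_valueDecoder_read` agrees with the 36.42% reported in the table, which suggests value decoding is the largest single D_2 step for GORILLA.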


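  • Overall, the synthetic dataset is not a good choice: its values are randomly generated, so none of the encodings compresses them efficiently (the TsFile sizes of the four encodings differ by only about 3%, and PLAIN is the smallest).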
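
A small sketch of the comparison behind this conclusion, using only the sizes reported in the table: it normalizes each TsFile size to the PLAIN result and checks that the PLAIN value buffer equals the raw payload of one page (10000 INT64 points at 8 bytes each). The variable names are chosen for illustration only.

```python
# TsFile sizes (MB) per value encoding, copied from the table above.
tsfile_size_mb = {
    "GORILLA": 805.3812895,
    "PLAIN": 781.4226151,
    "RLE": 781.8422318,
    "TS_2DIFF": 793.3244705,
}

# Size of each file relative to PLAIN (1.0 means "same size as unencoded").
baseline = tsfile_size_mb["PLAIN"]
relative_size = {enc: size / baseline for enc, size in tsfile_size_mb.items()}

# Raw payload of one page: pagePointNum values, 8 bytes per INT64.
page_point_num = 10000
raw_value_bytes = page_point_num * 8

for enc, ratio in relative_size.items():
    print(f"{enc}: {ratio:.4f} x PLAIN")
print(f"raw value bytes per page: {raw_value_bytes}")
```

Every ratio is at or above 1.0, meaning no encoding beats storing the random values as they are, and `raw_value_bytes` matches the 80000 B mean value buffer size reported for PLAIN.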


Experiment results on the CRRC (中车) dataset

RLValueEncodingRealExpScripts.sh

...