ID	IEP-20
Author	Vladimir Ozerov
Sponsor	Vladimir Ozerov
Created	25 Apr 2018
Status	DRAFT

Motivation

Compression is used extensively by all database vendors to reduce TCO and improve performance. Compression could be applied to different parts of the system to achieve the following goals:

Fewer writes to disk - fewer IO calls, less disk space needed
Fewer reads from disk - fewer IO calls, less RAM needed to accommodate the same number of records

Performance numbers published by other vendors suggest that we could expect a 2x-4x decrease in required disk space and a >1.5x increase in throughput (ops/sec) on typical workloads.

Competitive analysis

This section describes general compression approaches and their pros and cons. The following compression mechanisms are implemented in practice:

Data format improvements
Index prefix compression
Page-level compression
WAL compression
Per-column compression
Compression on file system level
Column store

Data Format Improvements

Efficient disk usage starts with proper data layout. Vendors strive to place data in pages in such a way that total overhead is kept as low as possible while still maintaining high read speed. Typically this is achieved as follows:

Common metadata is stored outside of the data page
Numeric types are written using varlen encoding (e.g. int data type may take 1-5 bytes instead of 4)
Fixed-length string data types (CHAR, NCHAR) are trimmed
NULL and zero values are optimized to consume no space.
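
The varlen point above can be illustrated with a minimal sketch. It assumes a LEB128-style encoding (7 payload bits per byte, high bit as a continuation flag), which is one common scheme and not necessarily what any particular vendor uses. The function names are hypothetical. With this scheme an unsigned 32-bit value takes 1 to 5 bytes; negative values would need an extra step such as zigzag mapping, which is omitted here.

```python
from typing import Tuple


def encode_uvarint(value: int) -> bytes:
    """Encode a non-negative integer using 7 payload bits per byte."""
    if value < 0:
        raise ValueError("value must be non-negative")
    out = bytearray()
    while True:
        low7 = value & 0x7F
        value >>= 7
        if value:
            out.append(low7 | 0x80)  # high bit set: more bytes follow
        else:
            out.append(low7)         # high bit clear: last byte
            return bytes(out)


def decode_uvarint(buf: bytes, pos: int = 0) -> Tuple[int, int]:
    """Decode one value starting at pos; return (value, next position)."""
    result = 0
    shift = 0
    while True:
        byte = buf[pos]
        pos += 1
        result |= (byte & 0x7F) << shift
        if not byte & 0x80:
            return result, pos
        shift += 7
```

Small values, which dominate many real columns (counters, flags, short lengths), fit in a single byte, while the worst case costs one byte more than the fixed 4-byte layout.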

Examples:

SQL Server [1]

[1] https://docs.microsoft.com/en-us/sql/relational-databases/data-compression/row-compression-implementation?view=sql-server-2017

Draft materials

Compression options:

1) File system (FS) level compression

2) Sparse files

3) Prefix compression (indexes)

4) Better format (varlen, common header, null fields)

5) Column store

6) Block compression

7) Per-column compression

8) Row compression

9) WAL compression

10) New algorithms (LSM, BWTree, ...)

https://www.percona.com/blog/2013/07/09/how-tokumx-gets-great-compression-for-mongodb/

https://github.com/facebook/mysql-5.6/wiki/MyRocks-advantages-over-InnoDB

------------------------------

Postgres:

1) TOAST: https://www.postgresql.org/docs/10/static/storage-toast.html

2) WAL: https://www.pgcon.org/2016/schedule/attachments/432_WAL-Reduction.pdf

Compress full pages
Remove holes in the page

3) Plans: https://habr.com/company/postgrespro/blog/337180/
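
The two WAL reductions listed under item 2 (compress full pages, remove holes in the page) can be sketched as follows. This is an illustration under stated assumptions, not PostgreSQL code: the page is modeled as a plain byte string, the unused region is passed in as hypothetical `lower` and `upper` offsets, and zlib stands in for whatever compressor a real system would use.

```python
import zlib

PAGE_SIZE = 8192  # assumption: 8 KB pages


def pack_page_image(page: bytes, lower: int, upper: int) -> bytes:
    """Drop the unused hole [lower, upper) and compress what is left."""
    if len(page) != PAGE_SIZE or not 0 <= lower <= upper <= PAGE_SIZE:
        raise ValueError("bad page size or hole bounds")
    return zlib.compress(page[:lower] + page[upper:])


def unpack_page_image(image: bytes, lower: int, upper: int) -> bytes:
    """Rebuild the page; the hole is restored as zero bytes."""
    body = zlib.decompress(image)
    return body[:lower] + bytes(upper - lower) + body[lower:]
```

The hole carries no data, so removing it is free in terms of information and costs almost no CPU, whereas compressing the remaining bytes trades CPU for log volume.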

------------------------------

MySQL:

1) Table Compression https://dev.mysql.com/doc/refman/5.7/en/innodb-compression-background.html

2) Page Compression https://mysqlserverteam.com/innodb-transparent-page-compression/

3) COMPRESS/UNCOMPRESS functions

4) Column compression (https://www.percona.com/doc/percona-server/LATEST/flexibility/compressed_columns.html)

https://mysqlserverteam.com/innodb-transparent-pageio-compression/

https://www.percona.com/blog/2017/11/20/innodb-page-compression/

https://www.percona.com/live/e17/sites/default/files/slides/Percona%20XtraDB_%20Compressed%20Columns%20with%20Dictionaries%20-%20An%20Alternative%20to%20InnoDB%20Table%20Compression%20-%20FileId%20-%20115392.pdf

http://techblog.constantcontact.com/devops/space-the-final-frontier-a-story-of-mysql-compression/

"The code changes required to get the old InnoDB compression to work properly were extensive and complex. Its tentacles are everywhere—I think that just about every module inside InnoDB has had modifications done to it in order to make it work properly when compression is in use. This complexity has its challenges, both in terms of maintainability and when it comes to improving the feature. We have been debating internally about what we should do about this over the long run. As much as we would like to redesign and rewrite the entire old compression code, it is impractical. Therefore we are only fixing issues in the old compression code reported by customers. We think there are better ways to solve the underlying performance and compression problems around B-Trees. For example by adding support for other types of indexes e.g. LSM tree and/or BW-Tree or some variation of the two. "

------------------------------

MariaDB

1) WAL compression - compress event data once a certain threshold is reached https://mariadb.com/kb/en/library/compressing-events-to-reduce-size-of-the-binary-log/

2) Page compression - uncompressed in memory, compressed on disk https://mariadb.com/kb/en/library/compression/

https://mariadb.org/significant-performance-boost-with-new-mariadb-page-compression-on-fusionio/

3) Good old MySQL COMPRESSED format (keeps both compressed and uncompressed data in memory and uses a reserved area in the page to store current modifications without recompression) https://mariadb.com/kb/en/library/xtradbinnodb-storage-formats/

4) Independent Column Compression - compresses and decompresses automatically; indexes cannot be created on these columns https://mariadb.com/kb/en/library/storage-engine-independent-column-compression/

5) COMPRESS function (similar to the one in MySQL?) https://mariadb.com/kb/en/library/compress/

?? 6) ColumnStore

General link: https://mariadb.com/kb/en/library/optimization-and-tuning-compression/

Bad experience with punch-holes in MySQL: https://mariadb.org/innodb-holepunch-compression-vs-the-filesystem-in-mariadb-10-1/

------------------------------

SQL Server

1) Row compression - metadata, varlen for numeric types, trimming for CHAR types https://docs.microsoft.com/en-us/sql/relational-databases/data-compression/row-compression-implementation?view=sql-server-2017

1.1) NULL and 0 take no bytes!

2) Page compression https://docs.microsoft.com/en-us/sql/relational-databases/data-compression/page-compression-implementation?view=sql-server-2017

3) COMPRESS command https://docs.microsoft.com/en-us/sql/t-sql/functions/compress-transact-sql?view=sql-server-2017

https://docs.microsoft.com/en-us/sql/relational-databases/data-compression/data-compression?view=sql-server-2017

https://docs.microsoft.com/en-us/sql/relational-databases/data-compression/enable-compression-on-a-table-or-index?view=sql-server-2017

------------------------------

Oracle

1) Basic compression - compression during bulk loads only (direct load, CREATE TABLE AS SELECT)

2) Advanced Row Compression (formerly OLTP Table Compression) - used for DML, keeps data compressed in memory; more CPU but less IO, so reads benefit anyway

3) Advanced LOB compression and deduplication - appears to be something similar to PG TOAST (?)

4) Index compression - just prefix compression https://blogs.oracle.com/dbstorage/compressing-your-indexes:-index-key-compression-part-1

?? 5) Advanced Index Compression

?? 6) Hybrid columnar compression (query level, archive level) - tremendous compression rates (up to 50x), 5-15x typical, "query" - improves scan performance, "archive" - for data archives

7) Compression at tablespace and table levels

8) Tablespace encryption is applied after compression; column encryption is applied before compression, so compression has no effect on encrypted columns

9) Indexes are compressed separately from data!

10) Clustered tables - only prefix compression is applicable

11) Heat Map - insight on how data is accessed

12) Advanced Row Compression - can read specific attributes without full decompression

https://www.oracle.com/us/assets/lad-2015-ses16380-pedregal-2604876.pdf

http://www.oracle.com/technetwork/database/options/compression/advanced-compression-wp-12c-1896128.pdf

https://blogs.oracle.com/dbstorage/advanced-row-compression-improvements-with-oracle-database-12c-release-2

https://docs.oracle.com/database/121/ADMIN/tables.htm#ADMIN-GUID-34D15DD1-0925-4C9A-BE8A-3EE91671E526

!!! https://blogs.oracle.com/dbstorage/updates-in-row-compressed-tables

"With Advanced Row Compression, when the block is full, it is compressed. More rows are then added (since more rows can now fit into the block), and the process of recompression is repeated several times until the rows in the block cannot be compressed further. Blocks are usually compressed and reformatted in their entirety, but, starting with Oracle Database 12c Release 2, in some cases the block can be partially compressed, hence resulting in CPU savings and extra compression."

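The fill-then-recompress cycle described in the quote above can be sketched as a small simulation. It makes several assumptions: zlib stands in for Oracle's actual block format, the block size is arbitrary, and the `Block` class and its fields are hypothetical names. Only the space accounting is modeled, not row lookup.

```python
import zlib

BLOCK_SIZE = 8192  # hypothetical block size in bytes


def frame(row: bytes) -> bytes:
    """Prefix a row with its 2-byte length so rows could be split again."""
    return len(row).to_bytes(2, "big") + row


class Block:
    def __init__(self) -> None:
        self.rows = []         # every row stored in the block, in insert order
        self.compressed = b""  # image adopted by the last useful compression pass
        self.tail_bytes = 0    # bytes of rows added after that pass
        self.passes = 0        # number of compression passes attempted

    def used(self) -> int:
        return len(self.compressed) + self.tail_bytes

    def try_insert(self, row: bytes) -> bool:
        size = len(frame(row))
        if self.used() + size > BLOCK_SIZE:
            self._recompress()  # block is full: compress, then retry
            if self.used() + size > BLOCK_SIZE:
                return False    # no further gain: the block is truly full
        self.rows.append(row)
        self.tail_bytes += size
        return True

    def _recompress(self) -> None:
        image = zlib.compress(b"".join(frame(r) for r in self.rows))
        self.passes += 1
        if len(image) < self.used():  # adopt the new image only if it saves space
            self.compressed = image
            self.tail_bytes = 0


block = Block()
n = 0
while block.try_insert(b"id=%08d;status=ACTIVE;region=EMEA" % n):
    n += 1
print(n, "rows stored after", block.passes, "compression passes")
```

Each pass frees less space than the previous one, so the loop converges after a modest number of passes. The cost is that every pass rewrites the whole block, which is the CPU overhead the quote says partial compression reduces.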
------------------------------

MongoDB

1) Block compression (Snappy, zlib) https://docs.mongodb.com/manual/core/wiredtiger/#compression

2) Prefix compression (indexes, once per page) https://docs.mongodb.com/manual/reference/glossary/#term-prefix-compression

3) WAL compression https://docs.mongodb.com/manual/core/wiredtiger/#storage-wiredtiger-journal

4) Configurable per-collection and per-index
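
Prefix compression from item 2 can be sketched as below. The sketch assumes the front-coding variant, where each key in a sorted page stores only the number of leading characters it shares with the previous key plus the remaining suffix; real engines differ in detail, and the function names are hypothetical.

```python
from typing import List, Tuple


def shared_prefix_len(a: str, b: str) -> int:
    """Number of leading characters that a and b have in common."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def compress_keys(sorted_keys: List[str]) -> List[Tuple[int, str]]:
    """Store each key as (shared prefix length, suffix) relative to the previous key."""
    out = []
    prev = ""
    for key in sorted_keys:
        shared = shared_prefix_len(prev, key)
        out.append((shared, key[shared:]))
        prev = key
    return out


def decompress_keys(entries: List[Tuple[int, str]]) -> List[str]:
    """Rebuild full keys by replaying the entries in order."""
    keys = []
    prev = ""
    for shared, suffix in entries:
        prev = prev[:shared] + suffix
        keys.append(prev)
    return keys
```

Because the first key of each page is stored in full, a page can be decoded on its own, which is consistent with the "once per page" note above. The trade-off is that reading the k-th key requires replaying the entries before it.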

https://www.mongodb.com/blog/post/new-compression-options-mongodb-30

https://serverfault.com/questions/826181/does-the-mongodb-3-2-wiredtiger-compression-include-stuff-stored-in-ram

http://ilearnasigoalong.blogspot.ru/2015/03/wired-tiger-how-to-reduce-your-mongdb.html

https://www.mongodb.com/presentations/a-technical-introduction-to-wiredtiger

https://www.objectrocket.com/blog/company/mongodb-3-0-wiredtiger-compression-and-performance/

"The cache generally stores uncompressed changes (the exception is for very large documents). The default snappy compression is fairly straightforward: it gathers data up to a maximum of 32KB, compresses it, and if compression is successful, writes the block rounded up to the nearest 4KB.

The alternative zlib compression works a little differently: it will gather more data and compress enough to fill a 32KB block on disk. This is more CPU intensive but generally results in better compression ratios (independent of the inherent differences between snappy and zlib)."

—Michael Cahill
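
The snappy path described in the quote above (gather up to 32KB, compress, write the block rounded up to the nearest 4KB if compression is successful) can be sketched as follows. Two assumptions are made: zlib is used as a stand-in because snappy is not in the Python standard library, and "successful" is taken to mean that the compressed block occupies fewer 4KB units than the raw data. The function names are hypothetical.

```python
import zlib
from typing import Tuple

MAX_RAW = 32 * 1024    # maximum amount of data gathered per block (from the quote)
ALLOC_UNIT = 4 * 1024  # on-disk allocation unit (from the quote)


def round_up(n: int, unit: int = ALLOC_UNIT) -> int:
    """Round n up to the nearest multiple of unit."""
    return -(-n // unit) * unit


def store_block(raw: bytes) -> Tuple[bytes, int, bool]:
    """Return (payload, bytes occupied on disk, compressed flag)."""
    if len(raw) > MAX_RAW:
        raise ValueError("block exceeds the gather limit")
    packed = zlib.compress(raw)
    # Assumption: keep the compressed form only if it saves at least one unit.
    if round_up(len(packed)) < round_up(len(raw)):
        return packed, round_up(len(packed)), True
    return raw, round_up(len(raw)), False
```

Rounding to the allocation unit means that a block compressed from 32KB to 4.1KB still occupies 8KB on disk, so the effective ratio is always somewhat worse than the raw compressor ratio.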

Motivation

TBD

Proposed changes

TBD

Tickets


IEP-20: Data Compression in Ignite
