Table of Contents

Status

Current state: Under DiscussionAdopted

Discussion thread: dev discussion

...

CREATE CUSTOM INDEX ON person (index_name) USING 'StorageAttachedIndex' WITH OPTIONS = { }

Version 1 Features

AND queriesOR queries
Numeric range queries
Non-variable length numeric types
Text type indexes and equality queries
SSTable attached
Optional case sensitivity
Optional unicode normalization
Tokenization
Index versioning
Row-aware query path

Version 2 Features

Prefix LIKE
OR Queries
Range queries on all types
Global sorting

Indexing options

case_sensitive - If the index is case sensitive or not.
normalize - If the index uses unicode normalization or not.

Write path

Write path is mostly the same as SASI where multiple column indexes are attached to a single memtable.

During flush, SAI will make use of the index memtable to generate an on-disk index file to avoid re-indexing the flushed sstable twice.

On-disk versioning of indexes is supported by embedding version information in the on-disk file format.

Read path

The read path in SAI is similar to the SASI read path with a merge of postings from the in-memory and SSTable indexes using the RangeIterator framework.

...

The only "easy" way around these two challenges is to focus our efforts on queries that are restricted to either partitions or small token ranges. These queries behave well locally even on LCS (given levels contain token-disjoint SSTables, and assuming a low number of unleveled SSTables), avoid fan-out and all of its secondary pitfalls, and allow us to make queries at varying CLs with reasonable performance. Attempting to fix the local problems around compaction strategy could mean either restricted strategy usage or partially abandoning SSTable-attachment. Attempting to fix distributed read path problems by pushing the design towards IR systems like ES could compromise our ability to use higher read CLs.

Addendum

Index Versioning

Index versioning is supported by including the index version in the on-disk file name.

The index version is mapped to a codec internally to write and read the on-disk index files.

Index Format Version 1

The following applies to the version 1 index format.

...

Metrics include items such as: disk usage, memory usage, query latencies, compaction statistics, chunk cache hits/misses/lookups, open files.

Virtual Tables

Sets of the above metrics are also available through virtual tables. These metrics have been grouped into:

Index - global index metrics about the index build and query status and on-disk sizing.
SSTable - per-SSTable metrics
Segment - per-index metrics

The per-SSTable and per-index metrics relate to the on-disk structures described below.

Terminology

Row ID - A monotonic increasing integer associated with every row in a sstable. It’s stored in an index structure instead of key token or key offset, because it compresses better.
Postings/posting-list - Sorted row ids that match a given indexed value.
Primary key - A partition key and clustering representing a row in a SSTable
Primary key store - A bi-directional block store allowing Row ID → Primary Key and Primary Key → Row ID lookups.
Segment - A smallest unit of on-disk indexing structure that is flushed during compaction to reduce memory pressure. Multiple segments of an index are written to the same physical file.

...

Space shortcuts

Page tree

Versions Compared

Old Version 25

New Version Current

Key

Status

Version 1 Features

Version 2 Features

Indexing options

Write path

Read path

Addendum

Index Versioning

Index Format Version 1

Virtual Tables

Terminology

Space shortcuts

Page tree

Page History

Versions Compared

Old Version 25

New Version Current

Key

Status

Version 1 Features

Version 2 Features

Indexing options

Write path

Read path

Addendum

Index Versioning

Index Format Version 1

Virtual Tables

Terminology