ID	IEP-49
Author	Igor Seliverstov
Sponsor	Maksim Timonin
Created	22 May 2020
Status	ACTIVE

Motivation

Since Ignite wants to leverage from several SQL engines we need to make work with index independent from the used SQL engine. We also should consider moving all machinery related to index to the core module to make it available from any module that wants to use it.

Description

Introduced abstractions:

Index - base index interface with common methods:

id() unique ID
name() unique name
isOnline()
onUpdate(oldRow(nullable), newRow(nullable)) where rows have an indexed types
find(Args) - where args are implementation specific arguments, like a TextQuery for a full text index or lowerBound + upperBound for sorted index, returns a cursor over found rows
unwrap(indexInterface extends Index) - gets index implementation interface instance, for example, it's obvious that fullText index may have some additional methods, different to hash or sorted index.
acquire(Cancellable owner, boolean force) when calls with force = true on all previous owners cancel() method calls and waits for all previous owners leave the index, after index acquired, destroy flag checks, so that, we wait for all index readers (like running queries) finish gracefully before the index is dropped or altered and cannot acquire dropped or altered index.
leave()
break() marks index instance as stale (after drop or altering) after this method called, acquire method throws an exception (index destroyed or modified)

IndexDefinition describes index implementation (should be enhanced for each index implementation and provide implementation specific parameters):

indexId - long identifier
indexName - String
sourceCacheId - cache id the index is created for
indexType - Enum (hash, sorted, fullText, userDefined)
indexedType type the index is built for
indexedColumns list of columns, it's needed to make possible skipping index updates when an indexed column was not changed
indexFactory - creates index instance
indexValidatorList - optional, the way to implement various constrains

IndexFactory creates specific index instance:

create(IndexManager, IndexDefinition) - on index is altered all internal structures may be obtained from previous instance using IndexManager

IndexLifecycleListener - all callbacks should be executed on both client and server nodes.

onIndexCreated(IndexDefinition)
onIndexModified(IndexDefinition)
onIndexDeleted(indexId)
onIndexStateChange(indexId, newState) - to make indexes online/offline

IndexManager allows next operations (like appropriate SQL commands):

createIndex(IndexDefinition)
alterIndex(IndexDefinition)
dropIndex(IndexId)
getIndex(indexId) - throws an exception on client nodes
listen(IndexLifecycleListener)
onRowUpdate(cacheId, oldRow, newRow) - callback method, called on cache entry update by IgniteCacheOffheapManager

Basic postulates:

Newly created index is in offline state
offline index cannot be read, only modified (in scope of index rebuild or regular updates)
index becomes online after index rebuild
Index may be created on cache start, by DDL command or by API call (direct call to IndexManager)
on index create an Index instance is created using provided factory, this way we may introduce geospatial indexes or prefix trees in future just providing specific factory.
sorted index represents a database index in terms of SQL and requires hash index created first (if not exists).
hash index is just a proxy to cache partitions and always online, it represents a table in terms of SQL. This way SQL queries may be executed before index is fully built

On index create:

index created - all indexes and definitions registered on all nodes, all indexes starts applying current updates
onIndexCreate() callback executes - index is registered in a query execution engine
index rebuild started - index is filling up with existing data
index rebuild finished - index is ready to use
onIndexStateChange() callback executes - index becomes available for a query execution engine.

On index read:

index is get by its Id from an index manager (if it doesn't exist an exception is thrown - cannot to execute due to schema change)
index is acquired for read, scan cancel (query cancel) callback is provided by reader (if it cannot acquire index, an exception is thrown - cannot to execute due to schema change)
index is reading (if a cancel occurs the index has to be leaved by reader)
reader leaves the index (while index is acquired the index cannot be dropped or modified)

On index delete:

index is get by its Id from an index manager (if it doesn't exist an exception is thrown - cannot to execute due to schema change)
index is acquired for write (all current readers may be cancelled or not, depending on force flag, if the index cannot be acquired, an exception is thrown - cannot to execute due to schema change)
an index marked as broken, all readers trying to acquire it right now and after will get an exception
onIndexDeleted() callback executes - index is deregistered from a query execution engine
all internal structures are destroyed gracefully

On index altered:

index is get by its Id from an index manager (if it doesn't exist an exception is thrown - cannot to execute due to schema change)
index is acquired for write (all current readers may be cancelled or not, depending on force flag, if the index cannot be acquired, an exception is thrown - cannot to execute due to schema change)
an index marked as broken, all readers trying to acquire it right now and after will get an exception
a new index is created on the basis of previous one and registers in the manager
onIndexModified() callback executes - index is updated in a query execution engine and becomes offline, planner shouldn't consider its using
in case the index needs rebuild,
1. index rebuild started - index is filling up with existing data
2. index rebuild finished - index is ready to use
onIndexStateChange() callback executes - index becomes available for a query execution engine.

Risks and Assumptions

New indexes should be binary compatible with current H2 indexes

Dev list discussion

http://apache-ignite-developers.2346864.n4.nabble.com/Basic-index-infrastructure-as-a-part-of-core-APIs-td47638.html

JIRA tickets

key	summary	type	created	updated	due	assignee	reporter	priority	status	resolution
JQL and issue key arguments for this macro require at least one Jira application link to be configured

Page tree

Motivation

Description

Introduced abstractions:

Basic postulates:

On index create:

On index read:

On index delete:

On index altered:

Risks and Assumptions

Dev list discussion

JIRA tickets

2 Comments

Alexey Goncharuk

Igor Seliverstov

Page tree

IEP-49: Basic index infrastructure as a part of core Ignite APIs

Motivation

Description

Introduced abstractions:

Basic postulates:

On index create:

On index read:

On index delete:

On index altered:

Risks and Assumptions

Dev list discussion

JIRA tickets

2 Comments

Alexey Goncharuk

Igor Seliverstov