You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 9 Next »

IDIEP-7
AuthorAnton Vinogradov
SponsorAnton Vinogradov
CreatedNov 14, 2017
StatusDRAFT


Motivation

Internal problems may cause unexpected cluster behaviour.
We should determine behavior in case any internal problems happened.

Description

Internal problems can be split to

1) OOM or any other reason cause node crash

2) Situations required graceful node shutdown with custom notification
- IgniteOutOfMemoryException
- Persistence errors
- ExchangeWorker exits with error

3) Prefomance issues should be covered by metrics
- GC STW duration
- Timed out tasks and jobs
- TX deadlock
- Hanged Tx (waits for some service)
- Java Deadlocks

4) Situations required external monitoring implementation
- GC STW duration exceed maximum possible length

Risks and Assumptions

// Describe project risks, such as API or binary compatibility issues, major protocol changes, etc.

Discussion Links

// Links to discussions on the devlist, if applicable.

Reference Links

Deadlock Detection And Cluster Protection.

Tickets

key summary type created updated due assignee reporter priority status resolution

JQL and issue key arguments for this macro require at least one Jira application link to be configured

  • No labels