You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 9 Next »

DRAFT

The CEP process itself has not been approved by the community yet. This is a strawman draft to start the conversation to such a process.


This page describes a proposed Cassandra Enhancement Proposal (CEP) process for proposing a major change to Cassandra.

To create your own CEP, click on Create CIP. If you don't have permission, please send an email with your Wiki ID to dev@cassandra.apache.org and request permission. Also add an entry to the table CEPs under discussion.

Purpose

The purpose of an CEP is to inform and involve both the user and dev communities in major improvements to Cassandra, from as early on in the development process as possible.


CEPs should be used for significant user-facing or cross-cutting changes, not small incremental improvements. When in doubt, if a committer thinks a change needs an CEP, it does. At the technical level, CEPs are intended to increase the likelihood that compatibility requirements are safely met. At a user level, CEPs are to help ensure all variants of Cassandra installations and usages are considered. 


What should be included in a CEP?

A CEP should contain the following sections: 

  • Scope,

  • Goals,

  • Approach,

  • Timeline,

  • Mailing list / Slack channels,

  • Related JIRA tickets.

Who should initiate the CEP?

Anyone can initiate a CEP but you shouldn't do it unless you have an intention and know-how of getting the work done to implement it.

A CEP needs to attract a Shepherd that is a PMC member who is committed to shepherding the proposed change throughout the entire process. Although the shepherd can delegate or work with other committers in the development process, the shepherd is ultimately responsible for the success or failure of the CEP. Responsibilities of the shepherd include, but are not limited to:

  • Be an advocate for the proposed change
  • Ensure the working group achieves consensus among key stakeholders
  • Ensure the working group seeks feedback from users and iterates on the design & implementation (see below for additional CEP documentation)
  • Uphold the quality of the changes, including verifying whether the changes satisfy the goal of the CEP and are absent of critical bugs before releasing them
  • Be committed to review code changes, ensuring project standards

The Process

Here is the process for making a CEP:

  1. Click Create CIP. Take the next available CEP number and give your proposal a descriptive heading. e.g. "CEP 1: Proposing an Apache Cassandra Management process".
  2. Fill in the sections as described above
  3. Start a [DISCUSS] thread on the Apache mailing list. Please ensure that the subject of the thread is of the format [DISCUSS] CEP-{your CEP number} {your CEP heading} The discussion should happen on the mailing list not on the wiki since the wiki comment system doesn't work well for larger discussions. In the process of the discussion you may update the proposal. You should let people know the changes you are making. When you feel you have a finalized proposal 
  4. Once the proposal is finalized call a [VOTE] to have the proposal adopted. These proposals are more serious than code changes and more serious even than release votes. The criteria for acceptance is lazy consensus (3 binding +1 votes and no binding vetoes). The vote should remain open for 72 hours.
  5. Please update the CEP wiki page, and the index below, to reflect the current stage of the CEP after a vote. This acts as the permanent record indicating the result of the CEP (e.g., Accepted or Rejected). Also report the result of the CEP vote to the voting thread on the mailing list so the conclusion is clear.


What is considered a "major change" that needs a CEP?

Any of the following should be considered a major change:

  • Any major new feature, subsystem, or piece of functionality
  • Any change that impacts the public interfaces of the project


Cassandra's Public Interfaces

Not all compatibility commitments are the same. We need to spend significantly more time on the wire protocols as these break code in lots of clients, cause downtime releases, etc. Public API/SPIs are next as they cause people to rebuild code and/or maintain abstraction layers in third-party libraries/frameworks. Configuration, monitoring, and command line tools can be faster and looser — changes here will break monitoring dashboards and require a bit of care during upgrades but aren't as big a burden.

For the most part monitoring, command line tool changes, and configs are added with new features so these can be done with a single CEP.

Cassandra requires a high level of compatibility between releases, especially to ensure rolling upgrades are possible. Core architectural elements can't break compatibility or shift functionality from release to release. As a result each new major feature or public api has to be done in a way that we can stick with it going forward.

Areas of compatibility, or "public interfaces" to the project, are 

  • native protocol (and CQL)
  • gossip and the messaging service
  • pluggable components (SPIs) like authorisation, triggers, …
  • commitlog, hintlog, cache files
  • sstables components 
  • configuration
  • jmx mbeans (including metrics)
  • monitoring
  • client tool classes
  • command line tools and arguments
  • operational routines


When making changes or additions that impact these areas we need to think through what we are doing as best we can prior to release. And as we go forward we need to stick to our decisions as much as possible. All technical decisions have pros and cons so it is important we capture the thought process that lead to a decision or design to avoid flip-flopping needlessly.

This process also isn't meant to discourage incompatible changes — proposing an incompatible change is totally legitimate. Sometimes we will have made a mistake and the best path forward is a clean break that cleans things up and gives us a good foundation going forward. Rather this is intended to avoid accidentally introducing half thought-out interfaces and protocols that cause needless heartburn when changed. Likewise the definition of "compatible" is itself squishy: small details like which errors are thrown when are clearly part of the contract but may need to change in some circumstances, likewise performance isn't part of the public contract but dramatic changes may break use cases. So we just need to use good judgement about how big the impact of an incompatibility will be and how big the payoff is.

Additional CEP Design Documentation

Suggestions for additional discussion and documentation, to help flesh out the implementation constraints, are…

  • Motivation: The problem to be solved.
  • Audience: The intended client audience. Examples include data scientists, data engineers, library devs, devops, etc. A single CEP can have multiple target personas. 
  • Goals: What must this allow users to do, that they can't currently.
  • Non-Goals: What problem(s) is this proposal not designed to solve.
  • Proposed Change: The new thing you want to do. This may be fairly extensive and have large subsections of its own. Or it may be a few sentences, depending on the scope of the change.
  • New or Changed Public Interfaces: Impact to any of the "compatibility commitments" described above. We want to call these out in particular so everyone thinks about them.
  • Migration Plan and Compatibility: If this feature requires additional support for a no-downtime upgrade describe how that will work.
  • Rejected Alternatives: What are the other alternatives you considered and why are they worse? The goal of this section is to help people understand why this is the best solution now, and also to prevent churn in the future when old alternatives are reconsidered.

List of CEPs

Adopted CEPs

CEP

Release



CEPs under discussion

CEPComment
CEP-1: Apache Cassandra Management Process(es)

Sent emails to Dev discussion group.
Work tracked under CASSANDRA-14395.

Dormant/inactive CEPs

CEPComment


Discarded CEPs

CEPComment


  • No labels