Page History

...

JIRA:

Jira

server	ASF JIRA
columns	key,summary,type,created,updated,due,assignee,reporter,priority,status,resolution
maximumIssues	20
jqlQuery	key = KAFKA-8019 or key = KAFKA-7149 or key = KAFKA-6145 or key = KAFKA-6039
serverId	5aa69414-a9e9-3523-82ec-879b028fb15b

...

When the config value is "eager", the consumer would still use V0 of the consumer protocol as well as the rebalance behavior; if the config value is "cooperative", the consumer will then use the new V1 protocol as well as the new algorithm. Note the difference with KIP-415 here, that : since on consumer we cannot do not have the luxury to leverage on list of assignors to register multiple protocols and let leader to auto-switch to new versions, we need two rolling bounces instead of one rolling bounce to complete the upgrade, whereas Connect only need one rolling bounce (details below).

And we We'd update the above algorithm on leader (i.e. bullet point 3) as, such that we will first check the versions of subscription of all the members:

If all members are on V1, then follow the new algorithm.
If all members are on V0, then follow the old algorithm.
If there's no consensus, it means we are in a second rebalance then do the following:
1. For those members in V1, send assignment in V1 as well by just giving back their existing assigned-partitions from the subscription metadata and leaving the revoked-partitions empty.
2. For those members in V0, which means they have revoked their partitions but we do not know what are those partitions, we can only given them V0 assignment back with an empty assigned-partitions.

The key point behind this two rolling bounce and the additional check is that, we want to avoid the situation where leader is on old byte-code and only recognize V0, but due to compatibility would still be able to deserialize V1 protocol data from newer versioned members, and hence just go ahead and do the assignment while new versioned members did not revoke their partitions before joining the group.

...

So under case 3), those members in V0 will have a small window where no partitions are assigned, and some partitions are not assigned to anyone as well. Hence, we will add a new ConsumerCoordinatorMetrics, which client will record upon receiving the assignment with the error code

...

-------------------------------

For KStream, we are going to take a trade-off between “revoking all” and “revoking none” solution: we shall only revoke tasks that are being learned since last round. So when we assign learner tasks to new members, we shall also mark active tasks as "being learned" on current owners. Every time when a rebalance begins, the task owners will revoke the being learned tasks and join group without affecting other ongoing tasks. Learned tasks could then immediately transfer ownership without attempting for a second round of rebalance upon readiness. Compared with KIP-415, we are optimizing for fewer rebalances, but increasing the metadata size and sacrificing partial availability of the learner tasks.

---------------------------------------------------------------------------------------------

Part II: Streams Two-Phase Task Assignor

Now the second part of this KIP is on Streams' PartitionAssginor implementation on top of the consumer rebalance protocol. The main goal is to tweak the stickiness / workload balance trade-off so that we can reduce the stateful task's restoration time (hence the unavailability gap) when migrating it to a "new host" (here we use this term for a host who do not have previously restored state for this task, and hence need to restore from scratch, or even more generally speaking, a host whose local maintained state if far behind the actual state's progress, and hence would need to restore for a long time) from an existing active host.More specifically, we will push the following metadata bytes to the consumer's protocol:

Terminology

we shall define several terms for easy walkthrough of the algorithm.

...

Space shortcuts

Child pages

Versions Compared

Old Version 101

New Version 102

Key

Part II: Streams Two-Phase Task Assignor

Terminology