...

Current state: Under Discussion

Discussion thread: http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-53-Fine-Grained-Resource-Management-td31831.html

JIRA:

Released:

Please keep the discussion on the mailing list rather than commenting on the wiki (wiki discussions get unwieldy fast).

...

Replace the requestSlot interface with a new requestResource interface.


requestSlot

Parameters

  • SlotID
  • JobID
  • AllocationID
  • TargetAddress
  • ResourceManagerID

Return Value: Acknowledge

requestResource

Parameters

  • ResourceProfile
  • JobID
  • AllocationID
  • TargetAddress
  • ResourceManagerID

Return Value: SlotID
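
The difference between the two calls can be sketched in Java as follows. This is only an illustration under stated assumptions: ResourceProfile, SlotID, ResourceRequestGateway and ToyTaskExecutor are hypothetical stand-ins rather than Flink's actual classes, the identifiers are plain strings, and the idea that the task executor carves a slot out of a free resource pool is an assumption made for the example. The one point taken from the comparison above is the shape of the call: the caller passes a ResourceProfile instead of a SlotID and receives a SlotID instead of an Acknowledge.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-ins for illustration only; these are not Flink's actual classes.
final class ResourceProfile {
    final double cpuCores;
    final int memoryMb;

    ResourceProfile(double cpuCores, int memoryMb) {
        this.cpuCores = cpuCores;
        this.memoryMb = memoryMb;
    }
}

final class SlotID {
    final String taskExecutorId;
    final int slotNumber;

    SlotID(String taskExecutorId, int slotNumber) {
        this.taskExecutorId = taskExecutorId;
        this.slotNumber = slotNumber;
    }
}

// Proposed shape: no SlotID parameter; the callee chooses the slot and returns its SlotID.
interface ResourceRequestGateway {
    SlotID requestResource(
            ResourceProfile profile,
            String jobId,
            String allocationId,
            String targetAddress,
            String resourceManagerId);
}

// Toy in-memory task executor.
// Assumption: a slot is carved out of a free resource pool when a request arrives.
final class ToyTaskExecutor implements ResourceRequestGateway {
    private final String id;
    private double freeCpuCores;
    private int freeMemoryMb;
    private int nextSlotNumber = 0;
    private final Map<String, SlotID> slotsByAllocation = new HashMap<>();

    ToyTaskExecutor(String id, double cpuCores, int memoryMb) {
        this.id = id;
        this.freeCpuCores = cpuCores;
        this.freeMemoryMb = memoryMb;
    }

    @Override
    public SlotID requestResource(
            ResourceProfile profile,
            String jobId,
            String allocationId,
            String targetAddress,
            String resourceManagerId) {
        // Reject the request if the remaining pool cannot satisfy the profile.
        if (profile.cpuCores > freeCpuCores || profile.memoryMb > freeMemoryMb) {
            throw new IllegalStateException(
                    "Not enough free resources for allocation " + allocationId);
        }
        // Reserve the requested resources and hand back the identifier of the new slot.
        freeCpuCores -= profile.cpuCores;
        freeMemoryMb -= profile.memoryMb;
        SlotID slot = new SlotID(id, nextSlotNumber++);
        slotsByAllocation.put(allocationId, slot);
        return slot;
    }

    int allocatedSlotCount() {
        return slotsByAllocation.size();
    }
}
```

In this sketch a caller would write something like `new ToyTaskExecutor("te-1", 2.0, 2048).requestResource(new ResourceProfile(1.0, 1024), "job-1", "alloc-1", "addr", "rm-1")` and use the returned SlotID to refer to the slot afterwards, instead of naming the slot up front as with requestSlot.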


SlotReport

A slot report that task executors send to the resource manager (in registration or heartbeats) now consists of two kinds of information.

...

  • This FLIP deprecates the configuration option "taskmanager.numberOfTaskSlots" but remains compatible with it.

Test Plan

  • We need to update existing integration tests and add new ones dedicated to validating the new fine-grained resource management behaviors.
  • Other regular integration and end-to-end tests are also expected to fail if this is broken.

Rejected Alternatives

An alternative for setting slot sharing groups at compile time is to put each task with specified resource requirements into its own slot sharing group (except for tasks in colocation groups), and to put all tasks with unknown resource requirements into the same slot sharing group. It is rejected because it separates tasks from the same pipelined region into different slot sharing groups, which may lead to resource deadlocks.

...