...

We define a new trogdor-metrics group that captures the metrics as defined below.

Metric/Attribute Name    Description
active-agents-count      The total number of active agents in the Trogdor cluster
created-task-count       The total number of created tasks in the Trogdor cluster
running-task-count       The total number of running tasks in the Trogdor cluster
done-task-count          The total number of done tasks in the Trogdor cluster

All metrics listed above are cumulative sums of the number of tasks or agents that have entered each respective state. Consequently, once a Trogdor cluster has finished all of its tasks, every task has been counted once as created, once as running, and once as done, so created-task-count = running-task-count = done-task-count is expected.
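The cumulative behaviour can be illustrated with a minimal sketch in plain Java. The class name CumulativeTaskCounters and its methods are hypothetical and are not part of Trogdor or of this proposal; the sketch only assumes that each counter is incremented when a task enters a state and is never decremented.

```java
import java.util.concurrent.atomic.LongAdder;

// Hypothetical illustration only; not the proposed TrogdorMetrics class.
final class CumulativeTaskCounters {
    private final LongAdder created = new LongAdder();
    private final LongAdder running = new LongAdder();
    private final LongAdder done = new LongAdder();

    // Each method records that one more task has entered the given state.
    // Counters are never decremented when a task leaves a state.
    void taskCreated() { created.increment(); }
    void taskRunning() { running.increment(); }
    void taskDone() { done.increment(); }

    long createdCount() { return created.sum(); }
    long runningCount() { return running.sum(); }
    long doneCount() { return done.sum(); }

    // True when the three cumulative counts are equal, which is the
    // expected condition once every created task has run and finished.
    boolean countsConverged() {
        long c = createdCount();
        return c == runningCount() && c == doneCount();
    }
}
```

Because the counts never decrease, running-task-count in this sketch does not drop when a task finishes; it simply stops growing, which is why all three values meet once the cluster is idle.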

Proposed Changes

We propose adding a TrogdorMetrics class to Trogdor that exposes the aforementioned metrics. Since Trogdor agents and tasks share a common Platform class, a TrogdorContainer class will be created inside the Platform class so that the Agent and Coordinator classes can share a single TrogdorMetrics instance.
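One way to picture the sharing arrangement is the following sketch in plain Java. Everything in it is an assumption made for illustration: the nested Metrics, Platform, Container, Agent, and Coordinator types are simplified stand-ins, not the real Trogdor classes or the proposed TrogdorMetrics and TrogdorContainer implementations, and the counters use AtomicLong rather than any Kafka metrics API.

```java
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical stand-ins for illustration; not the actual Trogdor classes.
final class SharedMetricsSketch {

    // Stand-in for a metrics holder with two of the proposed counters.
    static final class Metrics {
        final AtomicLong activeAgents = new AtomicLong();
        final AtomicLong createdTasks = new AtomicLong();
    }

    // Stand-in for the platform; it owns one container for its lifetime.
    static final class Platform {
        // Stand-in for the container that holds the shared metrics instance.
        static final class Container {
            private final Metrics metrics = new Metrics();
            Metrics metrics() { return metrics; }
        }

        private final Container container = new Container();
        Container container() { return container; }
    }

    // The agent obtains the metrics through the platform it was given.
    static final class Agent {
        private final Metrics metrics;
        Agent(Platform platform) {
            this.metrics = platform.container().metrics();
            this.metrics.activeAgents.incrementAndGet();
        }
        Metrics metrics() { return metrics; }
    }

    // The coordinator obtains the same instance through the same platform.
    static final class Coordinator {
        private final Metrics metrics;
        Coordinator(Platform platform) {
            this.metrics = platform.container().metrics();
        }
        void createTask() { metrics.createdTasks.incrementAndGet(); }
        Metrics metrics() { return metrics; }
    }
}
```

In this sketch an agent and a coordinator constructed from the same platform object read and update the same counters, which is the property the container is meant to provide.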

...