Diego can run one-off work in the form of Tasks. When a Task is submitted, Diego allocates resources for the Task on a Cell, runs the Task on that Cell, and then reports the result of the Task. Tasks are guaranteed to run at most once.
We recommend interacting with Diego's Task functionality through the ExternalTaskClient interface. The calls exposed to external clients are specifically documented here.
Tasks in Diego undergo a lifecycle encoded in the Task state:
- When first created, a Task's state is
PENDING
. - When the
PENDING
Task is allocated to a Diego Cell, the Cell sets the Task's state toRUNNING
state, and populates the Task'sCellId
field with its own Cell ID. - When the Task completes, the Cell sets the
Failed
,FailureReason
, andResult
fields on the Task as appropriate, and sets the Task's state toCOMPLETED
.
At this point it is up to the Diego client to detect and resolve the completed Task. It can do this either by having set a completion callback URL on the Task when defined, or by polling for the Task and resolving and deleting it itself.
To prevent two Task clients from operating on the same completed Task at once, the BBS provides the RESOLVING
state on the Task. Any client intending to delete the Task must first successfully move it from the COMPLETED
state to the RESOLVING
state. For example, when Diego itself calls the completion callback URL on the Task, it first must transition the Task into the RESOLVING
state. External clients should adhere to the same convention.
Diego will automatically delete completed Tasks that remain unresolved after 2 minutes.
When submitting a task, a valid guid
, domain
, and TaskDefinition
should be provided to a Client's DesireTask method. See Defining Tasks for more detail on the TaskDefinition
fields.
The ExternalTaskClient
can be used to retrieve a Task from the BBS API. Fields on the Task not present in the TaskDefinition
represent its status. The returned task has the following additional attributes:
The State
field determines where in its lifecycle the Task is.
CellId
identifies which of Diego's Cells has accepted the Task workload.
Timestamps in nanoseconds since the start of UNIX epoch time (1970-01-01).
CreatedAt
is the time at which the Task was created.UpdatedAt
is the time at which the Task was last updated.FirstCompletedAt
is the time at which the Task first entered theCOMPLETED
state.
The FirstCompletedAt
timestamp is used to determine when a Task should be deleted during Task convergence after remaining unresolved for over 2 minutes.
If the client specifies a CompletionCallbackUrl
on the original Task definition, a TaskCallbackResponse
will be sent back as JSON to the specified URL when the task is completed.
This JSON-encoded TaskCallbackResponse
looks like the following:
{
"task_guid": "some-guid",
"failed": false,
"failure_reason": "some failure",
"result": "first 10KB of ResultFile",
"annotation": "arbitrary",
"created_at: 4567890434758937
}
Once a Task enters the COMPLETED
state, Failed
will be a boolean indicating whether the Task completed succesfully or unsuccesfully.
If Failed
is true
, FailureReason
will be a short string indicating why the Task failed. Sometimes, in the case of a RunAction
that has failed this may read exit status 1
. More detailed debugging of the Task may require retrieving the log messages from the Loggregator system.
If ResultFile
was specified and the Task has completed succesfully, Result
will include the first 10KB of the ResultFile
.
This is the arbitrary string that was specified in the TaskDefinition.