This is an interactive sample that supports a set of commands that allow you to interact with the AWS IoT Jobs Service. The sample includes both control plane commands (that require the AWS SDK for Java v2 and use HTTP as transport) and data plane commands (that use the v2 device SDK and use MQTT as transport). In a real use case, control plane commands would be issued by applications under control of the customer, while the data plane operations would be issued by software running on the IoT device itself.
Using the Jobs service and this sample requires an understanding of two closely-related but different service terms:
- Job - metadata describing a task that the user would like one or more devices to run
- Job Execution - metadata describing the state of a single device's attempt to execute a job
In particular, you could have many IoT devices (things) that belong to a thing group. You could create a Job that targets the thing group. Each device/thing would
manage its own individual Job Execution that corresponded to its attempt to fulfill the overall job request. In the section that follows, notice that all of the data-plane
commands use job-execution
while all of the control plane commands use job
.
Once connected, the sample supports the following commands:
Control Plane
create-job <jobId> <jobDocument-as-JSON>
- creates a new job resource that targets the thing/device the sample has been configured with. It is up to the device application to interpret the Job document appropriately and carry out the execution it describes.delete-job <jobId>
- delete a job. A job must be in a terminal state (all executions terminal) for this command to complete successfully.
Data Plane
get-pending-job-executions
- gets the state of all incomplete job executions for this thing/device.start-next-pending-job-execution
- if one or more pending job executions exist for this thing/device, attempts to transition the next one from QUEUED to IN_PROGRESS. Returns information about the newly-in-progress job execution, if it exists.describe-job-execution <jobId>
- gets the current state of this thing's execution of a particular job.update-job-execution <jobId> <SUCCEEDED | IN_PROGRESS | FAILED | CANCELED>
- updates the status field of this thing's execution of a particular job. SUCCEEDED, FAILED, and CANCELED are all terminal states.
Miscellaneous
help
- prints the set of supported commandsquit
- quits the sample application
Your IoT Core Thing's Policy must provide privileges for this sample to connect, subscribe, publish, and receive. Below is a sample policy that can be used on your IoT Core Thing that will allow this sample to run as intended.
Sample Policy
{ "Version": "2012-10-17", "Statement": [ { "Effect": "Allow", "Action": "iot:Publish", "Resource": [ "arn:aws:iot:region:account:topic/$aws/things/thingname/jobs/start-next", "arn:aws:iot:region:account:topic/$aws/things/thingname/jobs/*/update", "arn:aws:iot:region:account:topic/$aws/things/thingname/jobs/*/get", "arn:aws:iot:region:account:topic/$aws/things/thingname/jobs/get" ] }, { "Effect": "Allow", "Action": "iot:Receive", "Resource": [ "arn:aws:iot:region:account:topic/$aws/things/thingname/jobs/notify", "arn:aws:iot:region:account:topic/$aws/things/thingname/jobs/notify-next", "arn:aws:iot:region:account:topic/$aws/things/thingname/jobs/start-next/*", "arn:aws:iot:region:account:topic/$aws/things/thingname/jobs/*/update/*", "arn:aws:iot:region:account:topic/$aws/things/thingname/jobs/get/*", "arn:aws:iot:region:account:topic/$aws/things/thingname/jobs/*/get/*" ] }, { "Effect": "Allow", "Action": "iot:Subscribe", "Resource": [ "arn:aws:iot:region:account:topicfilter/$aws/things/thingname/jobs/notify", "arn:aws:iot:region:account:topicfilter/$aws/things/thingname/jobs/notify-next", "arn:aws:iot:region:account:topicfilter/$aws/things/thingname/jobs/start-next/*", "arn:aws:iot:region:account:topicfilter/$aws/things/thingname/jobs/*/update/*", "arn:aws:iot:region:account:topicfilter/$aws/things/thingname/jobs/get/*", "arn:aws:iot:region:account:topicfilter/$aws/things/thingname/jobs/*/get/*" ] }, { "Effect": "Allow", "Action": "iot:Connect", "Resource": "arn:aws:iot:region:account:client/test-*" } ] }
Replace with the following with the data from your AWS account:
<region>
: The AWS IoT Core region where you created your AWS IoT Core thing you wish to use with this sample. For exampleus-east-1
.<account>
: Your AWS IoT Core account ID. This is the set of numbers in the top right next to your AWS account name when using the AWS IoT Core website.<thingname>
: The name of your AWS IoT Core thing you want the device connection to be associated with
Note that in a real application, you may want to avoid the use of wildcards in your ClientID or use them selectively. Please follow best practices when working with AWS on production applications using the SDK. Also, for the purposes of this sample, please make sure your policy allows a client ID of test-*
to connect or use --client_id <client ID here>
to send the client ID your policy supports.
Additionally, the sample's control plane operations require that AWS credentials with appropriate permissions be sourceable by the default credentials provider chain of the v2 Java SDK. At a minimum, the following permissions must be granted:
Sample Policy
{ "Version": "2012-10-17", "Statement": [ { "Effect": "Allow", "Action": "iot:CreateJob", "Resource": [ "arn:aws:iot:region:account:job/*", "arn:aws:iot:region:account:thing/thingname" ] }, { "Effect": "Allow", "Action": "iot:DeleteJob", "Resource": [ "arn:aws:iot:region:account:job/*", "arn:aws:iot:region:account:thing/thingname" ] } ] }
Replace with the following with the data from your AWS account:
<region>
: The AWS IoT Core region where you created your AWS IoT Core thing you wish to use with this sample. For exampleus-east-1
.<account>
: Your AWS IoT Core account ID. This is the set of numbers in the top right next to your AWS account name when using the AWS IoT Core website.<thingname>
: The name of your AWS IoT Core thing you want the device connection to be associated with
Notice that you must provide iot:CreateJob
permission to all things targeted by your jobs as well as the job itself. In this example, we use a wildcard for the
job permission so that you can name the jobs whatever you would like.
To run the Jobs sample use the following command:
mvn compile exec:java -pl samples/JobsSandbox -Dexec.mainClass=jobs.JobsSandbox -Dexec.args="--endpoint <endpoint> --cert <path to certificate> --key <path to private key> --thing <thing name>"
If an AWS IoT Thing resource with the given name does not exist, the sample will first create it. Once the thing exists, the sample connects via MQTT and you can issue commands to the Jobs service and inspect the results. This walkthrough assumes a fresh thing that has no pre-existing jobs targeting it.
First, we check if there are any incomplete job executions for this device. Assuming the thing is freshly-created, we expect there to be nothing:
get-pending-job-executions
yields output like
GetPendingJobExecutionsResponse:
{"inProgressJobs":[],"queuedJobs":[],"timestamp":{},"clientToken":"5b79ec86-eefb-41ba-a61b-dde398fe16e8"}
from which we can see that the device has no pending job executions and no in-progress job executions.
Next, we'll create a couple of jobs that target the device:
create-job Job1 {"Todo":"Reboot"}
which yields output similar to
CreateJobResponse:
CreateJobResponse(JobArn=arn:aws:iot:us-east-1:123124136734:job/Job1, JobId=Job1)
Received JobExecutionsChanged event:
{"jobs":{"QUEUED":[{"jobId":"Job1","executionNumber":1,"versionNumber":1,"lastUpdatedAt":{},"queuedAt":{}}]},"timestamp":{}}
Received NextJobExecutionChanged event:
{"execution":{"jobId":"Job1","jobDocument":{"Todo":"Reboot"},"status":"QUEUED","queuedAt":{},"lastUpdatedAt":{},"versionNumber":1,"executionNumber":1},"timestamp":{}}
In addition to the successful (HTTP) response to the CreateJob API call, our action triggered two (MQTT-based) events: a JobExecutionsChanged event and a
NextJobExecutionChanged event. When the sample is run, it creates and opens two streaming operations that listen for these two different events, by using the
createJobExecutionsChangedStream
and createNextJobExecutionChangedStream
APIs.
A JobExecutionsChanged event is emitted every time either the queued or in-progress job execution sets change for the device. A NextJobExecutionChanged event is emitted only when the next job to be executed changes. So if you create N jobs targeting a device, you'll get N JobExecutionsChanged events, but only (up to) one NextJobExecutionChanged event (unless the device starts completing jobs, triggering additional NextJobExecutionChanged events).
Let's create a second job as well:
create-job Job2 {"ToDo":"Delete Root User"}
whose output might look like
CreateJobResponse:
CreateJobResponse(JobArn=arn:aws:iot:us-east-1:123124136734:job/Job2, JobId=Job2)
Received JobExecutionsChanged event:
{"jobs":{"QUEUED":[{"jobId":"Job1","executionNumber":1,"versionNumber":1,"lastUpdatedAt":{},"queuedAt":{}},{"jobId":"Job2","executionNumber":1,"versionNumber":1,"lastUpdatedAt":{},"queuedAt":{}}]},"timestamp":{}}
Notice how this time, there is no NextJobExecutionChanged event because the second job is behind the first, and therefore the next job execution hasn't changed. As we will see below, a NextJobExecutionChanged event referencing the second job will be emitted when the first job (in progress) is completed.
Our device now has two jobs queued that it needs to (pretend to) execute. Let's see how to do that, and what happens when we do.
The easiest way to start a job execution is via the startNextPendingJobExecution
API. This API takes the job execution at the head of the QUEUED list and moves it
into the IN_PROGRESS state, returning its job document in the process.
start-next-pending-job-execution
StartNextJobExecutionResponse:
{"clientToken":"a167c623-6156-4757-a552-0e98a461390a","execution":{"jobId":"Job1","jobDocument":{"Todo":"Reboot"},"status":"IN_PROGRESS","queuedAt":{},"startedAt":{},"lastUpdatedAt":{},"versionNumber":2,"executionNumber":1},"timestamp":{}}
Note that the response includes the job's document, which is what describes what the job actually entails. The contents of the job document and its interpretation and execution are the responsibility of the developer. Notice also that no events were emitted from the action of moving a job from the QUEUED state to the IN_PROGRESS state.
If we run getPendingJobExecutions
again, we see that Job1 is now in progress, while Job2 remains in the queued state:
get-pending-job-executions
GetPendingJobExecutionsResponse:
{"inProgressJobs":[{"jobId":"Job1","executionNumber":1,"versionNumber":2,"lastUpdatedAt":{},"queuedAt":{},"startedAt":{}}],"queuedJobs":[{"jobId":"Job2","executionNumber":1,"versionNumber":1,"lastUpdatedAt":{},"queuedAt":{}}],"timestamp":{},"clientToken":"775aa42a-9173-4392-af02-c9a1e46d22d2"}
A real device application would perform the job execution steps as needed. Let's assume that has been done. We need to tell the service the job has completed:
update-job-execution Job1 SUCCEEDED
will trigger output similar to
UpdateJobExecutionResponse:
{"clientToken":"6864b359-12dc-45de-b556-7302f5df7b39","timestamp":{}}
Received NextJobExecutionChanged event:
{"execution":{"jobId":"Job2","jobDocument":{"ToDo":"Delete Root User"},"status":"QUEUED","queuedAt":{},"lastUpdatedAt":{},"versionNumber":1,"executionNumber":1},"timestamp":{}}
Received JobExecutionsChanged event:
{"jobs":{"QUEUED":[{"jobId":"Job2","executionNumber":1,"versionNumber":1,"lastUpdatedAt":{},"queuedAt":{}}]},"timestamp":{}}
Notice we get a response as well as two events, since both
- The set of incomplete job executions set has changed.
- The next job to be executed has changed.
As expected, we can move Job2's execution into IN_PROGRESS by invoking startNextPendingJobExecution
again:
start-next-pending-job-execution
StartNextJobExecutionResponse:
{"clientToken":"d0d71122-666c-40fc-9886-9b8d64ceac53","execution":{"jobId":"Job2","jobDocument":{"ToDo":"Delete Root User"},"status":"IN_PROGRESS","queuedAt":{},"startedAt":{},"lastUpdatedAt":{},"versionNumber":2,"executionNumber":1},"timestamp":{}}
Let's pretend that the job execution failed. An update variant can notify the Jobs service of this fact:
update-job-execution Job2 FAILED
triggering
UpdateJobExecutionResponse:
{"clientToken":"5022f709-1e39-4f2e-ba12-c816533d3d8b","timestamp":{}}
Received JobExecutionsChanged event:
{"jobs":{},"timestamp":{}}
Received NextJobExecutionChanged event:
{"timestamp":{}}
At this point, no incomplete job executions remain.
When all executions for a given job have reached a terminal state (SUCCEEDED, FAILED, CANCELED), you can delete the job itself. This is a control plane operation that requires the v2 AWS SDK for Java and should not be performed by the device executing jobs:
delete-job Job1
yielding
DeleteJobResponse:
DeleteJobResponse()
The request will not fail, but the execution
field of the response will be empty, indicating that there is nothing to do.
What happens if I call startNextPendingJobExecution
twice in a row (or while another job is in the IN_PROGRESS state)?
The service will return the execution information for the IN_PROGRESS job again.
Since startNextPendingJobExecution
does not help here, the device application can manually update a job execution from the QUEUED state to the IN_PROGRESS
state in the same manner that it completes a job execution: use getPendingJobExecutions
to get the list of queued executions and use
updateJobExecution
to move one or more job executions into the IN_PROGRESS state.
A device's persistent job executor should:
- On startup, create and open streaming operations for both the JobExecutionsChanged and NextJobExecutionChanged events
- On startup, get and cache the set of incomplete job executions using
getPendingJobExecutions
- Keep the cached job execution set up to date by reacting appropriately to JobExecutionsChanged and NextJobExecutionChanged events
- While there are incomplete job executions, start and execute them one-at-a-time; otherwise wait for a new entry in the incomplete (queued) job executions set.