jobs

Creates, updates, deletes, gets, or lists a jobs resource.

Overview

Name: jobs
Type: Resource
Id: google.dataproc.jobs

Fields

The following fields are returned by SELECT queries:

Successful response

| Name | Datatype | Description |
|------|----------|-------------|
| done | boolean | Output only. Indicates whether the job is completed. If the value is false, the job is still in progress. If true, the job is completed, and the status.state field indicates whether it was successful, failed, or cancelled. |
| driverControlFilesUri | string | Output only. If present, the location of miscellaneous control files which can be used as part of job setup and handling. If not present, control files might be placed in the same location as driver_output_uri. |
| driverOutputResourceUri | string | Output only. A URI pointing to the location of the stdout of the job's driver program. |
| driverSchedulingConfig | object | Optional. Driver scheduling configuration. (id: DriverSchedulingConfig) |
| flinkJob | object | Optional. Job is a Flink job. (id: FlinkJob) |
| hadoopJob | object | Optional. Job is a Hadoop job. (id: HadoopJob) |
| hiveJob | object | Optional. Job is a Hive job. (id: HiveJob) |
| jobUuid | string | Output only. A UUID that uniquely identifies a job within the project over time. This is in contrast to the user-settable reference.job_id, which might be reused over time. |
| labels | object | Optional. The labels to associate with this job. Label keys must contain 1 to 63 characters and conform to RFC 1035 (https://www.ietf.org/rfc/rfc1035.txt). Label values can be empty, but, if present, must contain 1 to 63 characters and conform to RFC 1035. No more than 32 labels can be associated with a job. |
| pigJob | object | Optional. Job is a Pig job. (id: PigJob) |
| placement | object | Required. Job information, including how, when, and where to run the job. (id: JobPlacement) |
| prestoJob | object | Optional. Job is a Presto job. (id: PrestoJob) |
| pysparkJob | object | Optional. Job is a PySpark job. (id: PySparkJob) |
| reference | object | Optional. The fully qualified reference to the job, which can be used to obtain the equivalent REST path of the job resource. If this property is not specified when a job is created, the server generates a job_id. (id: JobReference) |
| scheduling | object | Optional. Job scheduling configuration. (id: JobScheduling) |
| sparkJob | object | Optional. Job is a Spark job. (id: SparkJob) |
| sparkRJob | object | Optional. Job is a SparkR job. (id: SparkRJob) |
| sparkSqlJob | object | Optional. Job is a SparkSql job. (id: SparkSqlJob) |
| status | object | Output only. The job status. Additional application-specific status information might be contained in the type_job and yarn_applications fields. (id: JobStatus) |
| statusHistory | array | Output only. The previous job statuses. |
| trinoJob | object | Optional. Job is a Trino job. (id: TrinoJob) |
| yarnApplications | array | Output only. The collection of YARN applications spun up by this job. Beta Feature: this report is available for testing purposes only and might be changed before final release. |

Methods

The following methods are available for this resource:

| Name | Accessible by | Required Params | Optional Params | Description |
|------|---------------|-----------------|-----------------|-------------|
| projects_regions_jobs_get | select | projectId, region, jobId | | Gets the resource representation for a job in a project. |
| projects_regions_jobs_list | select | projectId, region | pageSize, pageToken, clusterName, jobStateMatcher, filter | Lists regions/{region}/jobs in a project. |
| projects_regions_jobs_patch | update | projectId, region, jobId | updateMask | Updates a job in a project. |
| projects_regions_jobs_delete | delete | projectId, region, jobId | | Deletes the job from the project. If the job is active, the delete fails and the response returns FAILED_PRECONDITION. |
| projects_regions_jobs_submit | exec | projectId, region | | Submits a job to a cluster. |
| projects_regions_jobs_submit_as_operation | exec | projectId, region | | Submits a job to a cluster as an asynchronous operation. |
| projects_regions_jobs_cancel | exec | projectId, region, jobId | | Starts a job cancellation request. To access the job resource after cancellation, call regions/{region}/jobs.list (https://cloud.google.com/dataproc/docs/reference/rest/v1/projects.regions.jobs/list) or regions/{region}/jobs.get (https://cloud.google.com/dataproc/docs/reference/rest/v1/projects.regions.jobs/get). |

Parameters

Parameters can be passed in the WHERE clause of a query. Check the Methods section to see which parameters are required or optional for each operation.

| Name | Datatype | Description |
|------|----------|-------------|
| jobId | string | |
| projectId | string | |
| region | string | |
| clusterName | string | |
| filter | string | |
| jobStateMatcher | string | |
| pageSize | integer (int32) | |
| pageToken | string | |
| updateMask | string (google-fieldmask) | |

SELECT examples

Gets the resource representation for a job in a project.

SELECT
  done,
  driverControlFilesUri,
  driverOutputResourceUri,
  driverSchedulingConfig,
  flinkJob,
  hadoopJob,
  hiveJob,
  jobUuid,
  labels,
  pigJob,
  placement,
  prestoJob,
  pysparkJob,
  reference,
  scheduling,
  sparkJob,
  sparkRJob,
  sparkSqlJob,
  status,
  statusHistory,
  trinoJob,
  yarnApplications
FROM google.dataproc.jobs
WHERE projectId = '{{ projectId }}' -- required
AND region = '{{ region }}' -- required
AND jobId = '{{ jobId }}'; -- required
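
The list method is also reached via SELECT. A minimal sketch that lists jobs in a region, optionally narrowed to one cluster; it assumes that omitting jobId selects the list method, and the other optional parameters from the Methods table (filter, jobStateMatcher, pageSize, pageToken) can be added to the WHERE clause the same way:

SELECT
  reference,
  placement,
  status,
  statusHistory
FROM google.dataproc.jobs
WHERE projectId = '{{ projectId }}' -- required
AND region = '{{ region }}' -- required
AND clusterName = '{{ clusterName }}'; -- optional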

UPDATE examples

Updates a job in a project.

UPDATE google.dataproc.jobs
SET
  data__reference = '{{ reference }}',
  data__placement = '{{ placement }}',
  data__hadoopJob = '{{ hadoopJob }}',
  data__sparkJob = '{{ sparkJob }}',
  data__pysparkJob = '{{ pysparkJob }}',
  data__hiveJob = '{{ hiveJob }}',
  data__pigJob = '{{ pigJob }}',
  data__sparkRJob = '{{ sparkRJob }}',
  data__sparkSqlJob = '{{ sparkSqlJob }}',
  data__prestoJob = '{{ prestoJob }}',
  data__trinoJob = '{{ trinoJob }}',
  data__flinkJob = '{{ flinkJob }}',
  data__labels = '{{ labels }}',
  data__scheduling = '{{ scheduling }}',
  data__driverSchedulingConfig = '{{ driverSchedulingConfig }}'
WHERE
  projectId = '{{ projectId }}' -- required
  AND region = '{{ region }}' -- required
  AND jobId = '{{ jobId }}' -- required
  AND updateMask = '{{ updateMask }}'
RETURNING
  done,
  driverControlFilesUri,
  driverOutputResourceUri,
  driverSchedulingConfig,
  flinkJob,
  hadoopJob,
  hiveJob,
  jobUuid,
  labels,
  pigJob,
  placement,
  prestoJob,
  pysparkJob,
  reference,
  scheduling,
  sparkJob,
  sparkRJob,
  sparkSqlJob,
  status,
  statusHistory,
  trinoJob,
  yarnApplications;
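
A narrower sketch that updates only the labels on a job; it assumes updateMask takes a comma-separated list of field paths, as is standard for the google-fieldmask format:

UPDATE google.dataproc.jobs
SET
  data__labels = '{{ labels }}'
WHERE
  projectId = '{{ projectId }}' -- required
  AND region = '{{ region }}' -- required
  AND jobId = '{{ jobId }}' -- required
  AND updateMask = 'labels';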

DELETE examples

Deletes the job from the project. If the job is active, the delete fails, and the response returns FAILED_PRECONDITION.

DELETE FROM google.dataproc.jobs
WHERE projectId = '{{ projectId }}' -- required
AND region = '{{ region }}' -- required
AND jobId = '{{ jobId }}'; -- required

Lifecycle Methods

Submits a job to a cluster.

EXEC google.dataproc.jobs.projects_regions_jobs_submit
@projectId='{{ projectId }}', -- required
@region='{{ region }}' -- required
@@json=
'{
  "job": "{{ job }}",
  "requestId": "{{ requestId }}"
}';
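
The other exec methods follow the same pattern. Below are sketches for submitting a job as a long-running operation and for cancelling a job; they assume submit_as_operation accepts the same request body as submit, and that cancel needs no body beyond its required parameters, consistent with the Methods table above:

EXEC google.dataproc.jobs.projects_regions_jobs_submit_as_operation
@projectId='{{ projectId }}', -- required
@region='{{ region }}' -- required
@@json=
'{
  "job": "{{ job }}",
  "requestId": "{{ requestId }}"
}';

EXEC google.dataproc.jobs.projects_regions_jobs_cancel
@projectId='{{ projectId }}', -- required
@region='{{ region }}', -- required
@jobId='{{ jobId }}'; -- required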