data_labeling_jobs

Creates, deletes, gets, lists, or cancels a `data_labeling_jobs` resource.

Overview

Name	`data_labeling_jobs`
Type	Resource
Id	`google.aiplatform.data_labeling_jobs`

Fields

The following fields are returned by `SELECT` queries (both the `get` and `list` methods):

Name	Datatype	Description
`name`	`string`	Output only. Resource name of the DataLabelingJob.
`activeLearningConfig`	`object`	Parameters that configure the active learning pipeline. Active learning will label the data incrementally via several iterations. For every iteration, it will select a batch of data based on the sampling strategy. (id: GoogleCloudAiplatformV1ActiveLearningConfig)
`annotationLabels`	`object`	Labels to assign to annotations generated by this DataLabelingJob. Label keys and values can be no longer than 64 characters (Unicode codepoints), can only contain lowercase letters, numeric characters, underscores and dashes. International characters are allowed. See https://goo.gl/xmQnxf for more information and examples of labels. System reserved label keys are prefixed with "aiplatform.googleapis.com/" and are immutable.
`createTime`	`string (google-datetime)`	Output only. Timestamp when this DataLabelingJob was created.
`currentSpend`	`object`	Output only. Estimated cost (in US dollars) that the DataLabelingJob has incurred to date. (id: GoogleTypeMoney)
`datasets`	`array`	Required. Dataset resource names. Currently, only a single Dataset is supported. Format: `projects/{project}/locations/{location}/datasets/{dataset}`
`displayName`	`string`	Required. The user-defined display name of the DataLabelingJob. The name can be up to 128 characters long and can consist of any UTF-8 characters.
`encryptionSpec`	`object`	Customer-managed encryption key spec for a DataLabelingJob. If set, this DataLabelingJob will be secured by this key. Note: Annotations created in the DataLabelingJob are associated with the EncryptionSpec of the Dataset they are exported to. (id: GoogleCloudAiplatformV1EncryptionSpec)
`error`	`object`	Output only. DataLabelingJob errors. Populated only when the job's state is `JOB_STATE_FAILED` or `JOB_STATE_CANCELLED`. (id: GoogleRpcStatus)
`inputs`	`any`	Required. Input config parameters for the DataLabelingJob.
`inputsSchemaUri`	`string`	Required. Points to a YAML file stored on Google Cloud Storage describing the config for a specific type of DataLabelingJob. The schema files that can be used here are found in the https://storage.googleapis.com/google-cloud-aiplatform bucket in the /schema/datalabelingjob/inputs/ folder.
`instructionUri`	`string`	Required. The Google Cloud Storage location of the instruction PDF. This PDF is shared with labelers and provides a detailed description of how to label DataItems in Datasets.
`labelerCount`	`integer (int32)`	Required. Number of labelers to work on each DataItem.
`labelingProgress`	`integer (int32)`	Output only. Current labeling job progress as a percentage in the interval [0, 100], indicating the share of DataItems that have been finished.
`labels`	`object`	The labels with user-defined metadata to organize your DataLabelingJobs. Label keys and values can be no longer than 64 characters (Unicode codepoints), can only contain lowercase letters, numeric characters, underscores and dashes. International characters are allowed. See https://goo.gl/xmQnxf for more information and examples of labels. System reserved label keys are prefixed with "aiplatform.googleapis.com/" and are immutable. The following system labels exist for each DataLabelingJob: * "aiplatform.googleapis.com/schema": output only, its value is the inputs_schema's title.
`specialistPools`	`array`	The SpecialistPools' resource names associated with this job.
`state`	`string`	Output only. The detailed state of the job.
`updateTime`	`string (google-datetime)`	Output only. Timestamp when this DataLabelingJob was updated most recently.
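
The label constraints stated for `annotationLabels` and `labels` can be checked client-side before a job is created. The following is a minimal Python sketch of that rule as worded above; the function names are hypothetical, the service may enforce additional rules not listed here, and system-reserved keys (those prefixed with "aiplatform.googleapis.com/") are deliberately out of scope because they are immutable.

```python
MAX_LABEL_LENGTH = 64  # Unicode code points, per the field description


def _char_allowed(ch):
    """True for '_', '-', decimal digits, and letters that are not uppercase."""
    if ch in "_-" or ch.isdecimal():
        return True
    # Accept lowercase letters and letters from scripts without case
    # (the description says international characters are allowed).
    return ch.isalpha() and ch == ch.lower()


def label_problems(labels):
    """Return a list of problems for a user-defined labels dict (empty = OK)."""
    problems = []
    for key, value in labels.items():
        for role, text in (("key", key), ("value", value)):
            if len(text) > MAX_LABEL_LENGTH:
                problems.append(
                    f"{role} {text!r} is longer than {MAX_LABEL_LENGTH} characters"
                )
            bad = sorted({ch for ch in text if not _char_allowed(ch)})
            if bad:
                problems.append(
                    f"{role} {text!r} contains disallowed characters: {bad}"
                )
    return problems
```

An empty list means the mapping passes these checks; each entry otherwise names the offending key or value.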

Methods

The following methods are available for this resource:

Name	Accessible by	Required Params	Optional Params	Description
`get`	`select`	`projectsId`, `locationsId`, `dataLabelingJobsId`		Gets a DataLabelingJob.
`list`	`select`	`projectsId`, `locationsId`	`filter`, `pageSize`, `pageToken`, `readMask`, `orderBy`	Lists DataLabelingJobs in a Location.
`create`	`insert`	`projectsId`, `locationsId`		Creates a DataLabelingJob.
`delete`	`delete`	`projectsId`, `locationsId`, `dataLabelingJobsId`		Deletes a DataLabelingJob.
`cancel`	`exec`	`projectsId`, `locationsId`, `dataLabelingJobsId`		Cancels a DataLabelingJob. Success of cancellation is not guaranteed.

Parameters

Parameters can be passed in the WHERE clause of a query. Check the Methods section to see which parameters are required or optional for each operation.

Name	Datatype	Description
`dataLabelingJobsId`	`string`
`locationsId`	`string`
`projectsId`	`string`
`filter`	`string`
`orderBy`	`string`
`pageSize`	`integer (int32)`
`pageToken`	`string`
`readMask`	`string (google-fieldmask)`
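
The required and optional parameters from the Methods table can be encoded as data and checked before a query is issued. This is a hedged Python sketch: the `METHODS` mapping simply mirrors the table above, and `check_params` is a hypothetical helper, not part of any library. Request body fields for `create` (such as `displayName`) are not parameters and are not covered here.

```python
ID_PARAMS = {"projectsId", "locationsId", "dataLabelingJobsId"}

# Mirrors the Methods table: required and optional parameters per method.
METHODS = {
    "get": {"required": ID_PARAMS, "optional": set()},
    "list": {
        "required": {"projectsId", "locationsId"},
        "optional": {"filter", "pageSize", "pageToken", "readMask", "orderBy"},
    },
    "create": {"required": {"projectsId", "locationsId"}, "optional": set()},
    "delete": {"required": ID_PARAMS, "optional": set()},
    "cancel": {"required": ID_PARAMS, "optional": set()},
}


def check_params(method, params):
    """Return (missing, unknown) parameter names for the given method."""
    spec = METHODS[method]
    supplied = set(params)
    missing = sorted(spec["required"] - supplied)
    unknown = sorted(supplied - spec["required"] - spec["optional"])
    return missing, unknown
```

Both lists are empty when every required parameter is present and nothing unrecognized was supplied.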

`SELECT` examples

get
list

Gets a DataLabelingJob.

SELECT
name,
activeLearningConfig,
annotationLabels,
createTime,
currentSpend,
datasets,
displayName,
encryptionSpec,
error,
inputs,
inputsSchemaUri,
instructionUri,
labelerCount,
labelingProgress,
labels,
specialistPools,
state,
updateTime
FROM google.aiplatform.data_labeling_jobs
WHERE projectsId = '{{ projectsId }}' -- required
AND locationsId = '{{ locationsId }}' -- required
AND dataLabelingJobsId = '{{ dataLabelingJobsId }}' -- required
;
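
The `{{ ... }}` markers in the query above are placeholders to be replaced with real values. One way to fill them is sketched below in Python. `render_query` is a hypothetical helper, the identifiers in the usage note are made-up values, and doubling single quotes is the standard SQL escaping convention, assumed here to be sufficient.

```python
import re

# Matches placeholders such as {{ projectsId }}.
PLACEHOLDER = re.compile(r"\{\{\s*(\w+)\s*\}\}")

GET_TEMPLATE = """SELECT name, state, labelingProgress
FROM google.aiplatform.data_labeling_jobs
WHERE projectsId = '{{ projectsId }}'
AND locationsId = '{{ locationsId }}'
AND dataLabelingJobsId = '{{ dataLabelingJobsId }}';"""


def render_query(template, values):
    """Replace each {{ name }} with its value, escaping single quotes."""

    def substitute(match):
        name = match.group(1)
        if name not in values:
            raise KeyError(f"no value supplied for placeholder {name!r}")
        return str(values[name]).replace("'", "''")

    return PLACEHOLDER.sub(substitute, template)
```

For example, `render_query(GET_TEMPLATE, {"projectsId": "my-project", "locationsId": "us-central1", "dataLabelingJobsId": "1234567890"})` returns the query with all three placeholders filled in; a missing value raises `KeyError` rather than leaving a placeholder in the statement.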

Lists DataLabelingJobs in a Location.

SELECT
name,
activeLearningConfig,
annotationLabels,
createTime,
currentSpend,
datasets,
displayName,
encryptionSpec,
error,
inputs,
inputsSchemaUri,
instructionUri,
labelerCount,
labelingProgress,
labels,
specialistPools,
state,
updateTime
FROM google.aiplatform.data_labeling_jobs
WHERE projectsId = '{{ projectsId }}' -- required
AND locationsId = '{{ locationsId }}' -- required
AND filter = '{{ filter }}'
AND pageSize = '{{ pageSize }}'
AND pageToken = '{{ pageToken }}'
AND readMask = '{{ readMask }}'
AND orderBy = '{{ orderBy }}'
;
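
Because `filter`, `pageSize`, `pageToken`, `readMask` and `orderBy` are optional, a `list` query only needs predicates for the ones actually supplied. The Python sketch below builds such a query. `build_list_query` is a hypothetical helper, the default column subset is arbitrary, and emitting integers unquoted (as `{{ labelerCount }}` is in the `INSERT` example) is an assumption.

```python
OPTIONAL_LIST_PARAMS = ("filter", "pageSize", "pageToken", "readMask", "orderBy")


def _literal(value):
    """Render an int as-is and anything else as a quoted SQL string."""
    if isinstance(value, int) and not isinstance(value, bool):
        return str(value)
    return "'" + str(value).replace("'", "''") + "'"


def build_list_query(project_id, location_id,
                     columns=("name", "displayName", "state"), **optional):
    """Build a list query that includes only the optional params provided."""
    unknown = set(optional) - set(OPTIONAL_LIST_PARAMS)
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    predicates = [
        f"projectsId = {_literal(project_id)}",
        f"locationsId = {_literal(location_id)}",
    ]
    for name in OPTIONAL_LIST_PARAMS:
        if optional.get(name) is not None:
            predicates.append(f"{name} = {_literal(optional[name])}")
    return (
        "SELECT " + ", ".join(columns)
        + "\nFROM google.aiplatform.data_labeling_jobs"
        + "\nWHERE " + "\nAND ".join(predicates) + ";"
    )
```

Calling `build_list_query("my-project", "us-central1", pageSize=50)` yields a statement with the two required predicates plus `pageSize = 50`, and no predicate for the parameters left unset.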

`INSERT` examples

create
Manifest

Creates a DataLabelingJob.

INSERT INTO google.aiplatform.data_labeling_jobs (
data__displayName,
data__datasets,
data__annotationLabels,
data__labelerCount,
data__instructionUri,
data__inputsSchemaUri,
data__inputs,
data__labels,
data__specialistPools,
data__encryptionSpec,
data__activeLearningConfig,
projectsId,
locationsId
)
SELECT 
'{{ displayName }}',
'{{ datasets }}',
'{{ annotationLabels }}',
{{ labelerCount }},
'{{ instructionUri }}',
'{{ inputsSchemaUri }}',
'{{ inputs }}',
'{{ labels }}',
'{{ specialistPools }}',
'{{ encryptionSpec }}',
'{{ activeLearningConfig }}',
'{{ projectsId }}',
'{{ locationsId }}'
RETURNING
name,
activeLearningConfig,
annotationLabels,
createTime,
currentSpend,
datasets,
displayName,
encryptionSpec,
error,
inputs,
inputsSchemaUri,
instructionUri,
labelerCount,
labelingProgress,
labels,
specialistPools,
state,
updateTime
;
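
Before running the `INSERT`, the values can be checked against the constraints the field descriptions state: the fields marked "Required", the single-Dataset limit, the Dataset resource name format, and the 128-character display name limit. This is a sketch under those stated constraints only; `dataset_name` and `create_payload_problems` are hypothetical helpers, and the service may validate more than this.

```python
import re

# Fields whose descriptions begin with "Required."
REQUIRED_FIELDS = (
    "displayName", "datasets", "labelerCount",
    "instructionUri", "inputsSchemaUri", "inputs",
)
DATASET_PATTERN = re.compile(r"^projects/[^/]+/locations/[^/]+/datasets/[^/]+$")


def dataset_name(project, location, dataset):
    """Build a Dataset resource name in the documented format."""
    return f"projects/{project}/locations/{location}/datasets/{dataset}"


def create_payload_problems(payload):
    """Return a list of problems found in a create payload (empty = OK)."""
    problems = [
        f"missing required field: {field}"
        for field in REQUIRED_FIELDS
        if field not in payload
    ]
    datasets = payload.get("datasets", [])
    if "datasets" in payload and len(datasets) != 1:
        problems.append("datasets must contain exactly one resource name")
    for name in datasets:
        if not DATASET_PATTERN.match(name):
            problems.append(f"malformed dataset resource name: {name}")
    if len(payload.get("displayName", "")) > 128:
        problems.append("displayName is longer than 128 characters")
    return problems
```

A payload that returns an empty list satisfies these checks; the remaining fields (`labels`, `specialistPools`, `encryptionSpec`, `activeLearningConfig`, `annotationLabels`) are optional and are not inspected.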

# Manifest for the create method. Description fields are for documentation purposes only.
- name: data_labeling_jobs
  props:
    - name: projectsId
      value: string
      description: Required parameter for the data_labeling_jobs resource.
    - name: locationsId
      value: string
      description: Required parameter for the data_labeling_jobs resource.
    - name: displayName
      value: string
      description: >
        Required. The user-defined display name of the DataLabelingJob. The name can be up to 128 characters long and can consist of any UTF-8 characters.
        
    - name: datasets
      value: array
      description: >
        Required. Dataset resource names. Currently, only a single Dataset is supported. Format: `projects/{project}/locations/{location}/datasets/{dataset}`
        
    - name: annotationLabels
      value: object
      description: >
        Labels to assign to annotations generated by this DataLabelingJob. Label keys and values can be no longer than 64 characters (Unicode codepoints), can only contain lowercase letters, numeric characters, underscores and dashes. International characters are allowed. See https://goo.gl/xmQnxf for more information and examples of labels. System reserved label keys are prefixed with "aiplatform.googleapis.com/" and are immutable.
        
    - name: labelerCount
      value: integer
      description: >
        Required. Number of labelers to work on each DataItem.
        
    - name: instructionUri
      value: string
      description: >
        Required. The Google Cloud Storage location of the instruction PDF. This PDF is shared with labelers and provides a detailed description of how to label DataItems in Datasets.
        
    - name: inputsSchemaUri
      value: string
      description: >
        Required. Points to a YAML file stored on Google Cloud Storage describing the config for a specific type of DataLabelingJob. The schema files that can be used here are found in the https://storage.googleapis.com/google-cloud-aiplatform bucket in the /schema/datalabelingjob/inputs/ folder.
        
    - name: inputs
      value: any
      description: >
        Required. Input config parameters for the DataLabelingJob.
        
    - name: labels
      value: object
      description: >
        The labels with user-defined metadata to organize your DataLabelingJobs. Label keys and values can be no longer than 64 characters (Unicode codepoints), can only contain lowercase letters, numeric characters, underscores and dashes. International characters are allowed. See https://goo.gl/xmQnxf for more information and examples of labels. System reserved label keys are prefixed with "aiplatform.googleapis.com/" and are immutable. The following system labels exist for each DataLabelingJob: * "aiplatform.googleapis.com/schema": output only, its value is the inputs_schema's title.
        
    - name: specialistPools
      value: array
      description: >
        The SpecialistPools' resource names associated with this job.
        
    - name: encryptionSpec
      value: object
      description: >
        Customer-managed encryption key spec for a DataLabelingJob. If set, this DataLabelingJob will be secured by this key. Note: Annotations created in the DataLabelingJob are associated with the EncryptionSpec of the Dataset they are exported to.
        
    - name: activeLearningConfig
      value: object
      description: >
        Parameters that configure the active learning pipeline. Active learning will label the data incrementally via several iterations. For every iteration, it will select a batch of data based on the sampling strategy.
        

`DELETE` examples

delete

Deletes a DataLabelingJob.

DELETE FROM google.aiplatform.data_labeling_jobs
WHERE projectsId = '{{ projectsId }}' -- required
AND locationsId = '{{ locationsId }}' -- required
AND dataLabelingJobsId = '{{ dataLabelingJobsId }}' -- required
;

Lifecycle Methods

cancel

Cancels a DataLabelingJob. Success of cancellation is not guaranteed.

EXEC google.aiplatform.data_labeling_jobs.cancel
@projectsId='{{ projectsId }}', -- required
@locationsId='{{ locationsId }}', -- required
@dataLabelingJobsId='{{ dataLabelingJobsId }}' -- required
;
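
Since cancellation is not guaranteed to succeed, the job should be read back with `get` afterwards and its `state` inspected. The Python sketch below interprets such a row. `summarize_after_cancel` is a hypothetical helper, the row is assumed to be a dict with the `state` and `error` fields described above, and `message` is the standard message field of `GoogleRpcStatus`.

```python
def summarize_after_cancel(job):
    """Describe the outcome of a cancel request from a fetched job row."""
    state = job.get("state")
    if state == "JOB_STATE_CANCELLED":
        return "cancelled"
    if state == "JOB_STATE_FAILED":
        # error is populated only for failed or cancelled jobs.
        message = (job.get("error") or {}).get("message", "no message")
        return f"failed: {message}"
    return f"not cancelled (state={state})"
```

Any state other than `JOB_STATE_CANCELLED` means the cancel request has not taken effect, or did not take effect before the job finished.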
