job

Module to interact with jobs.

This module contains class to interact with a job in EMS data integration.

Typical usage example

job = data_pool.get_job(job_id)
job.name = "NEW_NAME"
job.update()

job.execute()

job.execute(
    transformation_ids=[transformation.id],
    data_model_execution_configurations=[
        DataModelExecutionConfiguration(
            data_model_execution_id=data_model_execution.id,
            tables=[t.id for t in data_model_execution.tables],
        )
    ],
)

job.delete()

Job ¶

Bases: JobTransport

Data job object to interact with data job specific data integration endpoints.

client `instance-attribute` `class-attribute` ¶

client: Client = Field(Ellipsis, exclude=True)

id `instance-attribute` ¶

id: str

Id of job.

data_pool_id `instance-attribute` ¶

data_pool_id: str

Id of data pool where job is located.

data_source_id `instance-attribute` ¶

data_source_id: typing.Optional[str]

Id of data connection where job is located.

name `instance-attribute` ¶

name: typing.Optional[str]

Name of job.

from_transport `classmethod` ¶

from_transport(client, job_transport)

Creates high-level job object from given JobTransport.

Parameters:

client (Client) –

Client to use to make API calls for given job.
job_transport (JobTransport) –

JobTransport object containing properties of job.

Returns:

Job –

A Job object with properties from transport and given client.

update ¶

update()

Pushes local changes of job to EMS and updates properties with response from EMS.

sync ¶

sync()

Syncs job properties with EMS.

delete ¶

delete()

Deletes job.

create_task ¶

create_task(name, task_type, description=None, **kwargs)

Creates new task with name in given data job.

Parameters:

name (str) –

Name of new task.
task_type (TaskType) –

Type of new task.
description (typing.Optional[str]) –

Description of new task.
**kwargs (typing.Any) –

Additional parameters set for NewTaskInstanceTransport object.

Returns:

Task –

A Task object for newly created task.

create_transformation ¶

create_transformation(name, description=None, **kwargs)

Creates new transformation task with name in given data job.

Parameters:

name (str) –

Name of new transformation task.
description (typing.Optional[str]) –

Description of new transformation task
**kwargs (typing.Any) –

Additional parameters set for NewTaskInstanceTransport object.

Returns:

Task –

A Task object for newly created transformation task.

Examples:

Create data job with transformation statement and execute it:

data_job = data_pool.create_job("PyCelonis Tutorial Job")

task = data_job.create_transformation(
    name="PyCelonis Tutorial Task",
    description="This is an example task"
)

task.update_statement(\"\"\"
    DROP TABLE IF EXISTS ACTIVITIES;
    CREATE TABLE ACTIVITIES (
        _CASE_KEY VARCHAR(100),
        ACTIVITY_EN VARCHAR(300)
    );
\"\"\")

data_job.execute()

create_extraction ¶

create_extraction(name, **kwargs)

Creates new extraction task with name in given data job.

Parameters:

name (str) –

Name of new extraction task.
**kwargs (typing.Any) –

Additional parameters set for NewTaskInstanceTransport object.

Returns:

Task –

A Task object for newly created extraction task.

get_task ¶

get_task(id_)

Gets task with given id.

Parameters:

id_ (str) –

Id of task.

Returns:

Task –

A Task object for task with given id.

get_tasks ¶

get_tasks()

Gets all tasks of given data job.

Returns:

CelonisCollection[Task] –

A list containing all tasks.

get_transformations ¶

get_transformations()

Gets all transformations of given data job.

Returns:

CelonisCollection[Task] –

A list containing all transformations.

get_extractions ¶

get_extractions()

Gets all extractions of given data job.

Returns:

CelonisCollection[Task] –

A list containing all extractions.

execute ¶

execute(
    transformation_ids=None,
    extraction_configurations=None,
    data_model_execution_configurations=None,
    mode=ExtractionMode.DELTA,
    wait=True,
    **kwargs
)

Executes job with given transformations, extractions, and data model executions.

Parameters:

transformation_ids (typing.Optional[typing.List[str]]) –

Ids of transformations to use. Default is None which executes all transformations.
extraction_configurations (typing.Optional[typing.List[ExtractionConfiguration]]) –

Extraction configurations to define which extractions to execute. Default is None which executes all extractions.
data_model_execution_configurations (typing.Optional[typing.List[DataModelExecutionConfiguration]]) –

Data model execution configurations to define which data model reloads to execute. Default is None which executes all data model reloads. If configuration is given it needs to contain all table ids to reload.
mode (ExtractionMode) –

Extraction mode. Default is DELTA.
wait (bool) –

If true, function only returns once data job has been executed and raises error if it fails. It does only wait until the job has finished and not until all data models have been reloaded. If false, function returns after triggering execution and does not raise errors in case it failed.
**kwargs (typing.Any) –

Additional parameters set for JobExecutionConfiguration object.

Examples:

Create data job with transformation statement and execute it:

data_job = data_pool.create_job("PyCelonis Tutorial Job")

task = data_job.create_transformation(
    name="PyCelonis Tutorial Task",
    description="This is an example task"
)

task.update_statement(\"\"\"
    DROP TABLE IF EXISTS ACTIVITIES;
    CREATE TABLE ACTIVITIES (
        _CASE_KEY VARCHAR(100),
        ACTIVITY_EN VARCHAR(300)
    );
\"\"\")

data_job.execute()

cancel_execution ¶

cancel_execution()

Cancels the execution of the job.

create_data_model_execution ¶

create_data_model_execution(
    data_model_id, table_ids=None, **kwargs
)

Creates data model execution in given data job.

Parameters:

data_model_id (str) –

Id of data model to reload.
table_ids (typing.Optional[typing.List[str]]) –

Ids of tables to reload. Defaults to full load.
**kwargs (typing.Any) –

Additional parameters set for DataModelExecutionTransport object.

Returns:

DataModelExecution –

A DataModelExecution object for newly created data model execution.

get_data_model_execution ¶

get_data_model_execution(id_)

Gets data model execution with given id.

Parameters:

id_ (typing.Optional[str]) –

Id of data model execution.

Returns:

DataModelExecution –

A DataModelExecution object for data model execution with given id.

get_data_model_executions ¶

get_data_model_executions()

Gets all data model executions of given data job.

Returns:

CelonisCollection[DataModelExecution] –

A list containing all data model executions.

job

Job ¶

client instance-attribute class-attribute ¶

id instance-attribute ¶

data_pool_id instance-attribute ¶

data_source_id instance-attribute ¶

name instance-attribute ¶

from_transport classmethod ¶

update ¶

sync ¶

delete ¶

create_task ¶

create_transformation ¶

create_extraction ¶

get_task ¶

get_tasks ¶

get_transformations ¶

get_extractions ¶

execute ¶

cancel_execution ¶

create_data_model_execution ¶

get_data_model_execution ¶

get_data_model_executions ¶

client `instance-attribute` `class-attribute` ¶

id `instance-attribute` ¶

data_pool_id `instance-attribute` ¶

data_source_id `instance-attribute` ¶

name `instance-attribute` ¶

from_transport `classmethod` ¶