bauplan.schema

`class`APIError

bauplan.schema.APIError(
    *,
    code: int,
    type: str,
    message: str,
    context: dict[str,
    typing.Any]
)-> None

`class`APIMetadata

bauplan.schema.APIMetadata(
    *,
    status_code: int,
    ref: Optional[Annotated[Union[bauplan.schema.Branch,
    bauplan.schema.Tag,
    bauplan.schema.DetachedRef],
    FieldInfo(
    annotation=NoneType,
    required=True,
    discriminator='type'
)]] = None,
    username: Optional[str] = None,
    error: Optional[str] = None,
    pagination_token: Optional[str] = None,
    request_id: str,
    request_ts: int,
    request_ms: int
)-> None

`class`APIResponse

bauplan.schema.APIResponse(
    *,
    metadata: bauplan.schema.APIMetadata,
    ref: Optional[Annotated[Union[bauplan.schema.Branch,
    bauplan.schema.Tag,
    bauplan.schema.DetachedRef],
    FieldInfo(
    annotation=NoneType,
    required=True,
    discriminator='type'
)]] = None
)-> None

`class`APIResponseWithData

bauplan.schema.APIResponseWithData(
    *,
    metadata: bauplan.schema.APIMetadata,
    ref: Optional[Annotated[Union[bauplan.schema.Branch,
    bauplan.schema.Tag,
    bauplan.schema.DetachedRef],
    FieldInfo(
    annotation=NoneType,
    required=True,
    discriminator='type'
)]] = None,
    data: Any
)-> None

Bases: APIResponse

`class`APIResponseWithError

bauplan.schema.APIResponseWithError(
    *,
    metadata: bauplan.schema.APIMetadata,
    ref: Optional[Annotated[Union[bauplan.schema.Branch,
    bauplan.schema.Tag,
    bauplan.schema.DetachedRef],
    FieldInfo(
    annotation=NoneType,
    required=True,
    discriminator='type'
)]] = None,
    error: bauplan.schema.APIError
)-> None

Bases: APIResponse

`class`Actor

bauplan.schema.Actor(
    *,
    name: str,
    email: str | None
)-> None

`class`Branch

bauplan.schema.Branch(
    *,
    name: str,
    hash: str | None = None,
    type: Literal['BRANCH'] = 'BRANCH'
)-> None

Bases: Ref

`class`CacheDir

bauplan.schema.CacheDir(
    *,
    dirpath: pathlib.Path
)-> None

EXPERIMENTAL AND SUBJECT TO CHANGE.

CacheDir is a model for a standard bauplan directory ($HOME/.bauplan) for caching of files on the local filesystem. This is partially a convenience interface for other models such as JobContext, and partially a convenience for the user to easily clean up any cache files they no longer want (or a previous process failed to clean up).

`def` cleanup

Remove the temporary cache directory and its contents.

`def` clear_job_cache

Remove all directories with the '.job_snapshot' prefix from the bauplan cache.

Parameters

`def` save

Make the temporary cache directory persistent, preventing automatic cleanup.

`class`Commit

A commit is a record of a change in the data lake.

Attributes

`class`DAGEdge

bauplan.schema.DAGEdge(
    *,
    source_model: Optional[str],
    destination_model: str
)-> None

A dependency between DAGNode instances, representing dataflow.

`class`DAGNode

bauplan.schema.DAGNode(
    *,
    id: str,
    name: str
)-> None

A bauplan function that produces a Model.

Attributes

`class`DetachedRef

bauplan.schema.DetachedRef(
    *,
    name: str,
    hash: str | None = None,
    type: Literal['DETACHED'] = 'DETACHED'
)-> None

Bases: Ref

`class`Entry

bauplan.schema.Entry(
    *,
    name: str,
    namespace: str,
    kind: bauplan.schema.EntryType
)-> None

`class`EntryType

`class`GetBranchesResponse

An Iterable containing Branch objects returned by get_branches method.

Example:

response = client.get_branches()
for branch in response:
    print(branch.name)

`class`GetCommitsResponse

An Iterable containing Commit objects returned by get_commits method.

`class`GetNamespacesResponse

`class`GetTablesResponse

An Iterable containing TableWithMetadata objects returned by get_tables method.

Example:

response = client.get_tables(namespace='my_namespace', ref='main')
for table in response:
    print(table.name, table.records)

`class`GetTagsResponse

`class`Job

bauplan.schema.Job(
    *,
    id: str,
    kind: Union[str,
    bauplan.schema.JobKind],
    user: str,
    human_readable_status: str,
    created_at: Optional[datetime.datetime],
    finished_at: Optional[datetime.datetime],
    status: bauplan.schema.JobState
)-> None

EXPERIMENTAL AND SUBJECT TO CHANGE.

Job is a model for a job in the Bauplan system. It is tracked as a result of a code snapshot run.

`def` finished_after

Check if the job finished within the given timedelta from now.

Parameters

`def` finished_before

Check if the job finished before the given timedelta from now.

Parameters

`def` finished_between

Check if the job finished between two datetimes.

Parameters

`def` from_proto

Parameters

`def` has_finished_range

Parameters

`def` has_id

Check if the job has the specified ID or ID prefix.

Parameters

`def` has_started_range

Check if the job started within the specified time range.

Parameters

`def` has_status

Check if the job has specified status.

Parameters

`def` started_after

Check if the job started after the given datetime.

Parameters

`def` started_before

Check if the job started before the given datetime.

Parameters

`def` started_between

Check if the job started between two datetimes.

Parameters

`class`JobContext

bauplan.schema.JobContext(
    *,
    id: str,
    project_id: Optional[str],
    project_name: Optional[str],
    ref: Optional[bauplan.schema.Ref],
    tx_ref: Optional[bauplan.schema.Ref],
    logs: List[bauplan.schema.JobLogEvent],
    dag_nodes: List[bauplan.schema.DAGNode],
    dag_edges: List[bauplan.schema.DAGEdge],
    snapshot_dict: Dict[str,
    str],
    snapshot_dirpath: Optional[pathlib.Path]
)-> None

EXPERIMENTAL AND SUBJECT TO CHANGE.

JobContext is a model for immediate working context of a particular job. This currently includes: (1) Ref, (2) Code Snapshot, (3) Logs. A JobContext should enable a variety of workflows for iterating on an existing Job.

`def` cleanup_cache

Clean up the cache directory if it exists.

`def` save_cache

Save the cache directory if it exists.

`class`JobKind

Models a job's "kind" or job type. May be one of: UNSPECIFIED, CODE_SNAPSHOT_RUN, QUERY, IMPORT_PLAN_CREATE, IMPORT_PLAN_APPLY, TABLE_PLAN_CREATE, TABLE_PLAN_CREATE_APPLY, or TABLE_IMPORT.

`class`JobLogEvent

bauplan.schema.JobLogEvent(
    *,
    stream: Optional[bauplan.schema.JobLogStream],
    level: Optional[bauplan.schema.JobLogLevel],
    message: str
)-> None

EXPERIMENTAL AND SUBJECT TO CHANGE.

JobLogEvent is a model for a particular log message from a particular job.

When you output logs within a Python model, they are persisted as JobLogEvents.

`class`JobLogLevel

`class`JobLogList

bauplan.schema.JobLogList(
    *,
    events: List[bauplan.schema.JobLogEvent]
)-> None

EXPERIMENTAL AND SUBJECT TO CHANGE.

JobLogList is a model for all of the logs from a particular job. This model is primarily provided as a convenience for "common" interactions with a job's log messages.

`def` error_messages

`class`JobLogStream

`class`JobState

`class`Namespace

bauplan.schema.Namespace(
    *,
    name: str,
    ref: Optional[Annotated[Union[bauplan.schema.Branch,
    bauplan.schema.Tag,
    bauplan.schema.DetachedRef],
    FieldInfo(
    annotation=NoneType,
    required=True,
    discriminator='type'
)]] = None
)-> None

`class`PartitionField

bauplan.schema.PartitionField(
    *,
    name: str,
    transform: str
)-> None

`class`Ref

bauplan.schema.Ref(
    *,
    name: str,
    hash: str | None = None,
    type: str | None = None
)-> None

A branch or a tag

Examples:

ref = Ref(name='main', hash='abc123')

Attributes

`def` from_dict

Parameters

`def` from_string

Parameters

`class`Table

bauplan.schema.Table(
    *,
    name: str,
    namespace: str,
    kind: bauplan.schema.EntryType = <EntryType.TABLE: 'TABLE'>
)-> None

Bases: Entry

`def` is_external

`def` is_managed

`class`TableField

bauplan.schema.TableField(
    *,
    id: int,
    name: str,
    required: bool,
    type: str
)-> None

`class`TableWithMetadata

bauplan.schema.TableWithMetadata(
    *,
    name: str,
    namespace: str,
    kind: bauplan.schema.EntryType = <EntryType.TABLE: 'TABLE'>,
    id: str,
    records: Optional[int],
    size: Optional[int],
    last_updated_ms: int,
    fields: List[bauplan.schema.TableField],
    snapshots: Optional[int],
    partitions: List[bauplan.schema.PartitionField],
    metadata_location: str,
    current_snapshot_id: Optional[int],
    current_schema_id: Optional[int],
    properties: Optional[Dict[str,
    str]],
    raw: Optional[Dict]
)-> None

Bases: Table

`class`Tag

bauplan.schema.Tag(
    *,
    name: str,
    hash: str | None = None,
    type: Literal['TAG'] = 'TAG'
)-> None

Bases: Ref

classAPIError

classAPIMetadata

classAPIResponse

classAPIResponseWithData

classAPIResponseWithError

classActor

classBranch

classCacheDir

def cleanup

def clear_job_cache

Parameters

def save

classCommit

Attributes

classDAGEdge

classDAGNode

Attributes

classDetachedRef

classEntry

classEntryType

classGetBranchesResponse

Example:​

classGetCommitsResponse

classGetNamespacesResponse

classGetTablesResponse

Example:​

classGetTagsResponse

classJob

def finished_after

Parameters

def finished_before

Parameters

def finished_between

Parameters

def from_proto

Parameters

def has_finished_range

Parameters

def has_id

Parameters

def has_started_range

Parameters

def has_status

Parameters

def started_after

Parameters

def started_before

Parameters

def started_between

Parameters

classJobContext

def cleanup_cache

def save_cache

classJobKind

classJobLogEvent

classJobLogLevel

classJobLogList

def error_messages

classJobLogStream

classJobState

classNamespace

classPartitionField

classRef

Examples:​

Attributes

def from_dict

Parameters

def from_string

Parameters

classTable

def is_external

def is_managed

classTableField

classTableWithMetadata

classTag

def proto_datetime_to_py_datetime

Parameters

`class`APIError

`class`APIMetadata

`class`APIResponse

`class`APIResponseWithData

`class`APIResponseWithError

`class`Actor

`class`Branch

`class`CacheDir

`def` cleanup

`def` clear_job_cache

`def` save

`class`Commit

`class`DAGEdge

`class`DAGNode

`class`DetachedRef

`class`Entry

`class`EntryType

`class`GetBranchesResponse

Example:

`class`GetCommitsResponse

`class`GetNamespacesResponse

`class`GetTablesResponse

Example:

`class`GetTagsResponse

`class`Job

`def` finished_after

`def` finished_before

`def` finished_between

`def` from_proto

`def` has_finished_range

`def` has_id

`def` has_started_range

`def` has_status

`def` started_after

`def` started_before

`def` started_between

`class`JobContext

`def` cleanup_cache

`def` save_cache

`class`JobKind

`class`JobLogEvent

`class`JobLogLevel

`class`JobLogList

`def` error_messages

`class`JobLogStream

`class`JobState

`class`Namespace

`class`PartitionField

`class`Ref

Examples:

`def` from_dict

`def` from_string

`class`Table

`def` is_external

`def` is_managed

`class`TableField

`class`TableWithMetadata

`class`Tag

`def` proto_datetime_to_py_datetime