Vacuum is an orchestration component that performs a vacuum operation on a list of tables. Vacuum is a housekeeping task that physically reorganizes table data according to its sort key, and reclaims space left over from deleted rows. Vacuum is almost always used at the end of an orchestration pipeline. For more information about the vacuum process, read:

Properties

Name
string
required
A human-readable name for the component.
Catalog
drop-down
required
Select a Databricks Unity Catalog. The special value [Environment Default] uses the catalog defined in the environment. The selected catalog determines which schemas (databases) are available in the next parameter.
Schema (Database)
drop-down
required
The Databricks schema. The special value [Environment Default] uses the schema defined in the environment. Read Create and manage schemas to learn more.
Tables to Vacuum
dual listbox
required
Select which tables to vacuum.
Retention Period
integer
required
The retention threshold, measured in the unit selected in Retention Unit. The default is 7.
Retention Unit
drop-down
required
Select the unit of the Retention Period. Options are Day, Hour, or Week. The default is Day.
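How Retention Period and Retention Unit combine can be sketched in Python. This page does not describe how the component runs the operation, so the sketch assumes it corresponds to the Databricks SQL `VACUUM ... RETAIN n HOURS` statement, which expresses retention in hours. The helper names `retention_hours` and `vacuum_statements`, and the example catalog, schema, and table names, are hypothetical.

```python
# Illustrative sketch only: assumes the properties map onto the
# Databricks SQL statement `VACUUM <table> RETAIN <n> HOURS`.
# Helper names and example identifiers are hypothetical.

HOURS_PER_UNIT = {"Hour": 1, "Day": 24, "Week": 168}


def retention_hours(period: int, unit: str = "Day") -> int:
    """Convert Retention Period and Retention Unit into hours."""
    if unit not in HOURS_PER_UNIT:
        raise ValueError(f"Unknown retention unit: {unit!r}")
    if period < 0:
        raise ValueError("Retention period must not be negative")
    return period * HOURS_PER_UNIT[unit]


def vacuum_statements(catalog, schema, tables, period=7, unit="Day"):
    """Build one VACUUM statement per selected table."""
    hours = retention_hours(period, unit)
    return [
        f"VACUUM `{catalog}`.`{schema}`.`{table}` RETAIN {hours} HOURS"
        for table in tables
    ]


if __name__ == "__main__":
    # Component defaults: Retention Period 7, Retention Unit Day.
    for statement in vacuum_statements("main", "sales", ["orders", "customers"]):
        print(statement)
```

With the component defaults (7 and Day), the sketch yields a retention of 168 hours for each selected table.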