Vacuum is an orchestration component that performs a vacuum operation on a list of tables. Vacuum is a housekeeping task that physically reorganizes table data according to its sort key, and reclaims space left over from deleted rows. Vacuum is almost always used at the end of an orchestration pipeline. For more information about the vacuum process, read:Documentation Index
Fetch the complete documentation index at: https://docs.maia.ai/llms.txt
Use this file to discover all available pages before exploring further.
- Databricks VACUUM documentation.
- AWS VACUUM documentation.
Properties
- Databricks
- Amazon Redshift
A human-readable name for the component.
Select a Databricks Unity Catalog. The special value
[Environment Default] uses the catalog defined in the environment. Selecting a catalog will determine which databases are available in the next parameter.The Databricks schema. The special value
[Environment Default] uses the schema defined in the environment. Read Create and manage schemas to learn more.Select which tables to vacuum.
The retention threshold. The default is
7, with the unit specified in Retention Unit.Select the unit of the Retention Period. Options are Day, Hour, or Week. The default is Day.
