Properties
A human-readable name for the component.
The file location to load the data from.Files in this location must have the specified
FILEFORMAT. Accepted encryption options are TYPE = 'AWS_SSE_C', and MASTER_KEY for AWS S3.Files in the specified location will only be loaded if their names match the pattern you specify here. You can use wildcards in the pattern. Enter
.* to match all files in the location.Select a Databricks Unity Catalog. The special value
[Environment Default] uses the catalog defined in the environment. Selecting a catalog will determine which databases are available in the next parameter.You must have appropriate permissions and access rights to the source volume and destination Delta table.
The Databricks schema. The special value
[Environment Default] uses the schema defined in the environment. Read Create and manage schemas to learn more.The name of the Delta table. This table will be recreated and will drop any existing table of the same name.
Choose the columns to load. If you leave this parameter empty, all columns will be loaded.
The format of the source files to load. Available file types are CSV, JSON, PARQUET, and AVRO.Component properties will change to reflect the selected file type. Click one of the tabs below for properties applicable to that file type.
- CSV
- JSON
- PARQUET
- AVRO
Select Yes to use the first line of the file as column names. If not specified, the default is No.
Enter the delimiter character used to separate fields in the CSV file. This can be one or more single-byte or multibyte characters that separate fields in an input file. If none is specified, the default is a comma.Accepted characters include common escape sequences, octal values (prefixed by
\\), or hex values (prefixed by 0x). This delimiter is limited to a maximum of 20 characters. The specified delimiter must be a valid UTF-8 character and not a random sequence of bytes.A TAB character can be specified as "".
Manually set a date format in the data files to be loaded. If none is specified, the default is
yyyy-MM-dd.Manually set a timestamp format in the CSV files to be loaded. If none is specified, the default is
yyyy-MM-dd'T'HH:mm:ss.[SSS][XXX].The encoding type to use when decoding the CSV files. If none is specified, the default is
UTF-8.When Yes, skips any leading whitespaces. If not specified, the default is No.
When Yes, skips any trailing whitespaces. If not specified, the default is No.
If Yes, will attempt to determine the input schema automatically from the data contained in the CSV file. If not specified, the default is No.
If Yes, will parse records which may span multiple lines. If not specified, the default is No.
Sets the string representation of a null value. If not specified, the default value is an empty string.
Sets the string representation of an empty value. If not specified, the default value is an empty string.
If Yes, partition inference is disabled. If not specified, the default is No. To control which files are loaded, use the Pattern property instead.
If Yes, files are loaded regardless of whether they’ve been loaded before. If not specified, the default is No.

