- Snowflake types include: VARCHAR, NUMBER, FLOAT, BOOLEAN, DATE, TIMESTAMP, TIME, and VARIANT. The use of these data types is detailed in the Snowflake documentation.
- Databricks types include: INTEGER, NUMBER, FLOAT, TEXT, TIMESTAMP, DATE, BOOLEAN, and BINARY. The use of these data types is detailed in the Databricks documentation.
- Amazon Redshift types include: TEXT, INTEGER, NUMERIC, REAL, DOUBLE PRECISION, BOOLEAN, DATE, DATETIME, and SUPER. The use of these data types is detailed in the Amazon Redshift documentation.
- When appropriate, values are first rounded to the requested decimal places before being cast to the requested size.
- Users may experience casting errors if using binary values.
- Although syntax is checked at validation time, runtime errors may occur during type conversion if the input data cannot fit into the requested target type.
Use case
This component is useful when you need to ensure that the data types in your dataset match the requirements of your cloud data warehouse or when you want to standardize data types across different datasets. For example, you might use this component to:- Standardize data types across multiple datasets for auditing and regulation.
- Align mismatched types across disparate systems for downstream join or merge operations.
- Convert string representations of dates or numbers into their respective date or numeric types for accurate calculations and comparisons.
Properties
A human-readable name for the component.
Enter the following details for each table column:
- Column: The name of the column to convert.
- Type: The type to cast the column to. Choose one of the cloud data platform tabs below for data types applicable to that platform.
- Size: For Text types, this is the maximum length. This is a limit on the number of bytes, not characters. For Numeric types, this is the total number of digits allowed, whether before or after the decimal point.
- Precision: The precision of the data in the column. Will be 0 (zero) for non-applicable types.
- Format: (Amazon Redshift only) The DateTime format. This is only required if Type is set to
DATEorDATETIME. If your input column isTEXTand you want to convert toDATEorDATETIME, specify the input format of the input column. For an exhaustive list of possible formats, read the Amazon Redshift documentation.
- Snowflake
- Databricks
- Amazon Redshift
| Type | Description |
|---|---|
| Varchar | Suitable for numbers and letters. A varchar or Variable Character Field is a set of character data of indeterminate length. |
| Number | Suitable for numeric types, with or without decimals. |
| Float | Approximate numeric values with fractional components. |
| Boolean | Suitable for data that is either “true” or “false”, or “1” or “0”, respectively. |
| Date | Suitable for dates without times. |
| Timestamp | A timestamp left unformatted (exists as Unix/Epoch Time). |
| Time | Suitable for time, independent of a specific date and timezone. |
| Variant | A tagged universal type that can hold up to 16 MB of any data type supported by Snowflake. |

