About datasets
A dataset specifies a collection of data to be uploaded into the Precisely Data Integrity Suite Data Quality. It provides the source data that is loaded into Data Integrity Suite and transformed by pipelines.
A dataset is presented in a tabular format. Each column corresponds to a field in the source data. Each row contains the values read from the source data for those fields, such as first name, last name, address, postal code, phone, membership date, and so forth. Each field in a dataset is characterized by a field name, a data type, and a semantic type.
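To make the tabular model concrete, here is a minimal sketch in Python. It is not Data Integrity Suite code or its API. The `Field` class, the `column` helper, and the sample type labels are hypothetical names chosen only to illustrate how fields (columns) and rows relate.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Field:
    """Hypothetical description of one dataset column."""
    name: str           # field name
    data_type: str      # for example "string" or "date" (labels are assumptions)
    semantic_type: str  # for example "postal code" (labels are assumptions)


# Each Field describes one column of the dataset.
fields = [
    Field("first_name", "string", "first name"),
    Field("postal_code", "string", "postal code"),
    Field("membership_date", "date", "date"),
]

# Each row holds one value per field, as read from the source data.
rows = [
    {"first_name": "Ada", "postal_code": "12345", "membership_date": "2021-03-15"},
    {"first_name": "Lin", "postal_code": "67890", "membership_date": "2022-11-02"},
]


def column(rows, field):
    """Return every value of one field, top to bottom."""
    return [row[field.name] for row in rows]


print(column(rows, fields[1]))
```

Reading down a column returns all values for one field, while each dictionary in `rows` corresponds to one row of the table.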
Data Settings can be configured to match the character encoding, field delimiter, text qualifier, and line separator in a particular data source. Data type formats for a dataset can be configured to recognize the numeric and date-time data types for data fields. These characterizations are used to suggest transformations in pipelines associated with a dataset.
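The effect of these settings can be illustrated outside the product with Python's standard `csv` module. This is only a sketch under stated assumptions: the `settings` dictionary, its keys, the `read_rows` function, and the sample data are hypothetical and do not reflect how Data Integrity Suite stores or applies its configuration.

```python
import csv
import io
from datetime import datetime

# Sample source bytes: UTF-8, semicolon delimiter, single-quote text
# qualifier, and CRLF line separator. All values are made up.
raw_bytes = (
    "name;joined;note\r\n"
    "'Ada';'15/03/2021';'likes; semicolons'\r\n"
).encode("utf-8")

# Hypothetical settings that mirror the options described above.
settings = {
    "encoding": "utf-8",
    "delimiter": ";",
    "quotechar": "'",
    "date_format": "%d/%m/%Y",
}


def read_rows(data, settings):
    """Decode and parse delimited data according to the given settings."""
    text = data.decode(settings["encoding"])
    # newline="" leaves line endings untouched; csv.reader recognizes
    # CRLF and LF on its own, so no line-separator option is passed here.
    reader = csv.DictReader(
        io.StringIO(text, newline=""),
        delimiter=settings["delimiter"],
        quotechar=settings["quotechar"],
    )
    rows = []
    for row in reader:
        # Apply the date format so the text value becomes a real date.
        row["joined"] = datetime.strptime(
            row["joined"], settings["date_format"]
        ).date()
        rows.append(row)
    return rows


for row in read_rows(raw_bytes, settings):
    print(row)
```

Because the text qualifier matches the source, the semicolon inside `'likes; semicolons'` stays part of the value instead of splitting the field. Similarly, the date format decides whether `15/03/2021` is read as 15 March rather than rejected or misread.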