Identify the data
Data Dictionaries & Patterns ..
Last updated
Data Dictionaries & Patterns ..
Last updated
The data identification process efficiently classifies data by leveraging dictionaries and data pattern analysis. This methodology enables the automatic tagging of data based on predefined criteria in dictionary and pattern configuration files.
While Data Catalog comes equipped with a comprehensive set of dictionaries and patterns, it also offers flexibility by allowing users to create custom dictionaries and pattern analysis configurations. This customization ensures that the data identification process can be tailored to meet the specific requirements of any organization.
Data Dictionaries
A data dictionary in a data catalog is a collection of predefined terms and definitions that help classify and tag data within an organization's datasets. It serves as a reference guide for data terms, helping users understand the meaning, usage, and context of each data element.
By leveraging a data dictionary, organizations can ensure consistency, accuracy, and easier data identification and management across different data sets. Custom dictionaries can also be created to meet specific organizational needs.
Let's run through an example: Marital_Status.
Navigate to the 'Management' tile & click on: Dictionaries.
Search for: Marital_Status
When the data is profiled, in our example: 'marital_status' the value is compared, using a rule, against (with a degree of confidence) the predefined dictionary.
Once matched: Tags - PII, Marital Status, Non-Sensitive are then applied.
Click on the > to View Dictionary
Next -> 1.2 Rules