Data Sources
Connect to a data source ..
Adventure Works 2022 contains approximately 70 tables organized into multiple schemas representing different functional areas of the business, with around 20,000 customers, over 70,000 orders, and 500 products.
The database contains 486 columns that require classification, making it ideal for demonstrating data governance and classification processes:
Personal Data Identification and Classification: The database contains various types of sensitive data including personal information in tables like Person.Person and HumanResources.Employee, with data such as names, addresses, contact information, dates of birth, and even employee resumes that could contain multiple types of personal data.
Data Sensitivity Categorization: Using Pentaho Data Catalog (PDC), the Adventure Works database demonstrates how to perform automated data classification, which categorizes columns into sensitivity levels such as Confidential and Highly Confidential and assigns appropriate information types based on content.
Regulatory Reporting and Audit Trails: The comprehensive business structure of Adventure Works, spanning sales, human resources, and production data, provides an excellent framework for demonstrating how data catalogs support regulatory reporting requirements.
Risk Assessment and Data Governance: The database allows data governance teams to quantify data risk and develop processes for data masking in non-production environments, which is a critical compliance requirement for protecting sensitive data in development and testing scenarios.
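To make the classification and risk-quantification ideas above concrete, here is a minimal Python sketch of rule-based column classification. It is not how PDC works internally: the `RULES` patterns, the `classify` helper, and the sensitivity level chosen for each pattern are illustrative assumptions, and a real catalog would also profile column content rather than rely on names alone. The column names come from the Adventure Works sample schema.

```python
import re
from collections import Counter

# Hypothetical name-based rules (an assumption for illustration only).
# Rules are checked in order, so the stricter level comes first.
RULES = [
    (re.compile(r"nationalid|birthdate|password|cardnumber|resume", re.I),
     "Highly Confidential"),
    (re.compile(r"firstname|lastname|email|phone|addressline|postalcode", re.I),
     "Confidential"),
]


def classify(column_name: str) -> str:
    """Return the first matching sensitivity level, or 'Unclassified'."""
    for pattern, level in RULES:
        if pattern.search(column_name):
            return level
    return "Unclassified"


# A small subset of Adventure Works columns, written as schema.table.column.
COLUMNS = [
    "Person.Person.FirstName",
    "Person.Person.LastName",
    "Person.EmailAddress.EmailAddress",
    "HumanResources.Employee.BirthDate",
    "HumanResources.Employee.NationalIDNumber",
    "HumanResources.JobCandidate.Resume",
    "Production.Product.ListPrice",
]

# Classify on the column name only (the last dotted segment).
labels = {col: classify(col.rsplit(".", 1)[-1]) for col in COLUMNS}

# Counting columns per level gives a rough first measure of data risk.
summary = Counter(labels.values())

if __name__ == "__main__":
    for col, level in labels.items():
        print(f"{col}: {level}")
    print(dict(summary))
```

Columns that land in Confidential or Highly Confidential would be the natural candidates for masking before data is copied into development or test environments.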