Data Discovery
Overview
Data Discovery establishes the critical foundation for implementing Pentaho Data Catalog (PDC) with Adventure Works database. The workshop systematically transforms raw data discovery into actionable governance, ensuring regulatory compliance (GDPR, SOX, CCPA) while enabling secure, role-based data access.
Through six structured sessions, you will create:
a complete data asset inventory, classify sensitive information, map organizational access requirements, and design automated compliance controls that reduce manual governance overhead by an estimated 75%.
The results deliver immediate business value through proactive risk mitigation and audit readiness. By identifying 47 sensitive data elements across Adventure Works' 71 tables and mapping them to specific regulatory requirements, organizations can avoid potential GDPR fines of up to 4% of global revenue and SOX compliance violations that carry criminal liability for executives.
The structured approach ensures that all 19,972 person records and 31,465 financial transactions are properly classified and protected according to their risk profile and business usage patterns.
The final deliverable is a complete implementation roadmap with Keycloak group hierarchies, role-based attributes, and PDC Community assignments that directly translate discovery findings into operational data governance. This foundation enables automated segregation of duties for SOX compliance, purpose limitation for GDPR requirements, and principle of least privilege access controls - all while maintaining business operational efficiency and user productivity.
Last updated
Was this helpful?