Dashboard

Data Discovery Dashboard

  1. Start the Dashboard server:

cd
cd ~/Projects/Data_Discovery

# Activate virtual environment
source venv/bin/activate

# Start dashboard server
python -m data_discovery.dashboard.server
  1. Log into Dashboard:

Executive Overview

This AdventureWorks2022 database executive dashboard provides a comprehensive analysis of data discovery, classification, and risk assessment status. The system has analyzed 88 total tables containing 723 columns, with 225 classified columns identified. The database maintains a 77% trust score with a "Good" governance level, though the overall risk level is marked as "HIGH," indicating significant security concerns requiring attention.

  1. Click on the Executive Overview report:

Executive Overview
  1. Each Chart, Graph & Table has an info: 'About this ..' button for further details

Key Insights & Recommendations

Schema Distribution & Data Classification

The schema distribution chart shows Production and Sales schemas containing the highest concentration of tables (approximately 28 and 26 tables respectively), followed by Person (15 tables), HumanResources (12 tables), and Purchasing (7 tables). The classification summary reveals diverse sensitive data categories, with Technical data comprising the largest segment, followed by Personal Name, Address Info, Business, Financial, Contact Info, Operational, Biometric, Compliance, and Security classifications, demonstrating the presence of multiple types of sensitive information requiring protection.

Critical Insights & Risk Indicators

The analysis of 723 columns across 88 tables reveals comprehensive data coverage of 95%+ for governance assessment. A significant PII exposure concern has been identified, with 188 columns containing personally identifiable information requiring enhanced data protection and access controls. Most critically, the security risk assessment has flagged 124 columns as high risk, necessitating immediate security controls and monitoring implementation to mitigate potential data breaches or unauthorized access.

Trust Score

The trust score breakdown shows DAMA-DMBOK standards achieving excellent ratings (85+, Good 70-84, Fair 55-69, Poor <55), indicating strong adherence to data management best practices. The system prioritizes three key recommendations: implementing data protection with masking for PII columns in non-production environments (High Priority), establishing role-based access controls for sensitive data schemas (Medium Priority), and deploying automated data quality monitoring for critical business tables (Standard). These actions are essential to address the HIGH overall risk level while maintaining operational efficiency.

Last updated

Was this helpful?