Sensitivity Level & Trust Scores

Sensitivity Level & Trust Score

Dataflow

The solution consists of three main components:

  1. Entity Extraction Tool - Extracts all entities with hierarchical names from OpenSearch

  2. Pentaho Data Integration - Joins your calculated values with entity data

  3. Bulk Update Tool - Updates Trust Score and Sensitivity via API or OpenSearch

Expected Outcomes

  • Automated bulk updates of Trust Score (0-100) and Sensitivity (HIGH/MEDIUM/LOW)

  • Support for schema, table, and column level updates

  • Validation and error reporting

  • Scalable solution for thousands of entities


x

Entity Extraction

The extraction process retrieves all entities from your data catalog with their hierarchical relationships intact.

What Gets Extracted:

  • Entity unique identifiers (UUIDs)

  • Entity types (SCHEMA/TABLE/COLUMN)

  • Hierarchical names for joining

  • Current Trust Score and Sensitivity values

  • Fully qualified domain names (FQDNs)

Learning Objectives:

  • Understand the entity extraction process

  • Extract all entities with hierarchical names from your data catalog

  • Analyze the extracted data structure

  • Prepare data for joining with calculated metrics

  1. Run the extraction script:

Expected output:

  1. Take a look at the entity_extraction.csv

Column
Description
Example

entity_id

Unique identifier

ef60e629-4261-4ce6-8635-961ca4b1b420

entity_type

Type of entity

SCHEMA, TABLE, COLUMN

entity_name

Entity's actual name

Employee

schema_name

Schema name for joining

HumanResources

table_name

Table name (empty for schemas)

Employee

column_name

Column name (empty for schemas/tables)

FirstName

fqdn

Internal fully qualified name

688cc7b9c5759eae5fdcba07/...

fqdn_display

Human-readable path

mssql:adventureworks2022/...

current_trust_score

Existing trust score

48

current_sensitivity

Existing sensitivity

HIGH

new_trust_score

For your calculated values

(empty)

new_sensitivity

For your calculated values

(empty)

  1. Run a Data Quality Analysis

  1. x

  2. x

  3. x

3. Run

Run Complete Extraction

Last updated

Was this helpful?