Uniqueness Rules

Uniqueness Rules

x


Accessing your Catalog

To get started using the Data Catalog, log in using the address and credentials provided by your Data Catalog service user or administrator.

To access your catalog, please follow these steps:

  1. Open Google Chrome web browser.

  2. Navigate to:

  1. Enter following email and password, then click Sign In.

Username: [email protected] (mapped to Business Steward role)

Password: Welcome123!


x

x

Duplicate Customer Detection

Scenario: Identify potential duplicate customer records based on name and contact info.

Business Rule Configuration:

Rule Name: Customer_Duplicate_Detection
Description: Identifies potential duplicate customer records
Data Quality Dimension: Uniqueness
Schedule: Weekly
Target: Person.Person

SQL Query:

WITH DuplicateCandidates AS (
    SELECT 
        FirstName, 
        LastName, 
        EmailAddress,
        COUNT(*) AS DupCount
    FROM Person.Person p
    JOIN Person.EmailAddress e ON p.BusinessEntityID = e.BusinessEntityID
    GROUP BY FirstName, LastName, EmailAddress
    HAVING COUNT(*) > 1
)
SELECT 
    (SELECT COUNT(DISTINCT BusinessEntityID) FROM Person.Person) AS total_count,
    (SELECT COUNT(DISTINCT BusinessEntityID) FROM Person.Person) - 
        (SELECT SUM(DupCount - 1) FROM DuplicateCandidates) AS scopeCount,
    (SELECT SUM(DupCount - 1) FROM DuplicateCandidates) AS nonCompliant

x

x

x

x

x

Last updated

Was this helpful?