Connect AW Database

Connect to AW DW ..

Adventure Works

Adventure Works

Link to Adventure Works ER

  1. Log into Data Catalog:

Username: [email protected]

Password: Welcome123!

  1. Click: Management in the left navigation menu.

Management - Resources
  1. In the Resources tile, click: Add Data Source.

Add a Data Source
  1. Specify the following information for the connection to your data source.

Test Connection and Ingest Metadata Schema ..

After you have specified the detailed information according to your data source type, test the connection to the data source and add the data source.

  1. Enter the following details to connect to: Adventure Works database.

Field
Setting

Data Source Name

mssql:adventureworks2022

Data Source ID

Leave Blank to autogenerate ID

Description

AW DW: Person, HR, Purchasing, Sales, Production

Data Source Type

Microsoft SQL Server

Affinity

Default

Configuration Method

URI

Username

sa

Password

StrongPassword123

URI

jdbc:sqlserver://pdc.pentaho.lab:1433;databaseName=AdventureWorks2022;user=sa;password=StrongPassword123;encrypt=false

Driver

mssql-jdbc-12.10.1.jre11.jar*

Database Name

AdventureWorks2022

Manage Drivers

x

Enter connection details
  1. Download the MSSQL JDBC driver:

  1. Untar the file:

cd
  1. Select the JDBC driver:

Select JDBC driver
  1. Enter the URI:

jdbc:sqlserver://pdc.pentaho.lab:1433;databaseName=AdventureWorks2022;user=sa;password=StrongPassword123;encrypt=false
  1. Click Test Connection to test your connection to the specified data source.

Test Connection
  1. Click Ingest Schema, select the following 5 schemas, and then click Ingest Schemas.

Select Schemas

Recommendation

For Pentaho Data Catalog, the best practice is to:

  1. Use the standard connection URI without schema restrictions

  2. Control schema scope through the selective "Ingest Schema" process during data source setup

  3. Use virtual folders and role-based access control for further data governance

This approach aligns with Data Catalog's design philosophy of comprehensive discovery followed by organized access control rather than connection-level restrictions.

  1. (Optional) In the Physical Location field, specify the physical location details of the data source.

  2. (Optional) Configure the following storage optimization options for the data source.

Field
Description

Available for Migration

Enables or disables the data source for storage optimization. When enabled, it includes the data source for data optimizer activities.

Available for Writing

Enables or disables writing capabilities for the data source and enables migration when turned on.

Available for Data Mastering

Enables or disables the data source for data mastering purposes.

To use storage optimization options, you need a Pentaho Data Optimizer license.

  1. (Optional) In the Cost per Terabyte field, specify the data source pricing details like currency, price per terabyte, and billing frequency.

  2. (Optional) In the Total Capacity field, specify the total capacity of the data source in terabytes.

  3. (Optional) Enter a Note for any additional information to share with others who might access this data source.

  4. Click Create Data Source to establish your data source connection.

So we've now ingested the AdventureWorks database schemas.

Before we start getting into the weeds, let's revisit our users and apply their Permissions & Scope.


Last updated

Was this helpful?