# Connectors

{% hint style="info" %}
Pentaho Data Quality (PDQ)
{% endhint %}

<figure><img src="https://content.gitbook.com/content/7w2PGGr7BR8BUW5guKgZ/blobs/4JXOSG32zpJ2NgZtZBcX/image.png" alt=""><figcaption><p>PDQ Connectors</p></figcaption></figure>


{% tabs %}
{% tab title="Snowflake" %}
{% hint style="info" %}
Snowflake is a cloud-based data warehousing and analytics platform that separates compute from storage, allowing for scalable and flexible data processing. Snowflake supports structured and semi-structured data, enables data sharing across organizations, and provides features for data lakes, data engineering, and machine learning.
{% endhint %}

{% hint style="warning" %}
Before you define the Snowflake connection, ensure the prerequisite steps have been completed.

For further information: &#x20;
{% endhint %}

1. Click the + icon in the top right-hand corner of the UI.
2. Select Snowflake from the connector list.

<figure><img src="https://content.gitbook.com/content/7w2PGGr7BR8BUW5guKgZ/blobs/pDXjO9TjnRuarP5q8Jpw/image.png" alt=""><figcaption><p>Snowflake Connector</p></figcaption></figure>

3. Enter the following details to connect to your database.

<table><thead><tr><th width="226">Field</th><th>Setting</th></tr></thead><tbody><tr><td>Connection name</td><td>Must match the connection name defined in PDC.</td></tr><tr><td>Description</td><td></td></tr><tr><td>Account</td><td>Your Snowflake account</td></tr><tr><td>Warehouse</td><td></td></tr><tr><td>Database</td><td></td></tr><tr><td>Authentication Type</td><td>Username and Password</td></tr><tr><td>User</td><td></td></tr><tr><td>Password</td><td></td></tr></tbody></table>

4. Validate the connection details.
5. Click 'Connect' and select the required tables and queries. The following screenshot displays the 'Synthea' tables.

<figure><img src="https://content.gitbook.com/content/7w2PGGr7BR8BUW5guKgZ/blobs/quMcQtrxRlFPKhSTmNxx/image.png" alt=""><figcaption><p>Connection to 'Synthea'</p></figcaption></figure>
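
Before entering the values in the connector form, it can help to check that none of the fields from the table above has been left blank. The following is a minimal sketch only: it is not part of PDQ, the function name `missing_fields` is hypothetical, the placeholder values are illustrative, and treating Description as optional is an assumption.

```python
# Field names mirror the Snowflake connector table above.
# Assumption: Description is optional, so it is not listed here.
REQUIRED_FIELDS = [
    "Connection name",
    "Account",
    "Warehouse",
    "Database",
    "Authentication Type",
    "User",
    "Password",
]


def missing_fields(settings: dict[str, str]) -> list[str]:
    """Return the required fields that are absent or blank, in table order."""
    return [name for name in REQUIRED_FIELDS if not settings.get(name, "").strip()]


# Placeholder values for illustration only; replace them with your own.
example = {
    "Connection name": "<name defined in PDC>",
    "Account": "<your Snowflake account>",
    "Warehouse": "",  # left blank on purpose to show the check
    "Database": "<database>",
    "Authentication Type": "Username and Password",
    "User": "<user>",
    "Password": "<password>",
}

print(missing_fields(example))  # ['Warehouse']
```

An empty list means every required field has a value; it does not confirm that the values are correct, which is what the validation in step 4 is for.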

{% endtab %}

{% tab title="MSSQL" %}
{% hint style="info" %}
PDQ supports only MSSQL versions later than 2017.
{% endhint %}


1. Click the + icon in the top right-hand corner of the UI.


2. Select MSSQL from the connector list.


3. Enter the following details to connect to the MSSQL AdventureWorks2019 database.

<table><thead><tr><th width="226">Field</th><th>Setting</th></tr></thead><tbody><tr><td>Connection name</td><td>mssql:adventureworks2019</td></tr><tr><td>Description</td><td>Demo dataset for a fictitious bicycle manufacturer</td></tr><tr><td>Server</td><td>pdc.pdc.lab</td></tr><tr><td>Port</td><td>1433</td></tr><tr><td>Database</td><td>AdventureWorks2019</td></tr><tr><td>Authentication Type</td><td>Username and Password</td></tr><tr><td>User</td><td>sqlreader</td></tr><tr><td>Password</td><td>2Petabytes</td></tr><tr><td>Schema</td><td>Sales</td></tr><tr><td>Driver</td><td>mssql-jdbc-9.2.1.jre15.jar</td></tr></tbody></table>


4. Validate the connection details.


5. Click 'Connect' and select the required tables and queries.
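
If validation fails, it can be useful to check the settings outside PDQ. The sketch below rests on two assumptions: that the Server, Port, and Database fields map onto the standard Microsoft JDBC URL form `jdbc:sqlserver://<server>:<port>;databaseName=<database>`, and that the machine running the script can reach the server. How PDQ assembles its connection string internally is not documented here, and the helper names `mssql_jdbc_url` and `port_is_reachable` are hypothetical.

```python
import socket


def mssql_jdbc_url(server: str, port: int, database: str) -> str:
    """Build a SQL Server JDBC URL from the Server, Port and Database fields."""
    return f"jdbc:sqlserver://{server}:{port};databaseName={database}"


def port_is_reachable(server: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to server:port can be opened."""
    try:
        with socket.create_connection((server, port), timeout=timeout):
            return True
    except OSError:
        # Covers DNS failures, refused connections and timeouts.
        return False


# Values taken from the MSSQL table above.
url = mssql_jdbc_url("pdc.pdc.lab", 1433, "AdventureWorks2019")
print(url)  # jdbc:sqlserver://pdc.pdc.lab:1433;databaseName=AdventureWorks2019
```

Calling `port_is_reachable("pdc.pdc.lab", 1433)` from the PDQ host indicates whether the SQL Server port is open. A `False` result points to a network, DNS, or firewall problem rather than to wrong credentials.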

{% endtab %}

{% tab title="Oracle" %}
{% hint style="info" %}
Synthea™ is an open-source tool that generates synthetic patient data, simulating individuals' complete medical histories. This encompasses medications, allergies, encounters, and social determinants of health for each mock patient.
{% endhint %}

<figure><img src="https://content.gitbook.com/content/7w2PGGr7BR8BUW5guKgZ/blobs/VpBMAryzesHEgpjevYu1/image.png" alt=""><figcaption><p>Synthea</p></figcaption></figure>

{% hint style="warning" %}
The generated data is free from legal and privacy concerns.
{% endhint %}

<figure><img src="https://content.gitbook.com/content/7w2PGGr7BR8BUW5guKgZ/blobs/EYaCagjoBJTpD9PYsSfg/image.png" alt=""><figcaption><p>Synthea ERD</p></figcaption></figure>

***


1. Enter the following details to connect to the Oracle business\_apps\_db (Synthea) database.

<table><thead><tr><th width="226">Field</th><th>Setting</th></tr></thead><tbody><tr><td>Data Source Name</td><td>oracle:synthea</td></tr><tr><td>Data Source ID</td><td>Leave blank to autogenerate</td></tr><tr><td>Description</td><td>Demo dataset of patients' medical records</td></tr><tr><td>Data Source Type</td><td>Oracle</td></tr><tr><td>Affinity</td><td>Default</td></tr><tr><td>Configuration Method</td><td>Credentials</td></tr><tr><td>   Username</td><td>sqlreader</td></tr><tr><td>   Password</td><td>2Petabytes</td></tr><tr><td>   *Host</td><td>pdc.pdc.lab</td></tr><tr><td>   Port</td><td></td></tr><tr><td>**Driver</td><td>postgresql-42.7.1.jar</td></tr><tr><td>Database Name</td><td>business_apps_db</td></tr></tbody></table>

{% endtab %}
{% endtabs %}
