# Getting Started

{% hint style="info" %}

#### Introduction

Pentaho Data Catalog is a powerful tool that enables data engineers, data scientists, and business users to accelerate their data intelligence journey. It automatically discovers, classifies, and contextualizes structured and unstructured data. Here are some key features:

* **Powerful Business Glossary**: Contextualize data with business vocabulary based on governance policies and business rules. This helps activate metadata and ensures alignment with business language.
* **Data Lineage and Trust**: Track data lineage with OpenLineage, building trust as data flows through your organization. Enable data quality and remediation activities.
* **Observability and Monitoring**: A robust observability stack captures popular assets, popular searches, and trends. This helps stewardship organizations focus their energy on the right data.
* **Integration and Scalability**: API-powered integrations with various platforms (NetApp, SAP HANA, S3, SQL views) ensure interoperability. The modern architecture design scales seamlessly.
* **Enterprise Security**: Features include role-based access control (RBAC), password vault support, minimum privileges, multifactor authentication, secure cloud deployments, and no data deduplication.

Discover, understand, and govern your data with Pentaho Data Catalog. It offers faster discovery, lower total cost of ownership (TCO), and improved data quality.
{% endhint %}

***

{% hint style="info" %}

#### Accessing Your Catalog

To get started using the Data Catalog, log in using the address and credentials provided by your Data Catalog service user or administrator.
{% endhint %}

To access your catalog, please follow these steps:

1. Open the **Google Chrome** web browser.
2. Navigate to:

{% embed url="<https://pdc.pentaho.lab>" %}

3. Enter one of the following email and password combinations, then click **Sign In**.

<figure><img src="/files/85S619jAFWhWove4mbqs" alt=""><figcaption><p>PDC Log In</p></figcaption></figure>

<table><thead><tr><th width="251">Username</th><th width="173">Password</th><th>Default PDC In-built Role</th></tr></thead><tbody><tr><td>system_admin@hv.com</td><td>Welcome123!</td><td>All the Roles combined</td></tr><tr><td>admin@hv.com</td><td>Welcome123!</td><td>Community &#x26; User Administrator</td></tr><tr><td>business_steward@hv.com</td><td>Welcome123!</td><td>Manage Business Glossary</td></tr><tr><td>business_user@hv.com</td><td>Welcome123!</td><td>View Business Glossary</td></tr><tr><td>data_user@hv.com</td><td>Welcome123!</td><td>Add &#x26; Delete content</td></tr><tr><td>data_developer@hv.com</td><td>Welcome123!</td><td>Manage Business Rules &#x26; Domain Assets</td></tr><tr><td>data_steward@hv.com</td><td>Welcome123!</td><td>Manage most features except Glossary</td></tr></tbody></table>

{% hint style="warning" %}

#### **Security Advisory: Handling Login Credentials**

For enhanced security, it is strongly recommended that users avoid saving their login details directly in web browsers. Browsers may inadvertently autofill these credentials in unrelated fields, posing a security risk.

**Best Practice**

* **Disable Autofill:** To mitigate potential risks, users should disable the autofill functionality for login credentials in their browser settings. This preventive measure ensures that sensitive information is not unintentionally exposed or misused.
{% endhint %}

{% tabs %}
{% tab title="1. Tour of UI" %}
{% hint style="info" %}

#### User Interface

The Pentaho Data Catalog Home page provides a central location for accessing the business tools available to you based on your permissions, such as data canvas, business glossary, applications, policies, management, workers, and so on.

You can also use the menu bar on the left to navigate to different features in the product.
{% endhint %}

<figure><img src="/files/vutZwW75fFtBUKHJF6Qf" alt=""><figcaption></figcaption></figure>

***

{% hint style="info" %}

#### User Menu Bar <a href="#user-menu-bar" id="user-menu-bar"></a>

The top menu bar is visible from anywhere in Data Catalog.

The following table includes details about its features:
{% endhint %}

<table><thead><tr><th width="93">Icon</th><th>Function</th></tr></thead><tbody><tr><td><div><figure><img src="/files/PzJdenVCcXUPjZc2Ibm5" alt=""><figcaption></figcaption></figure></div></td><td>Click the <strong>Access Request</strong> icon to open the Request Access window. See Request access for information on completing the request.</td></tr><tr><td><div><figure><img src="/files/NHzWHzn8DzQvtwH0sNB8" alt=""><figcaption></figcaption></figure></div></td><td>Click the icon to view your notifications. You can switch your view between <strong>Unread</strong> and <strong>All</strong> notifications. A number next to the icon shows how many unread notifications you have.</td></tr><tr><td><div><figure><img src="/files/YaKD8HJrxFL5isltrF3W" alt=""><figcaption></figcaption></figure></div></td><td>Click the icon to view your assigned user role and email domain.</td></tr><tr><td><div><figure><img src="/files/KI2I0vcKKccXSNwCB4em" alt=""><figcaption></figcaption></figure></div></td><td>Click the icon and select <strong>Log Out</strong> to log out of Data Catalog.</td></tr><tr><td><div><figure><img src="/files/71RJqBYx17cHrlig6zkB" alt=""><figcaption></figcaption></figure></div></td><td>Click the <strong>Documentation link</strong> icon to go to the Data Catalog documentation.</td></tr><tr><td><div><figure><img src="/files/DGfHtElxFLdbFjphZMLz" alt=""><figcaption></figcaption></figure></div></td><td>Click <strong>Edit</strong> to open the Landing page options window, where you can configure the landing page with available options in <strong>Shortcuts</strong> and <strong>Tables</strong>. Additionally, you can choose to have a vertical or stacked arrangement in <strong>Layout</strong>. <strong>Note:</strong> This option is only visible on the Home page.</td></tr></tbody></table>

{% hint style="info" %}

#### Left navigation menu <a href="#left-navigation-menu" id="left-navigation-menu"></a>

Use the navigation menu bar on the left of the page to access the key features available to you in Data Catalog, depending on your permissions. You can expand and collapse the menu to have a better view.

The following is a list of the menu icons and the features they open:
{% endhint %}

<table><thead><tr><th width="97">Icon</th><th>Function</th></tr></thead><tbody><tr><td><div><figure><img src="/files/Zzg6PtX4loK7XV5wIFXI" alt="" width="18"><figcaption></figcaption></figure></div></td><td>Returns you to the Home page from your current location in Data Catalog.</td></tr><tr><td><div><figure><img src="/files/2zpkfRRFKtGDlb1TnHCX" alt="" width="21"><figcaption></figcaption></figure></div></td><td>Browse, search, and discover data assets.</td></tr><tr><td><div><figure><img src="/files/4fgQnzxZzkIu6arroODo" alt="" width="18"><figcaption></figcaption></figure></div></td><td>Explore your data on the Data Canvas page.</td></tr><tr><td><div><figure><img src="/files/kXwirGe6uTXYIO9OKwW1" alt="" width="18"><figcaption></figcaption></figure></div></td><td>Opens the Business Glossary page, where you can create, organize, and curate business terms to help you navigate your data.</td></tr><tr><td><div><figure><img src="/files/VapSnLRPtBveXwB9MiKV" alt="" width="18"><figcaption></figcaption></figure></div></td><td>Opens the Reference Data page, which contains relatively static data values that your organization commonly uses.</td></tr><tr><td><div><figure><img src="/files/uAO0mai95bBSsuWHFR6J" alt="" width="18"><figcaption></figcaption></figure></div></td><td>Opens the Master Data page, where you can manage, consolidate, and maintain high-quality master data across the organization.</td></tr><tr><td><div><figure><img src="/files/VQV5yQgc7wBnHsvT3xdg" alt="" width="18"><figcaption></figcaption></figure></div></td><td>Opens the Applications page, where you can create, organize, curate, and identify application assets like external applications, groups, and categories to help you understand what type of data is linked from an external application.</td></tr><tr><td><div><figure><img src="/files/Kn5vpqo1mc4yeMmnaY2a" alt="" width="18"><figcaption></figcaption></figure></div></td><td>Opens the Policy page, where you can access and manage policies that govern how data within Data Catalog is managed, accessed, and used.</td></tr><tr><td><div><figure><img src="/files/Cg7o8ut7uGymKIl6iotd" alt="" width="24"><figcaption></figcaption></figure></div></td><td></td></tr><tr><td><div><figure><img src="/files/oES8PPwAHalqTbricM6I" alt="" width="35"><figcaption></figcaption></figure></div></td><td></td></tr><tr><td><div><figure><img src="/files/8zjE3BxeqxETeWYekR6k" alt="" width="18"><figcaption></figcaption></figure></div></td><td>Manage your data sources, users, user roles, workers, business rules, schedules, dictionaries, and more on the Manage Your Environment page.</td></tr><tr><td><div><figure><img src="/files/63OrPTkmY1x6WMt3RExy" alt="" width="18"><figcaption></figcaption></figure></div></td><td>Monitor the progress of data activities on the Workers page.</td></tr><tr><td><div><figure><img src="/files/jXuapjNMjyUEwqKAVeUa" alt="" width="18"><figcaption></figcaption></figure></div></td><td>Opens the Galaxy View feature, which offers a visual representation of data assets and their relationships.</td></tr></tbody></table>

{% hint style="info" %}

#### Quick access cards <a href="#quick-access-cards" id="quick-access-cards"></a>

When you navigate to the Home page, you can see quick access cards for various Data Catalog features, depending on your permissions. Based on the options selected in the Landing page options window (which opens when you click **Edit**), the Home page can display the following widgets.
{% endhint %}

<figure><img src="/files/Dj7RTp8uerWAiGvx1sE6" alt=""><figcaption></figcaption></figure>

<table><thead><tr><th width="174">Card</th><th>Description</th></tr></thead><tbody><tr><td>Explore your data</td><td></td></tr><tr><td></td><td></td></tr><tr><td></td><td></td></tr></tbody></table>
{% endtab %}

{% tab title="2. Data Canvas" %}
{% hint style="info" %}

#### Explore Your Data with the Data Canvas

Dive into the Data Canvas to uncover and analyze your data in depth. This powerful tool provides extensive insights into resource metadata, enhancing your comprehension and illustrating real-world applications. Discover the potential of your data through this intuitive platform.
{% endhint %}

Once you have processed a dataset:

1. Click Data Canvas in the left navigation menu to open the Data Canvas view.

<figure><img src="/files/vKkq8Lvr4RURMcIbnviX" alt=""><figcaption><p>Data Canvas</p></figcaption></figure>

<table><thead><tr><th width="84">Item</th><th width="157">Name</th><th>Description</th></tr></thead><tbody><tr><td>1</td><td>Top Navigation</td><td>Navigation path. Navigate the tree of data entities to find the one you want to explore in the canvas.</td></tr><tr><td>2</td><td></td><td>Displays information about the selected entity / resource.</td></tr><tr><td>3</td><td><a href="/pages/ujpLBE9Ed6iChboer1mP">Data Lineage</a></td><td>Data lineage refers to the ability to track the origin and movement of data throughout its lifecycle. Data lineage helps to ensure data accuracy, troubleshoot issues, and meet compliance requirements.</td></tr><tr><td>4</td><td>Key Metrics</td><td>Metrics to indicate the overall Data Quality (pulled from Pentaho Data Quality) of the resource. You can set the Sensitivity &#x26; Trust Score.</td></tr><tr><td>5</td><td><a href="/pages/cIqRTmUHvrAEy2vCrdZS#id-2.3-terms">Business Terms</a></td><td><p>You create business terms to standardize definitions of business concepts so that your data is described in a uniform and easily understood way across your enterprise.</p><p>Business terms can describe the contents of the data, the sensitivity of the data, or other aspects of the data, such as the subject or purpose of the data. You can assign one or more business terms to individual columns in relational data sets, to other governance artifacts, or to data assets.</p></td></tr><tr><td>6</td><td>Properties</td><td>Metadata about the asset / resource, for example: 'Last Update' &#x26; </td></tr><tr><td>7</td><td>Tags</td><td></td></tr><tr><td>8</td><td>Custom Properties</td><td></td></tr></tbody></table>
{% endtab %}

{% tab title="3. Galaxy View" %}

#### Using Galaxy View for Advanced Data Searches

**Galaxy view** offers an intuitive approach to navigating complex data structures, empowering users to conduct precise searches across databases. It's an invaluable tool for roles such as information security officers who need to pinpoint sensitive information efficiently, like credit card data within expansive databases.

**Key Features:**

* **Search Flexibility:** Easily search for terms like "credit" with the ability to filter results. Filters such as **Columns** allow users to identify specific columns containing credit card information, while the **Tables** filter returns tables explicitly named with "credit".
* **Scope Definition:** Tailor your search scope using filters to streamline the process of locating pertinent information. This ensures that you only get relevant results matching your search criteria.
* **Data Visualization:** The Galaxy view provides a comprehensive overview, highlighting data relationships at a glance. This bird's eye view is particularly useful for understanding the structure and interconnections of your data beyond what a traditional navigation tree offers.
* **Drill-Down Capability:** Once in the Galaxy view, users can delve deeper into specific data points for detailed information, ensuring a thorough analysis of the data structure and content.

Galaxy view is especially recommended for those who require a macro yet detailed perspective on data relationships, making it easier to manage and analyze vast databases effectively.
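
The difference between the **Columns** and **Tables** filters described above can be pictured with a small conceptual model. This is only a sketch under stated assumptions: the `Asset` class, the `search` function, and the sample paths are hypothetical illustrations, not part of any Data Catalog API, and the real search may match on more than asset names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Asset:
    kind: str  # "table" or "column" (hypothetical labels)
    path: str  # hypothetical dotted path, e.g. schema.table.column


def search(assets, term, scope=None):
    """Return assets whose own name contains `term`.

    `scope` mimics a Tables/Columns filter: None means no filter,
    "table" keeps only tables, "column" keeps only columns.
    """
    term = term.lower()
    return [
        a for a in assets
        if term in a.path.split(".")[-1].lower()
        and (scope is None or a.kind == scope)
    ]


# Invented sample data for illustration only.
ASSETS = [
    Asset("table", "sales.credit_accounts"),
    Asset("column", "sales.credit_accounts.id"),
    Asset("column", "sales.payments.credit_card_number"),
    Asset("table", "sales.payments"),
]

tables = search(ASSETS, "credit", scope="table")
columns = search(ASSETS, "credit", scope="column")
print([a.path for a in tables])
print([a.path for a in columns])
```

In this model the same keyword returns different assets depending on the scope, which is the behaviour the filters are meant to provide: a narrower, more relevant result set.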

{% hint style="info" %}
Before you use Galaxy view, the data must be processed and Business Glossaries and Terms added.
{% endhint %}

1. To access Galaxy view from the Data Canvas, select a folder, for example 'synthea'.

<figure><img src="/files/k36yGfyjeb3rfqKyD6kr" alt=""><figcaption><p>Actions - View Galaxy</p></figcaption></figure>

2. Click 'Actions' and select 'View Galaxy'.

<figure><img src="/files/1EcRQwMOsGDQ5ErRHGdm" alt=""><figcaption><p>View Galaxy - synthea</p></figcaption></figure>

Here are the key tasks you can perform in Galaxy view:

<figure><img src="/files/48EOJpEhvNggd4IO8Fxg" alt=""><figcaption></figcaption></figure>

<table><thead><tr><th width="244">Task</th><th>Description</th></tr></thead><tbody><tr><td>Search</td><td>Enter a keyword and select Search to find specific information within the resources. For example, enter "patients" to show only those sources, tables, and columns containing patient information.</td></tr><tr><td>View Details</td><td>Right-click on a selected data resource or column in Galaxy view and select View Details. The details panel appears. Depending on your selection, you can view different information, such as properties, tags, and custom properties. You can also view and add business terms to the resource.</td></tr><tr><td>View Items</td><td>Right-click on a data resource in Galaxy view and select View Items. You can view the associated parent and child data assets in a tree view.</td></tr><tr><td>Select and Select Tree</td><td>To select a single data resource, right-click on a data resource in Galaxy view and click Select. Additionally, you can select associated data resources by clicking Select Tree. When an item is selected, you can right-click and Deselect the item.</td></tr><tr><td>Focus</td><td>Right-click on a selected data resource or table and select Focus. Only the resource and its children appear. Continue to drill down using the Focus option as needed or select Leave Focus to return to the full view.</td></tr></tbody></table>
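
As a rough mental model of the **Focus** task, the view can be thought of as a tree that is narrowed to one node and everything beneath it. The sketch below is illustrative only: the `CHILDREN` mapping, the `focus` function, and the table and column names are hypothetical, and it assumes that "children" includes all descendants of the focused resource.

```python
# Hypothetical parent -> children mapping for a small catalog tree.
CHILDREN = {
    "synthea": ["patients", "encounters"],
    "patients": ["patients.id", "patients.birthdate"],
    "encounters": ["encounters.id"],
}


def focus(node, children=CHILDREN):
    """Return the focused node followed by all of its descendants."""
    visible = [node]
    for child in children.get(node, []):
        visible.extend(focus(child, children))
    return visible


full_view = focus("synthea")      # nothing hidden: the whole tree
focused_view = focus("patients")  # only 'patients' and its columns
print(focused_view)
```

Calling `focus` again on a node inside the focused view corresponds to drilling down further, and returning to `full_view` corresponds to Leave Focus.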

If you want to reduce the amount of data displayed, you can filter the level of detail in your view by columns or tables.

3. In Galaxy view, click Filters to open the Filters dialog box and select one or more of the following options:

<figure><img src="/files/t8jiD75ftCoZZUUYU9j3" alt=""><figcaption><p>Filters</p></figcaption></figure>

<table><thead><tr><th width="230">Filter Option</th><th>Description</th></tr></thead><tbody><tr><td>Level of Detail</td><td>By default, Galaxy view shows down to the table level as a reduced set of data. Click Columns and apply to have a detailed view down to the column level.</td></tr><tr><td>Show Relationships</td><td>Limits the results in the view to the data resources that have a Declared Foreign Key or Discovered Foreign Key relationship.</td></tr><tr><td>Show only Related Items</td><td>When Show Relationships is active, you can choose a threshold number to refine the results further.</td></tr><tr><td>Show only Tagged Items</td><td>Select this check box to limit the results in the view to the data resources that have associated tags. You can further refine your view by selecting specific tags.</td></tr><tr><td>Show only Items with Business Terms</td><td>Select this check box to limit the results in the view to the data resources that have business terms. You can further refine your view by selecting specific business terms.</td></tr><tr><td>Show Data Elements</td><td>Select this check box to show the data elements.</td></tr><tr><td>Reset</td><td>Discard your filters.</td></tr></tbody></table>

<figure><img src="/files/F2VCU3FBfiXBr5ZjRWRK" alt=""><figcaption><p>Filtered to column level, showing only items that have PII tags.</p></figcaption></figure>
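
How the filter options combine can be sketched with a small conceptual model. This is not Data Catalog code: `apply_filters`, its parameters, and the sample items are hypothetical, and the sketch covers only Level of Detail and Show only Tagged Items.

```python
# Invented sample items; tag names other than "PII" are assumptions.
ITEMS = [
    {"kind": "table", "name": "patients", "tags": set()},
    {"kind": "column", "name": "patients.ssn", "tags": {"PII"}},
    {"kind": "column", "name": "patients.city", "tags": {"Location"}},
    {"kind": "column", "name": "patients.id", "tags": set()},
]


def apply_filters(items, level="tables", only_tagged=False, tags=None):
    """Return the names of items that remain visible.

    level:       "tables" (assumed default) hides columns; "columns" shows them.
    only_tagged: keep only items that carry at least one tag.
    tags:        optional set of specific tags to refine `only_tagged`.
    """
    visible = []
    for item in items:
        if level == "tables" and item["kind"] == "column":
            continue
        if only_tagged and not item["tags"]:
            continue
        if only_tagged and tags and not (item["tags"] & set(tags)):
            continue
        visible.append(item["name"])
    return visible


default_view = apply_filters(ITEMS)
pii_columns = apply_filters(ITEMS, level="columns", only_tagged=True, tags={"PII"})
print(default_view)
print(pii_columns)
```

The second call mirrors the combination shown in the figure above: column-level detail restricted to items tagged PII.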
{% endtab %}
{% endtabs %}


