# Pentaho Data Catalog

{% hint style="danger" %}
The workshops are currently being updated for the 10.2.8 release.
{% endhint %}

{% hint style="info" %}

## Introduction

Pentaho Data Catalog serves as a comprehensive metadata management solution that helps organizations document, organize, and understand their data assets. It provides a centralized repository where data professionals can discover, understand, and govern data across the enterprise.

One of the primary use cases for Pentaho Data Catalog is data discovery and lineage tracking. Organizations with complex data ecosystems can use it to map relationships between different data sources, transformations, and outputs. This capability is particularly valuable for regulatory compliance, as it enables teams to trace how sensitive data moves through systems and who has access to it.

Another key application is business glossary management, where Pentaho Data Catalog helps bridge the gap between technical metadata and business terminology. This creates a common language across the organization, allowing business users to find and understand relevant data without requiring deep technical knowledge of underlying systems. For data governance initiatives, this capability ensures consistent definitions and usage of critical business terms.

Pentaho Data Catalog also supports impact analysis, helping teams understand how changes to data sources might affect downstream reports and applications. This proactive approach to change management reduces the risk of disruptions when modifying databases, ETL processes, or reporting structures.
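The idea behind impact analysis can be illustrated with a small, hedged sketch. It is not how Pentaho Data Catalog implements the feature; it only shows the general technique of walking a lineage graph breadth-first to find everything downstream of a changed source. `Sales.SalesOrderHeader` and `Person.Person` are AdventureWorks tables, while the ETL job, warehouse table, and report names are hypothetical, as are the `LINEAGE` mapping and the `downstream_assets` helper.

```python
from collections import deque

# Hypothetical lineage edges: each key feeds the assets listed in its value.
# Only the two AdventureWorks table names are real; the rest are made up.
LINEAGE = {
    "Sales.SalesOrderHeader": ["etl.load_sales"],
    "etl.load_sales": ["dw.fact_sales"],
    "dw.fact_sales": ["report.monthly_revenue", "report.top_customers"],
    "Person.Person": ["report.top_customers"],
}


def downstream_assets(source, lineage):
    """Return every asset reachable from `source`, in breadth-first order."""
    seen = set()
    order = []
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for child in lineage.get(node, []):
            if child not in seen:  # skip assets already reached by another path
                seen.add(child)
                order.append(child)
                queue.append(child)
    return order


# Which assets could be affected by a change to the order header table?
impacted = downstream_assets("Sales.SalesOrderHeader", LINEAGE)
print(impacted)
```

In this toy graph, a change to `Sales.SalesOrderHeader` reaches the load job, the fact table, and both reports, whereas a change to `Person.Person` reaches only `report.top_customers`.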

This series of workshops introduces Pentaho Data Catalog and its capabilities for managing both structured and unstructured data efficiently. Through a combination of automated processes and machine learning, the workshops guide you through the essential functions of data ingestion, profiling, and curation across multiple data sources.

By the end of the workshops, you will have a comprehensive understanding of:
{% endhint %}

<details>

<summary>Key Concepts &#x26; Terminology</summary>

Familiarize yourself with the foundational terminology and concepts used within the Pentaho Data Catalog environment.

</details>

<details>

<summary>Connecting to various Data Sources</summary>

Learn how to establish [connections](https://academy.pentaho.com/pentaho-data-catalog-en/data-catalog/data-discovery/connect-aw-database/other-data-sources) to a wide range of data sources to enable [data ingestion](https://academy.pentaho.com/pentaho-data-catalog-en/data-catalog/data-discovery/connect-aw-database/other-data-sources#id-1.1-ingest-metadata).

</details>

<details>

<summary>Ingesting &#x26; Profiling Data</summary>

Discover the methods used for ingesting data and how [profiling](https://academy.pentaho.com/pentaho-data-catalog-en/overview/broken-reference) helps you understand your data's structure and quality.

</details>

<details>

<summary>Business Glossary &#x26; Terms</summary>

Understand the significance of maintaining a [business glossary](https://academy.pentaho.com/pentaho-data-catalog-en/overview/broken-reference) and how it aids in aligning data with business terminology.

</details>

<details>

<summary>Rules</summary>

Explore how metadata rules are applied to data within the Pentaho Data Catalog to ensure consistency and relevance.

</details>

***

{% hint style="info" %}

#### **Prerequisites**

* Basic SQL knowledge
* Familiarity with the AdventureWorks2022 database schema
* Access to Pentaho Data Catalog with a JDBC connection to AdventureWorks2022
  {% endhint %}

{% hint style="info" %}

#### Overview

Take a look at the following walkthrough to get the best experience.
{% endhint %}

{% tabs %}
{% tab title="English" %}

<figure><img src="https://1051758685-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2Fw1qJj4OGmdcvowiklB9W%2Fuploads%2FN1Pzjh5WopfRCxOCH64G%2Foverview.png?alt=media&#x26;token=67bfbab7-a491-461f-9865-fa588bacaf21" alt=""><figcaption></figcaption></figure>
{% endtab %}
{% endtabs %}


