# Observe

{% hint style="info" %}

{% endhint %}

{% tabs %}
{% tab title="Reliability" %}
{% hint style="info" %}
Reliability measures are the measures that include quality/health metrics such as completeness, accuracy, consistency, timeliness, and validity of the data to ensure the trustworthiness and usability of the data:

&#x20;  ● Freshness: The freshness of the data in an asset. &#x20;

&#x20;  ● Volume: Computes the total row count in the data.

&#x20;  ● Schema: Computes the total number of columns in the data.

&#x20;  ● Duplicates: Counts the number of rows that are identical based on either primary key or composite keys deﬁned by the user.
{% endhint %}

#### Freshness

{% hint style="info" %}
In freshness, the trend reports give an overview of how up-to-date the data is across various timelines.
{% endhint %}

x

x

{% hint style="info" %}
The freshness measure is by default active for all the assets and provides the timeliness information of the dataset conﬁgured. If a user wants to know when was the last time the dataset was updated or how often the data gets updated, this measure offers a comprehensive historical perspective on the dataset's trends.&#x20;

Frequently, optimal data freshness is essential for evaluating dataset quality before making crucial business decisions. In the example, the system captures freshness data and visualizes the trend. Additionally, the user can set a manual threshold to get notiﬁed when the freshness number is beyond a certain period.
{% endhint %}

#### Volume

{% hint style="info" %}
In Volume, the trend report shows if there are any changes or deviations in the total row count across a time period.
{% endhint %}

x

{% hint style="info" %}
Volume measure of an asset helps the user understand the changes in the record count of an asset over a period of time. Some use cases require estimating the business checks only on certain limited records.

The completeness dimension offers users insight into the trending record count, enabling them to implement corrective actions upon detecting anomalies. Users can choose to depend on ML-based anomaly detection for alerts or establish a manual threshold at a predetermined number, triggering platform alerts accordingly.
{% endhint %}

#### Schema

{% hint style="info" %}
In Schema, the graph shows the changes and deviations in the total column count across the time period.&#x20;

Actual result: It is the factual outcome generated from the data observability within the platform. It signiﬁes the real, obtained result by understanding and analyzing the data.&#x20;

Expected result: It refers to the anticipated or the predicted outcome based on the previous model or predeﬁned criteria.&#x20;

Comparing the expected result to the actual result in the reliability measures is essential in ensuring accuracy in predictions and assessing the success of various operations that will be performed in PDQ.
{% endhint %}

x

{% hint style="info" %}
Schema measure refers to the number of columns in a table or a view. Ideally, the number of columns would remain the same and the DQScore viewed at the asset level would be based on these columns.&#x20;

However, when columns are added or deleted from the source, this may lead to impacting the DQscore as these attributes contribute to its overall score calculation, misleading the consumer.&#x20;

By default, PDQ do not consider any new columns for score calculation unless the user enables the column in the connection page. It is a best practice to keep a check on the trend in schema.
{% endhint %}
{% endtab %}

{% tab title="Alerts" %}
{% hint style="info" %}
Alerts serve to indicate when an asset, attribute, or other aspect of the data set is not performing within the given parameters, set either by the platform or the user. Alerts are the primary mode of notiﬁcation within the PDQ platform.&#x20;

They serve to chronicle when an asset or attribute is performing outside the set of given parameters. Alerts can be used to monitor the performance of an asset or attribute. They are also linked to incident management through Issues.
{% endhint %}

<figure><img src="/files/XFJTjuPaKNVSCcLga2Fb" alt=""><figcaption><p>Alerts</p></figcaption></figure>

{% hint style="info" %}
On the Home Page: Alerts are featured as the third widget under the search bar.&#x20;
{% endhint %}

1. Click on either 'Alerts' - Assets with Alerts are displayed.

<figure><img src="/files/VPVEHV2M8PGZcJIyrZHj" alt=""><figcaption><p>Alerts</p></figcaption></figure>

2. Choose & Click on an 'asset' - Patients.
3. Click on the Alerts option in the top summary dashboard.

<figure><img src="/files/nixGfWUbr4jpZOCmWgwe" alt=""><figcaption><p>Assets - Patients Alerts</p></figcaption></figure>

4. Click on an Alert.

<figure><img src="/files/fnzHwDoPUpmzGsMagD9N" alt=""><figcaption><p>Alert - Value Range</p></figcaption></figure>

{% hint style="info" %}
By clicking on a chosen Alert, the user will be taken to an expanded view of the alert details, with a chart that displays when the alert was initiated, and its trend.&#x20;

Under Result, the user will ﬁnd the Data and Time when the asset was last run, as well as the Expected result, the Actual result, and the Priority.&#x20;

The user can select whether or not the action was taken regarding the Alert under Actions.
{% endhint %}

<figure><img src="/files/DgRJLIvkRJNZDc1hyzEU" alt=""><figcaption><p>Result</p></figcaption></figure>

x

x
{% endtab %}

{% tab title="Issues" %}
{% hint style="info" %}
Issues can be manually raised after an Alert is automatically detected by the PDQ platform.&#x20;

To create an issue, a user must ﬁrst select an Alert.
{% endhint %}

x

x

x

***

{% hint style="info" %}
Users can view all issues from the PDQ homepage.
{% endhint %}

1. Click on the Explore icon.
2. Select: 'Issues'.

x

3. Select an 'asset' that has a logged Issue -&#x20;

x

4. Change the 'Status' of the 'Issue' to:

New - default

In Progress

Resolved

x

x

x
{% endtab %}

{% tab title="Untitled" %}

{% endtab %}
{% endtabs %}


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://academy.pentaho.com/data-quality-en/pentaho-data-quality/getting-started/observe.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
