Glossaries

In the beginning ...

The Fundamental Challenge

Imagine walking into a massive library where every book is written in a different language, there are no translations and no common indexing system, and the same concept has 50 different names depending on who wrote about it. That's essentially what a modern enterprise data environment looks like without a business glossary.

Pentaho Data Catalog (PDC) is designed to address enterprise data discovery and governance, but without glossaries it is like a powerful search engine that can only search for technical codes rather than business meanings.

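Here's the problem: business users search by meaning, but data assets are stored under technical names such as CUST_TYPE_CD.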

Translation Layer

PDC's powerful search and discovery features are only useful if users can find what they're looking for. Glossaries provide the business vocabulary that makes technical assets discoverable by non-technical users.

Without Glossary:

  • User searches for "client type" → No results

  • User searches for "customer classification" → No results

  • User gives up, emails IT, waits 3 days

  • IT explains it's called CUST_TYPE_CD

  • User forgets by next month, cycle repeats

With Glossary:

  • All variations point to a single business term: "Customer Type"

  • Definition: "Classification of customer as Individual (I) or Store (S)"

  • User finds it immediately, understands it, uses it correctly
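
To make the translation idea concrete, here is a minimal sketch of a synonym lookup in Python. It is not PDC's API or data model: the `GlossaryTerm` class, the `normalize` and `build_index` helpers, and the in-memory dictionary are all hypothetical, chosen only to show how several search phrases and a technical column name can resolve to one business term.

```python
from dataclasses import dataclass, field


@dataclass
class GlossaryTerm:
    """Hypothetical stand-in for a glossary entry (not a PDC class)."""
    name: str
    definition: str
    synonyms: list[str] = field(default_factory=list)
    technical_assets: list[str] = field(default_factory=list)


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so lookups ignore case and spacing."""
    return " ".join(text.lower().split())


def build_index(terms: list[GlossaryTerm]) -> dict[str, GlossaryTerm]:
    """Map the term name, each synonym, and each technical name to the term."""
    index: dict[str, GlossaryTerm] = {}
    for term in terms:
        for label in [term.name, *term.synonyms, *term.technical_assets]:
            index[normalize(label)] = term
    return index


# Example data taken from the scenario above.
customer_type = GlossaryTerm(
    name="Customer Type",
    definition="Classification of customer as Individual (I) or Store (S)",
    synonyms=["client type", "customer classification"],
    technical_assets=["CUST_TYPE_CD"],
)

index = build_index([customer_type])

for query in ["client type", "Customer Classification", "CUST_TYPE_CD"]:
    term = index.get(normalize(query))
    print(f"{query!r} -> {term.name}: {term.definition}")
```

Each of the three queries resolves to the same "Customer Type" entry, while a phrase that is not registered returns `None`. That is the "No results" case from the list above. An exact-match dictionary is the simplest possible design; a real catalog search would presumably also handle partial and fuzzy matches.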
