Components

Components, User Interface, Configuration options ..

Components

Understanding the architecture and components of Pentaho Data Integration (PDI) is fundamental to becoming an effective Pentaho developer and administrator. This section will familiarize you with the building blocks that make up the Pentaho Data Integration ecosystem and how they work together to deliver enterprise-grade data integration capabilities.

What You'll Learn

Pentaho Data Integration operates on a client-server architecture that separates design-time activities from runtime execution and administration. In this section, you'll explore:

  • Enterprise Components: The server-side infrastructure that handles execution, security, content management, and scheduling

  • Client Tools: The desktop applications used to design, test, and deploy your data integration solutions

  • Configuration Framework: The KETTLE configuration files that control system behaviour and store critical settings

  • Repository Management: How PDI manages versioning, collaboration, and content organization

  • Database Connectivity: The process of integrating JDBC drivers to connect to various data sources

Pentaho Enterprise

Browse to learn about the components:

Pentaho Client / Server Architecture

Data Integration

Spoon

Graphical modelling environment for developing, testing, debugging and monitoring jobs and transformations.

Designer

Drag & Drop 'objects' to design your pipelines and workflows.

Scheduler

Connects to Quartz scheduler on server. Jobs and transformations must be uploaded to Repository.

Engine

Kettle and Spark engines available to execute jobs and transformations.

Repository Browser

Connects to Apache Jackrabbit content Repository, pointing to a supported database:

  • PostgreSQL

  • MSSQL Server

  • Oracle

  • MySQL

  • MariaDB

DB Explorer

Database Explorer that enables you to conduct minimal database operations.

Last updated

Was this helpful?