Hierarchical Data Type
Handling Hierarchical Data Types: JSON & YAML


As part of the effort to decouple Pentaho Data Integration & Analytics plugins from the core Pentaho Server, Pentaho EE 9.5 GA ships new plugins and enhancements to its existing plugin collection.
Log into the 'Pentaho Support Portal' and download the plugin.
Select the Pentaho version.

Download selected plugin(s).

Extract the Hierarchical Data Type (HDT) plugin.
Install the HDT plugin.
Accept the License Agreement, then click 'Next'.

Browse to the ../data-integration/plugins directory.

Click 'Next' and accept overwrite warning.

Restart Pentaho Data Integration and check for the Hierarchical folder.

Open the following transformation:
~/Workshop--Pentaho-Data-Integration/Module 3

Source tab
Double-click on the Hierarchical JSON Input step to see how it's configured.
From file
Select to specify the file path and name of the JSON file you want to load into PDI.
File name
File path and name of the JSON file to load.
From field
Select to use an incoming field as the JSON file path.
Field with file name
The incoming field containing the JSON file path.
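The difference between the two source modes can be sketched in plain Python. This is only a conceptual illustration, not how PDI implements the step; the function names, the `path` field, and the sample file content are hypothetical.

```python
import json
import tempfile
from pathlib import Path


def load_from_file(file_name):
    """'From file': one fixed path is configured in the step."""
    with open(file_name, encoding="utf-8") as fh:
        return json.load(fh)


def load_from_field(rows, field_with_file_name):
    """'From field': each incoming row carries its own JSON file path."""
    for row in rows:
        yield load_from_file(row[field_with_file_name])


# Demo with a throwaway file (hypothetical content).
with tempfile.TemporaryDirectory() as tmp:
    sample = Path(tmp) / "orders.json"
    sample.write_text('{"order_id": 1}', encoding="utf-8")

    from_file = load_from_file(sample)
    from_field = list(load_from_field([{"path": str(sample)}], "path"))
```

With 'From field', the number of documents loaded follows the number of incoming rows, which is why that mode needs an upstream step supplying the paths.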
Output
Click on the Output tab.

Output field
Specify the field name for the output column.
Split rows across path
Specify the JSON path at which the document is split into separate output rows.
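As a rough illustration of what splitting rows across a path means, the sketch below walks to a nested list and emits one row per element. The dotted path syntax, the `split_rows` helper, the `json_out` field name, and the sample document are assumptions made for this example; the step's actual path syntax may differ.

```python
import json


def split_rows(document, path):
    """Walk a dotted path (e.g. 'store.orders') and return one item per
    element of the list found there; a non-list value yields one item."""
    node = document
    for key in path.split("."):
        node = node[key]
    return node if isinstance(node, list) else [node]


doc = json.loads('{"store": {"orders": [{"id": 1}, {"id": 2}]}}')

# Each element under the path becomes its own row, stored in the
# output field (named 'json_out' here purely for illustration).
rows = [{"json_out": item} for item in split_rows(doc, "store.orders")]
```

Here a single input document with two orders produces two output rows, each holding one order as hierarchical data.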
Filters
Click on the Filters tab.

Double-click on the Extract to rows step to see how it's configured.

Step name
Specifies the unique name of the Extract to rows step on the canvas. You can customize the name or leave it as the default.
Source hierarchical field
Specifies the hierarchical input field name from the previous step, which will be used to extract the data.
Pass through fields
Select to add the input fields to the output fields.
Fields
Hierarchical data path
Complete path of the field name in the hierarchical field source.
Output field name
Name of the field that maps to the corresponding field in the hierarchical input source.
Type
Data type of the generated output field.
Path field name
(Optional) Adds the hierarchical path as a new output field with the specified name.
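The field mapping above can be pictured as a small lookup-and-cast routine. The following is a minimal sketch under stated assumptions: dotted paths, the `extract_row` helper, and the sample row are hypothetical, and the optional path field is left out.

```python
def get_path(node, path):
    """Follow a dotted path such as 'customer.name' through nested dicts."""
    for key in path.split("."):
        node = node[key]
    return node


def extract_row(row, source_field, mappings, pass_through=True):
    """mappings: (hierarchical data path, output field name, type) triples."""
    # 'Pass through fields' keeps the incoming fields in the output row.
    out = dict(row) if pass_through else {}
    for path, name, cast in mappings:
        out[name] = cast(get_path(row[source_field], path))
    return out


row = {
    "file": "orders.json",
    "doc": {"customer": {"name": "Ada"}, "total": "19.90"},
}
mappings = [
    ("customer.name", "customer_name", str),
    ("total", "order_total", float),
]

flat = extract_row(row, "doc", mappings, pass_through=False)
kept = extract_row(row, "doc", mappings, pass_through=True)
```

With pass-through disabled, `flat` holds only the two mapped fields; with it enabled, `kept` also carries the original `file` and `doc` fields.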
Double-click on the Hierarchical JSON Output step to see how it's configured.

Input hierarchical field
Specifies the hierarchical input field from a previous step that will be formatted as JSON.
Output field
Specifies the step output field to contain the generated JSON output.
Options
Pass output to servlet
Select to return the data using a web service instead of passing it to output rows.
Pretty print?
Select to format the output JSON with line breaks and indentation for readability.
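The effect of pretty printing can be seen with Python's standard `json` module. This only illustrates the general idea; the exact indentation the step produces is not specified here, and the sample record is hypothetical.

```python
import json

record = {"id": 1, "tags": ["a", "b"]}

# Compact form: everything on one line, no extra whitespace.
compact = json.dumps(record, separators=(",", ":"))

# Pretty-printed form: one key or element per line, indented.
pretty = json.dumps(record, indent=2)

print(compact)
print(pretty)
```

Both strings parse back to the same data; pretty printing changes only the layout, at the cost of a larger payload.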
Run the transformation and select 'Preview data'.

