Flat Files
How Data Integration handles Flat files ..
Workshops
Despite being the most basic format used to store data, files are broadly used and they exist in several formats as fixed width, comma-separated values, spreadsheet, or even free format files. Pentaho Data Integration can read data from all types of files.
TXT & CSV Files
Some of the Orders data that Steel Wheels process are in a text format. In this guided demo, you will flatten the list, create capture groups, replace text, and finally format the order_value.
In this demonstration, you will format the text file input to be onboarded into a database table:
Text File Input
Flattener
RegEx Evaluation
Replace in String
Select values

TXT & CSV
Steel Wheels wants to send out a survey to its customers, based on a list of questions.
In this demonstration, you will configure a text file survey:
Get System Info
User Defined Java Expression
Data Grid
Append
Text File Output

Excel
Steel Wheels wish to automate their Half Yearly Sales and Expenses Report in Excel. The ETL process has been broken down into various workflows, resulting in writing data to an Excel template, once previous workflows have been completed.
In this demonstration, you will populate an Excel workbook source data sheet in a template:
Excel Writer
CSV File Input
Block Step

XML
Steel Wheels has some data sources in XML format. This guided demonstration illustrates the 3 data source options for retrieving XML data.
In this demonstration, you will retrieve XML data and format:
Data Grid

JSON
Steel Wheels have several JSON data sources. In this guided demonstration, you will create a simple workflow to extract the required reporting dataset.
In this demonstration, you will retrieve JSON data and format:
JSON Input

RSS
The good old days of bulletin boards ..!
A lot of websites have RSS feeds which can be used to: update a news feed, stock prices, sports scores and so on ..
In this demonstration you will configure the following step:
• RSS
Currently being updated to next version ..
Last updated
Was this helpful?
