SMB
File sharing ..
Workshop - SMB/CIFS
The Server Message Block (SMB) protocol is a network file sharing protocol that allows applications on a computer to read and write to files and to request services from server programs in a computer network. The SMB protocol can be used on top of its TCP/IP protocol or other network protocols
Objective of this workshop is to:
install & configure a basic Samba server.
share user home directories as well as provide read-write anonymous access to selected directory.


Create a new Transformation
Any one of these actions opens a new Transformation tab for you to begin designing your transformation.
By clicking File > New > Transformation
By using the CTRL-N hot key
Test SMB Server
Before we fire up Pentaho Data Integration, let's test:
SMB server is up and running
Can log into User - Bob & Alice - & Shared spaces.
Please ensure you have completed the following setup: SMB
Log into your Docker Desktop to check that the SMB Docker container is up and running.

Let's test SMB server ..

SMB Shared Folders
Let's check we have some sample data in our container/shared folder.
There's a couple of ways you could do this ..!
If you have an IDE Editor installed, you can install the Docker Container Extension, see Windows 11 Pentaho Lab.
In the Docker Desktop UI click on the workshop-server-smb.

Click on Files > scroll down to shared folder - expand to see mounted volumes.

Lets connect to a data source using SMB VFS in Pentaho Data Integration.
Pentaho Data Integration
Pentaho Data Integration utilizes Virtual File System (VFS) as the abstraction layer within the kernel to expose different filesystems.
In PDI, you can add a VFS connection and then reference that connection whenever you want to access files or folders on your Virtual File System.
Select the following OS.
Start Pentaho Data Integration.
Windows - PowerShell
x
x
Start Pentaho Data Integration.
Linux
Create a New Transformation.
Drag & drop the Text file input step onto the canvas.
Click on the 'View' tab.
Highlight 'VFS Connections' and select 'New'.

Configure with the following details:

Click 'Test'.

Transformation - SMB File Retrieval
Let's create a simple Transformation to onboard data via an SMB VFS connection.
Create the following transformation:

Double-click on Text file input > File tab
Click on Browse and ensure you select:
VFS Connections > SMB > Pentaho/design-tools/data-integration/samples/transformations/files/sales_data.csv
Add the path.

Click on Content tab & configure with the following settings:

Click on Fields tab & click on 'Get Fields'

Preview the rows.

Click OK.
Add the other steps to format / rename some fields, before output as a .txt in the same directory as your Transformation.
x
Last updated
Was this helpful?
