MinIO
Access S3 type Object Store -VFS ..
Workshop - MinIO
MinIO is a high-performance, Kubernetes-native object storage system designed for cloud-native applications. Built from the ground up to be compatible with Amazon S3, MinIO offers a lightweight yet powerful alternative for organizations looking to deploy object storage in their own infrastructure.
At its core, MinIO provides distributed object storage with performance characteristics. It's capable of handling millions of operations per second and can store petabytes of data while maintaining sub-millisecond latency. This performance is achieved through a simplified architecture that eliminates complex dependencies and optimizes for modern hardware capabilities.
One of MinIO's key strengths lies in its versatility. It can be deployed virtually anywhere - from bare metal servers to public, private, and edge cloud environments. Organizations particularly value its seamless integration with Kubernetes, making it an ideal choice for containerized environments. MinIO's open-source nature also provides transparency and flexibility that many enterprises require for their data infrastructure needs.

Please ensure you have completed the following setup: MinIO

Log into MinIO.
Username: minioadmin
Password: minioadmin
Acknowledge the license agreement.

Click the 'Create Bucket' link.
Enter: sales-data & 'Create Bucket'.

Click on the Upload button.

Upload the sales_data.csv from:

Start Pentaho Data Integration.
Create a New Transformation.
Drag and drop a Text File input step onto the canvas.
Click: 'View' Tab.
Right mouse click on VFS Connections > New.
Enter the following details:

Connection Name
MinIO:sales-data
Connection Type
Amazon S3/Minio/HCP
Description
Connection to sales-data bucket
S3 Connection Type
Minio/HCP
Access Key
minioadmin
Secret Key
minioadmin
Endpoint
http://localhost:9000 [MinIO Container IP]
Signature Version
AWSS3V4SignerType
Root Folder Path
/
Test the connection.
The CSV file input step doesn't support VFS - it only supports local file paths. This is a design restriction in the CSV file input step for performance optimization.
Double-click on the Text file Input step.

Browse & Add the VFS - sales-data/sales_data.csv

Click on the Content tab and configure with the following settings:

Click on the Fields tab > Get Fields which will parse the file:

From here you can Preview the rows.

Last updated
Was this helpful?

