credit-card-frontCredit Card

ML - Gradient Boosting (GBM)

circle-info

The results from H2O point to using a Gradient Boosting (GBM) algorithm.

In this lab, you operationalize that choice in PDI:

  • Train a GBM model in R.

  • Save the model artifact.

  • Predict fraudulent credit card transactions.

You will use the R gbm package.

Walkthrough (video)
circle-info

Train a GBM model with the same dataset.

main_job.kjb
  1. In Spoon, open the following main job:

  1. Right-click the train_model transformation.

  2. Select Open referenced object > Transformation.

train model

R Script Executor

  1. Open the rscrpt-train_gbm step.

  2. On the Configure tab, set:

    • Input frames: sv-convert_booleans_to_numbers

    • R frame name: train

  1. Set Row handling > Number of rows to process to All.

  2. On the R script tab, paste this script:

circle-info

This step writes the model artifact to:

/home/pentaho/Workshop--Data-Integration/Labs/Module 7 - Use Cases/Machine Learning/Credit Card Fraud/solution/train_model_output/gbm_fraud.rdata

Last updated

Was this helpful?