Task 1: Import dataset

In this tutorial, we'll be working with the Loan Prediction dataset from Kaggle. This dataset contains loan application data collected by a financial institution. It includes details about applicants, such as their income, credit history, loan amount, and demographics.

Our objective is to analyze loan application data and build a machine learning model using H2O Driverless AI to predict the loan amount an applicant is likely eligible for.

Let's import the dataset:

  1. Click + ADD DATASET (OR DRAG & DROP).
  2. Select AMAZON S3.
  3. In the Explore Amazon S3 box, enter the following S3 URI:
    s3://data.h2o.ai/DAI-Tutorials/loan_prediction/
  4. Select the following datasets: loan_prediction_train.csv and loan_prediction_test.csv.
  5. Click CLICK TO IMPORT SELECTION.
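As a quick sanity check before or after importing, the full S3 locations of the two files can be spelled out. The snippet below is a minimal sketch using only the Python standard library; it is not part of the Driverless AI workflow, and the helper name `split_s3_uri` is hypothetical. It joins the prefix from step 3 with the file names from step 4 and splits each URI into its bucket and object key. It does not contact S3.

```python
from urllib.parse import urlparse

# Prefix and file names taken from steps 3 and 4 above.
S3_PREFIX = "s3://data.h2o.ai/DAI-Tutorials/loan_prediction/"
FILES = ["loan_prediction_train.csv", "loan_prediction_test.csv"]


def split_s3_uri(uri):
    """Split an s3:// URI into (bucket, key). Hypothetical helper for illustration."""
    parsed = urlparse(uri)
    if parsed.scheme != "s3":
        raise ValueError(f"not an S3 URI: {uri}")
    # The bucket is the network location; the key is the path without its leading slash.
    return parsed.netloc, parsed.path.lstrip("/")


# Full URI for each dataset selected in step 4.
dataset_uris = [S3_PREFIX + name for name in FILES]

for uri in dataset_uris:
    bucket, key = split_s3_uri(uri)
    print(f"bucket={bucket} key={key}")
```

If either file does not appear in the Explore Amazon S3 box, comparing the path shown there against these URIs is one way to spot a typo in the prefix.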

Congratulations, you have successfully imported the training and test datasets into H2O Driverless AI. In Task 2, we will explore the data to understand each column.

