Objective: The classification goal is to predict the likelihood of a liability customer buying personal loans.
- Read the column description and ensure you understand each attribute well
- Study the data distribution in each attribute, share your findings
- Get the target column distribution.
- Split the data into training and test set in the ratio of 70:20 respectively
- Use different classification models to predict the likelihood of a liability customer buying personal loans
- Print the confusion matrix and ROC plots for all the above models
Attribute Information:
-
ID : Customer ID
-
Age : Customer's age in completed years
-
Experience : #years of professional experience
-
Income : Annual income of the customer in dollars
-
ZIP Code : Home Address ZIP code.
-
Family : Family size of the customer
-
CCAvg : Avg. spending on credit cards per month in dollars
-
Education : Education Level.
- Undergrad;
- Graduate;
- Advanced/Professional
-
Mortgage : Value of house mortgage if any, in dollars
-
Personal Loan : Did this customer accept the personal loan offered in the last campaign?
-
Securities Account : Does the customer have a securities account with the bank?
-
CD Account : Does the customer have a certificate of deposit (CD) account with the bank?
-
Online : Does the customer use internet banking facilities?
-
Credit card : Does the customer use a credit card issued by
The following models have given ultimate performace for the given data after scaling and upweighting with test and train scores greater than 99.5%
- Decision Tree (Unpruned)
- Decision Tree (Pruned)
- Bagging
- Gradient Boosing
- Adaboosting
It is interesting to note that none of the aforementioned models has mispredicted false negatives.