Predicting Credit Card Balance: A CRISP-DM Data Mining Approach
VerifiedAdded on 2023/05/30
|19
|984
|153
Project
AI Summary
This project demonstrates the application of the CRISP-DM methodology to predict credit card balances using a simulated dataset of 10,000 customers. The project covers the initial phases of CRISP-DM, including Business Understanding, which defines the objectives from both business and technical perspectives, and Data Understanding, which involves data collection, exploration, and analysis of variable types, missing values, and extreme values. The analysis includes univariate analysis of the outcome variable and bivariate analysis of categorical and numerical variables using boxplots and scatterplots, respectively. A correlation matrix is used to identify variables with a strong relationship to the outcome variable. Finally, the project includes Model Building, where four different regression models are run in ML Studio, and the best model is selected based on the evaluation outputs, with the linear regression model being recommended for its suitability when the relationship between covariates and the response variable is linear. The regression equation for the selected model is also provided.
Contribute Materials
Your contribution can guide someone’s learning journey. Share your
documents today.
1 out of 19