ANL307e - Performance and Robustness Evaluation of Logistic Regression
VerifiedAdded on 2023/04/20
|3
|470
|407
Homework Assignment
AI Summary
This assignment solution discusses the evaluation of the performance and robustness of a logistic regression model applied to a breast cancer dataset from the UCI Machine Learning Repository. The logistic regression model, implemented using the 1010data function g_logreg(G;S;Y;XX;Z), predicts whether a cancer is benign or malignant based on variables such as ID number, diagnosis, radius-mean, texture-mean, and perimeter-mean. The dataset is divided into training (90%) and testing (10%) sets, and dummy variables are created for categorical columns. The model's performance is assessed using functions like score(XX;M;Z) to predict the probability of benign cancer and param(M;P;I) to obtain model coefficients. The logit of the predicted probability is calculated for visualization, and fit statistics are extracted. References to Hosmer & Lemeshow (2000) and Long et al. (2006) are included.
1 out of 3






