University of Hertfordshire Data Classification Project 7COM1073
VerifiedAdded on 2022/09/02
|10
|1545
|17
Project
AI Summary
This project focuses on data classification for software defect prediction using machine learning techniques. The student implemented and evaluated a Naive Bayes classification model on a provided dataset containing static code metrics. The analysis involved data preprocessing, including handling of missing values and feature scaling. The project explored Principal Component Analysis (PCA) for dimensionality reduction and visualization of the data. The student built and evaluated multiple Naive Bayes models with different feature sets. The results, including accuracy, classification reports, and confusion matrices, were presented and discussed. The project concluded with an analysis of the model's performance and suggestions for future improvements, such as exploring other machine learning models and addressing data outliers. The student used Python with libraries like NumPy, Pandas, Matplotlib, Seaborn, and Scikit-learn to perform the analysis and present the findings. The goal was to assess the feasibility of an automated software defect prediction system.
Contribute Materials
Your contribution can guide someone’s learning journey. Share your
documents today.
1 out of 10