Group Project: Business Analysis of Boston Housing Dataset - NIT3171

Verified

Added on  2023/04/25

|11
|757
|94
Project
AI Summary
This group project focuses on analyzing the Boston housing dataset to provide business solutions, assuming a business analyst role in a real estate consulting firm. The project involves understanding the dataset, discovering relationships among features using normalization techniques, and proposing potential business analysis tasks. The analysis employs classification and prediction algorithms, including ZeroR and multi-scheme classification, to address business problems and offer solutions with benefits like improved business processes and support for decision-making and strategy development. The project provides justifications for the chosen tasks, evaluates the performance of different algorithms, and discusses the significance of error rates in determining the suitability of algorithms for data analysis. The provided solution includes details about the ZeroR classifier, its algorithm, and its performance compared to other classifiers, as well as references to relevant literature.
Document Page
Business Analysis & Data
Visualization Group
Assignment
tabler-icon-diamond-filled.svg

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
Project Description
Main objective of this project is to use the Boston housing
dataset to apply the data mining techniques to resolve a business
problem.
Analysis the provided data set to provide the suitable business
solutions by using the Weka data mining tool.
To analysis the provided data by review the current,
methodologies and algorithms for business analytics.
Document Page
Understand the dataset
Analysis the provided data set, first user needs to understand the data
set. The provided Boston housing dataset is described as below.
Document Page
Relationships discovery among
features
In this task, user needs to discover the relationships existed
among all the attributes.
Here, we are applying the normalization techniques to discover
the relationships among all the attributes in the Boston Housing
data.
The normalization technique is used to remove the duplicates in
the data (Azzalini & Scarpa, 2012).
tabler-icon-diamond-filled.svg

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
Business Analysis Tasks With
Your Justification
In this task, user requires to list the potential business analysis for a
provided data set. Here, we are using the classification and prediction
algorithm to resolve the business problem. And, also provide the
effective solutions for that problem. The effective results is used to
provides the following benefits for real estate consulting firm such as,
Business benefits
Improve the business process
Support decision making
Support strategy development.
Document Page
ZeroR Classification
1. ZeroR is the most straightforward classification methods which
depends on the objective and predicts all Predictors.
2. ZeroR classifier essentially predicts the category which is class
(Witten, Frank & Hall, 2011).
3. Despite the fact that there is no consistency control in ZeroR, it
is helpful for deciding a standard execution as a benchmark for
other classification methods.
4. Algorithm Construct a recurrence table for the objective and
select it is most regular value.
Document Page
ZeroR Classification
The ZeroR algorithm predicts the mean Boston House class values is
180921.19589041095. It must achieve an RMSE better than this value. The
ZeroR algorithm predicts the tested negative value for all instances as it is
the majority class, and achieves an accuracy of 82 % (Kaluža, 2013).
tabler-icon-diamond-filled.svg

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
Multi Scheme Classification
Document Page
Justification
1. In light of the above tables and figures, we can obviously observe that for the
Boston Housing data most significant accuracy is 100% and the least is 17.94
%. The other algorithm yields a normal accuracy of around 85%. In fact, the
most important accuracy has a place with the Multi scheme classifier. ZeroR
Classifier present at the base of the outline with percentage around 100%.
2. A normal of 1198 instances out of absolute 1460 instances is observed to be
effectively characterized with most elevated score of 262 occurrences
contrasted with 1460 instances, which is the least score (Maimon & Rokach,
2010). The total time required to build the model is likewise a basic parameter
in contrasting the classification algorithm. It is regular to recognize the
reliability quality of the data gathered and their legality.
Document Page
Justification
This analysis suggests a normally utilized pointer which is mean of
supreme errors and root mean squared errors. Then again, the
relative errors are additionally utilized.
It is found that the most important error is found in ZeroR
Classifier with a normal score of around 0.821.
A algorithm which has a lower error percentage will be favoured as
it has all the more powerful classification capability, so after
investigation we can say that ZeroR algorithm isn't appropriate for
a Data since it has most extreme number of errors and can't classify
the data effectively (Olson, 2017).
tabler-icon-diamond-filled.svg

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
References
Kaluža, B. (2013). Instant Weka how-to. Birmingham: Packt
Pub.
Maimon, O., & Rokach, L. (2010). Data mining and
knowledge discovery handbook. New York: Springer.
Olson, D. (2017). Descriptive Data Mining. Singapore:
Springer Singapore.
Witten, I., Frank, E., & Hall, M. (2011). Data mining.
Burlington, Mass.: Morgan Kaufmann Publishers.
chevron_up_icon
1 out of 11
circle_padding
hide_on_mobile
zoom_out_icon
logo.png

Your All-in-One AI-Powered Toolkit for Academic Success.

Available 24*7 on WhatsApp / Email

[object Object]