Data Mining Assignment: ARFF File Creation and Analysis

Verified

Added on  2023/06/05

|12
|968
|236
Homework Assignment
AI Summary
This data mining assignment delves into the crucial aspects of data mining, encompassing security, privacy, and ethical considerations within the field. The assignment explores the implications of data mining for business sectors, analyzing its impact on decision-making and forecasting. Furthermore, it provides practical exercises, including ARFF file creation, histogram generation, and the application of unsupervised discretization and missing value replacement filters using the Weka tool. The solution also references relevant literature and articles, offering a comprehensive understanding of the subject matter. The assignment covers both theoretical and practical aspects of data mining, making it a valuable resource for students studying data science and related fields. Desklib provides a platform for accessing similar assignments and study materials.
Document Page
Data mining and
visualization
tabler-icon-diamond-filled.svg

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
Table of Contents
Part – 2 Security, Privacy and Ethics in Data Mining......................................................................2
Question - 1......................................................................................................................................2
Question – 2......................................................................................................................................2
Part – 3 Practical.................................................................................................................................3
ARFF File Creation.........................................................................................................................3
Histograms.......................................................................................................................................6
Unsupervised Discretize filter.........................................................................................................9
Replace Missing Values.................................................................................................................10
References..........................................................................................................................................11
1
Document Page
Part – 2 Security, Privacy and Ethics in Data Mining
Question - 1
Security and Privacy Implications of Data Mining
Data mining, with its guarantee to effectively find the significant data about patients
from a lot of data, is progressively seen as an innovative leap forward in the continuous
progress of well being IT. However numerous protection and security specialists see data
mining as the most essential test that customers will look in the following decade. The
particular test in data mining is creating exact models for investigation of information without
giving access to the exact data in particular patient records. Protection concerns related with
the revelation of data are mounting. Presently, particular enactment to manage the protection
and security worries of data mining has been constrained (Coughlin & Dawson, 2014).
Ethical Implications of Data Mining
Ethical implications for organizations utilizing data mining are not quite the same as
legitimate complications. The whole innovation can't be viewed as great or horrible since it
has numerous helpful favourable circumstances for the general population great as well.
Another location of concern is the moral utilization of data mining functions in the human
services industry ("10 ethical issues confront IT managers", 2018). Understanding facts is
required with the aid of law to be assembled simply with end assent by using the patient.
What's more, such information can be gotten to the utilized through seem into organizations
absolutely after several tiers of security checks. The answer for the changed types of ethical
worries of information digging by organizations is for organizations to keep up straight
forwardness in mining information and being responsible for any ruptures of security. They
should be proactive in a executing the over two viewpoints keeping in mind the end goal to
improve clients that their own information isn't being abused and that the information is
secure.
2
Document Page
Question – 2
Evaluate how significant these implications are for the business sector
The research work focused on the implication of the business sectors significant
impact through the administered questionnaires to identifiers of selected business
Organizations and inhabitants of senatorial district that constitute Lagos metropolis here is
symbolic relationship between business sectors. The strategic decision making requires a
future orientation. Naturally, forecasting is an essential element of the business records
analysis. Forecasting is concerned with developing plausible production of the direction,
scope and intensity of the business change. The work has equally placed business factors in
the fore front of business survival and growth thus enlightening that the success of any
business organisation is contingent on understanding the factors. The framework are used to
track the records of the document identifiers all need to be kept up and the accredited faculty
the frameworks and to be utilized of the standard of saving quality of the document which
used for the business sectors. ("Privacy and Security Concerns in Data Mining", 2018).
Business is affected by different factors which collectively form the business sectors.
These include the social, legal, technological and the specific administration of the advanced
in innovation of the business sector. By and by, one of our subjects is that advances in
innovation, an awful lot the equal as advances in some different territory of undertaking, can
produce societal modifications that must make us rethink our conduct. The dynamic concept
of human development implies a few segments of ethical codes that were splendidly
acceptable in past a while may additionally by no means again apply. In spite of the reality
that space limits us to 10 issues, the ones we check out right here rely on five essential classes
instructions exceedingly compelling to technologists: protection, proprietorship, control,
precision, and security.
Part – 3 Practical
ARFF File Creation
To create the ARFF file for provided data set by using the following steps(Witten, Frank,
Hall & Pal, 2017).
1. First the provided data on the Excel and that file is saved as CSV format. It is shown
below.
3
tabler-icon-diamond-filled.svg

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
2. After, open Weka tool.
Then, open the CSV file.
4
Document Page
5
Document Page
Finally, click saves to save the file as ARFF format. It is shown below.
Histograms
For Assignment– 1
For Assignment– 2
6
tabler-icon-diamond-filled.svg

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
For Assignment – 3
For Assignment – 4
7
Document Page
For Final Result
8
Document Page
Unsupervised Discretized filter
Choose Filter--- unsupervised----Discretized filter.
9
tabler-icon-diamond-filled.svg

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
Replace Missing Values
Choose Filter--- unsupervised---- Replace missing values
10
Document Page
References
10 ethical issues confronting IT managers. (2018). Retrieved from
https://www.techrepublic.com/article/10-ethical-issues-confronting-it-managers/
Coughlin, S., & Dawson, A. (2014). Ethical, Legal and Social Issues in Exposomics: A Call
for Research Investment. Public Health Ethics, 7(3), 207-210. doi: 10.1093/phe/phu031
Privacy and Security Concerns in Data Mining. (2018). Retrieved from
https://www.himss.org/news/privacy-and-security-concerns-data-mining
Witten, I., Frank, E., Hall, M., & Pal, C. (2017). Data mining. Amsterdam: Morgan
Kaufmann.
11
chevron_up_icon
1 out of 12
circle_padding
hide_on_mobile
zoom_out_icon
[object Object]