logo

Data Mining: Definition, Importance, and Problems

9 Pages1578 Words474 Views
   

Added on  2023-04-21

About This Document

This article provides an overview of data mining, including its definition, importance, and problems. It explains how data mining helps in making informed decisions and discusses the elements that need to be considered. The article also explores the challenges faced in data mining, such as poor quality data and confidentiality concerns. Additionally, it highlights the role of a data dictionary in understanding the nature of data and provides insights into the most popular authors and their books based on the number of weeks on the bestseller list.

Data Mining: Definition, Importance, and Problems

   Added on 2023-04-21

ShareRelated Documents
BUSINESS INFORMATION SYSTEMS
Authors name
BUS 105
Name of the professor
Institution name
City/state
Date
Data Mining: Definition, Importance, and Problems_1
Part A
Question 1
Definition of data mining
Data mining is defined as the process that involved sorting large data sets as a way of
identifying and establishing patterns and relationships which are necessary for analytical
problem solving. Through the data mining tools enterprises are able to extrapolate trends into
the future.
Phases of the Cross-Industry Standard Process for Data Mining
Business understanding
This is the primary phase whose, main objective is to understand the objectives and
requirements of a project. This knowledge is thereafter converted into a definition that can fit
data mining (IBM Corporation 2012).
Data understanding
This phase began with data collection then proceeds to activities that enhance familiarity with
the data set. It assists detect subsets of interest that are useful in the hypothesis formulation.
Data preparation
This phase accounts for all the activities that are necessary to construct the final dataset from
the raw data. The tasks here include recording, tabling as well as selection of attributes. Data
transformation and cleaning also forms part of this phase.
Modelling
This stage involved selection and application of a number of models to obtain optimal values.
Evaluation
After building the models, this stage involves evaluation and reviewing the steps which have
been executed so as to create the model (Chapman et al. 2000). This way the application of
the model in solving a business problem is verified.
Deployment
Data Mining: Definition, Importance, and Problems_2
After creating the model there is need to organize the knowledge gained in a way that it can
be useful to the consumers. This phase therefore entails carrying out of the deployment tasks
by the clients.
Question 2
Importance of data mining
Data mining assists in making informed decisions by projecting the future using the past
available data.
Elements of data mining
Below elements need to be put in place when carrying out data mining
Accuracy: a resource is only valuable if the quality aspect is reliable. Hence, when preparing
a tool for data mining it’s important to ensure the sources where the information is to be
gathered are accurate.
For example, a retail trader may be interested in product pricing offered by a number of
competitors. So as to ensure the information gathered is accurate, he/ she needs to consider
season, source and the products of concern.
The tools used to gather the data need to be able to differentiate the nature of data that is to be
collected this way data accuracy is aligned to the harvesting reason.
Relevancy: For information to be useful for decision making, it’s vital that it do consider the
context aspect. In a situation where a simple tool is designed to support harvesting of data it
is possible that it may fail to ensure the context needed is enough to ensure the data source is
relevant. The use of machine learning has assisted bridge the context gap. According to IBM
there are three areas that context need to cover that is industry, data and transfer.
For example; when crafting MySQL, it is not possible for your code to recognise context. A
more sophisticated tool needs to be applied so as to learn about the data context by using the
accumulated data and logical expressions.
Specificity: in the real world there are huge chunks of data that is availed on a daily basis.
When collecting data its therefore vital that the researcher do mine only the vital information
that can be in line with business strategy and the awareness of the industry (Cerami 2018).
Question 3
Data Mining: Definition, Importance, and Problems_3

End of preview

Want to access all the pages? Upload your documents or become a member.

Related Documents
Business Information Systems .
|36
|6730
|179

Business Information Systems
|9
|1736
|235

ISY310 - Business Analysis | Data mining
|10
|2653
|283

Data Mining: A Solution for Business Problems
|7
|1117
|413

Data Mining - Definition, Stages, Advantages and Drawbacks
|7
|1250
|290

B9DA103 Data Mining - CRISP-DM model
|6
|1587
|14