logo

Enterprise Business Intelligence: Data Mining and Machine Learning for Business Intelligence

   

Added on  2023-06-05

69 Pages5378 Words174 Views
 | 
 | 
 | 
Enterprise Business Intelligence
Abstract
This project is designed to provide the good opportunity to use the data mining and
machine learning method in discovering knowledge from a dataset and explore the
Enterprise Business Intelligence: Data Mining and Machine Learning for Business Intelligence_1

applications for business intelligence. This project analysis the health news dataset to explore
the weka data mining applications. This project divided in to 10 practical. In First practical,
we are going to install the weka software and download the Data repository. In Second
practical, this practical is used to do data and data pre-processing for provided data set. In
third practical, this practical is used for data visualization and dimension reduction. In fourth
practical, this practical is used to do the clustering algorithm like K-Means. In Fifth practical,
this task is used to do supervised mining that is classification algorithm on weka. In Sixth
practical, this practical is used to do performance evaluation on weka experimenter and
Knowledge Flow. In seventh practical, this practical is used for predicting the time series on
weka package manager. In eighth practical, this task is used to do text mining. In final, this
practical is used to do the image analytics on weka. These are will be analysed and
demonstrated in detail.
Table of Contents
1 Introduction......................................................................................................................3
2 Data set..............................................................................................................................3
1
Enterprise Business Intelligence: Data Mining and Machine Learning for Business Intelligence_2

3 Data mining Techniques..................................................................................................4
4 Evaluation and Demonstration.......................................................................................4
4.1 Practical – 1................................................................................................................5
4.2 Practical – 2................................................................................................................6
4.3 Practical – 3................................................................................................................9
4.3.1 Visualising the Dataset.......................................................................................9
4.3.2 Visualising the Dataset using Classifiers........................................................15
4.4 Practical – 4..............................................................................................................21
4.4.1 Manually Working with K-Means..................................................................21
4.4.2 Unsupervised Learning in WEKA – Clustering............................................24
4.5 Practical – 5..............................................................................................................26
4.6 Practical – 6..............................................................................................................33
4.6.1 Weka Experimenter.........................................................................................33
4.6.2 Weka Knowledge Flow....................................................................................37
4.7 Practical – 7..............................................................................................................43
4.8 Practical – 8..............................................................................................................56
4.8.1 Training the Classifier Model.........................................................................56
4.8.2 Predict the Class in Test..................................................................................58
4.9 Practical – 9..............................................................................................................61
5 Conclusion.......................................................................................................................65
References...............................................................................................................................66
1 Introduction
This project is designed to provide the good opportunity to use the data mining and
machine learning method in discovering knowledge from a dataset and explore the
2
Enterprise Business Intelligence: Data Mining and Machine Learning for Business Intelligence_3

applications for business intelligence. This project analysis the health news dataset to explore
the weka data mining applications. This project divided in to 10 practical. In First practical,
we are going to install the weka software and download the Data repository. In Second
practical, this practical is used to do data and data pre-processing for provided data set. In
third practical, this practical is used for data visualization and dimension reduction. In fourth
practical, this practical is used to do the clustering algorithm like K-Means. This practical
divided into two parts such as Part 1 and part 2. The part 1 is manually calculated the K-
means for provided data set. The part 2 is to use weka clustering algorithm to calculate the K-
Means. In Fifth practical, this task is used to do supervised mining that is classification
algorithm on weka. In Sixth practical, this practical is used to do performance evaluation on
weka experimenter and Knowledge Flow. In seventh practical, this practical is used for
predicting the time series on weka package manager. In eighth practical, this task is used to
do text mining. In final, this practical is used to do the image analytics on weka. These are
will be analysed and demonstrated in detail.
2 Data set
Each record is identified with one Twitter record of a news office. For instance,
bbchealth.txt is identified with BBC wellbeing news. Each line contains tweet id | date and
time | tweet. The separator is '|'. This content information has been utilized to assess the
execution of point models on short content information. Be that as it may, it very well may be
utilized for different assignments, for example, clustering.
3 Data mining Techniques
Data mining techniques is illustrated as below.
3
Enterprise Business Intelligence: Data Mining and Machine Learning for Business Intelligence_4

4 Evaluation and Demonstration
4.1 Practical – 1
This task, we are going to install the weka software and download the Data repository.
First, user needs to download and install the weka software.
4
Enterprise Business Intelligence: Data Mining and Machine Learning for Business Intelligence_5

Once weka installed successfully, user requires the open the weka software. It is illustrated as
below.
After, download the UCI data repository by using the provided link and it is illustrated as
below.
https://archive.ics.uci.edu/ml/datasets/Health+News+in+Twitter
After, click the data folder
Then, click the Health_News_Tweets.zip file to download the data set. The download data set
is attached below.
5
Enterprise Business Intelligence: Data Mining and Machine Learning for Business Intelligence_6

4.2 Practical – 2
This practical is used to do data and data pre-processing for provided data set. To do the
data pre-processing on Weka by follows the below steps.
First, user needs to open the weka.
6
Enterprise Business Intelligence: Data Mining and Machine Learning for Business Intelligence_7

After, click the Explorer to load the data set.
Once data is load successfully, the data pre-processing process is completed. It is shown
below.
7
Enterprise Business Intelligence: Data Mining and Machine Learning for Business Intelligence_8

End of preview

Want to access all the pages? Upload your documents or become a member.

Related Documents