logo

Assignment- Data Mining and Visualization

4 Pages1026 Words46 Views
   

Added on  2020-04-01

Assignment- Data Mining and Visualization

   Added on 2020-04-01

ShareRelated Documents
DATA MININGSTUDENT ID:[Pick the date]
Assignment- Data Mining and Visualization_1
DATA MININGQuestion 1 (Association Rules)The XL Miner has been used in order to obtain the output for Association rules. The list of rules obtained considering 100 as the minimum transactions is illustrated as follows.i)The three vital rules that are obtained from the XL Miner output referred to above are as shown below.Rule No. 1- The event of brush purchase is followed by the nail polish purchase. Also, theconfidence linked to this rule is 100%. This essentially refers to the conditional probability of1 and hence indicates a certain event.Rule No. 2- The event of nail polish purchase is followed by the brush purchase. Also, theconfidence linked to this rule is 63.22%. This essentially refers to the conditional probabilityof 0.6322.Rule No. 3- The event of nail polish purchase is followed by the bronzer purchase. Also, theconfidence linked to this rule is 59.19%. This essentially refers to the conditional probabilityof 0.5919.ii)Definition: A given rule would be considered as redundant only in the context of another rule iffor every dataset, the support and corresponding level is at minimum equal to the latter rule. Inthe context of first couple of dozen rules, redundancy would be observed for rule no.16 and ruleno. 17. A redundancy situation is also visible for rule no. 2 but the confidence level tends to showvariation which is apparent from the above output.The rules have limited utility in the sense that various patterns of purchase may be decipheredusing corresponding conditional probabilities and the same can then be used to derivemeaningful conclusions in the wake of the various theoretical framework and literature review inplace for the same. Also, at times, these rules extend support in extending the validity of thepattern indicated which eventually could lead to useful information being derived.iii)In the given case, the confidence level which is increased from 50% to 75% tends to causereduction of rules. Contrary to the earlier case where rules have a confidence level in excess of50% were listed, in this case, only the rules which would have a confidence level greater than75% would be listed in the XL Miner Output. Hence, only one rule which has a confidence level of100% is stated.
Assignment- Data Mining and Visualization_2

End of preview

Want to access all the pages? Upload your documents or become a member.

Related Documents
COMP3340 - Data Mining Study
|4
|991
|208

Data Mining Assignment -Dendrogram
|9
|973
|68

Rules of Association for XL Miner Output
|7
|739
|219

Assignment of Data Mining Techniques
|10
|1279
|108

Assignment of Data Mining
|7
|706
|46

Data Mining Association Rules & Clustering
|9
|951
|62