Ask a question from expert

Ask now

Assignment Data Mining - XL Miner

10 Pages1765 Words134 Views
   

Added on  2020-03-23

Assignment Data Mining - XL Miner

   Added on 2020-03-23

BookmarkShareRelated Documents
Data Mining[Pick the date]Student id
Assignment Data Mining - XL Miner_1
Question 1 Association Rule Output generated by XLMiner (i)Interpretation of initial three rules (Liebowitz, 2015) Association rule 1 – There is 1 conditional probability that if brushes are purchased, then nailpolish would also be purchased by an individual. Association rule 2 - There is 0.6322 conditional probability that if nail polish is purchased,then brushes would also be purchased by an individual.1
Assignment Data Mining - XL Miner_2
Association rule 3 - There is 0.5919 conditional probability that if nail polish is purchased,then bronzer would also be purchased by an individual.(ii)A rule may be redundant in associated with the ancestor i.e. the rule that precedes it. Onlythe rule that precedes is taken into consideration. This is on account of the various rulesbeing arranged in the decreasing order of support which in turn is captured by the lift ratio.Hence, redundancy occurs when the support level seen in the redundant rule is as expectedby the ancestor rule. An apt example of this is found in the output that is attached in part(i).On close observation, it is apparent that the support of both the rule 2 and the ancestor rule1 is exactly the same as underlined from the lift ratio. However, this support level observedfor rule 2 was already predicted by rule 1 thus making rule 2 redundant. Similarredundancy situation occurs for Rule 16 and Rule 17. In case of association rules,redundancy is commonly observed and hence elimination of redundant rules is done so asto enhance the overall utility of the rules (Shumueli, et. al., 2016).The utility of the rules derived from the above output are assessed on the basis of thesupport (denoted by lift ratio) along with the confidence (denoted by confidence level).The support indicates the importance of the underlying rule while the confidence level isan indicator of the underlying conditional probability. A careful and delicate balancebetween the two is expected. If the support level desired is too high, then any relation thattends to exist between the relatively rare items would be ignored owing to the lack of theminimum support transactions. Further, a low support level definition would lead toidentification of too many rules with many of these would have not any meaningful utilityin relation to expression of customer behavior (Homg, Kuo & Chi, 1999). (iii)In this case, the minimum confidence % has increased to 75 from the earlier value of 50. Output generated by XLMiner 2
Assignment Data Mining - XL Miner_3

End of preview

Want to access all the pages? Upload your documents or become a member.

Related Documents
Assignment of Data Mining Techniques
|10
|1279
|108

Data Mining Assignment PDF
|9
|840
|46

Business Case Analysis Association
|11
|1296
|45

Data Mining Assignment XL Miner
|9
|1504
|67

Association Rules (Relevant Output)
|11
|978
|303

Data Mining and Redundant Rule Assignment
|9
|993
|143