December 9, 2021 1
What Is Association Mining?
Association Rule: Basic Concepts
Rule Measures: Support and Confidence
[Figure: Venn diagram with regions labeled "Customer buys diaper" and "Customer buys both"]
Find all the rules X & Y ⇒ Z with minimum confidence and support
support, s, probability that a
brands of diapers?
Mining Association Rules—An Example
The Apriori Algorithm
Join Step: $C_k$ is generated by joining $L_{k-1}$ with itself
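The join step can be sketched in a few lines of Python. This is only an illustrative sketch, not the slides' own code: the function name `apriori_join` and the representation of itemsets as sorted tuples are assumptions, and it uses the common convention of joining two itemsets when they agree on everything except their last item. The input is the L2 from the example later in these slides.

```python
def apriori_join(prev_frequent):
    """Join L_{k-1} with itself to build the candidate set C_k.

    Assumption: each itemset is a sorted tuple, and two itemsets are
    joined when they share the same first k-2 items.
    """
    prev = sorted(prev_frequent)
    candidates = []
    for i in range(len(prev)):
        for j in range(i + 1, len(prev)):
            a, b = prev[i], prev[j]
            if a[:-1] == b[:-1]:
                # a < b and the prefixes match, so a[-1] < b[-1]
                # and the joined tuple stays sorted.
                candidates.append(a + (b[-1],))
    return candidates


L2 = [(1, 3), (2, 3), (2, 5), (3, 5)]
print(apriori_join(L2))  # [(2, 3, 5)]
```

Only (2, 3) and (2, 5) share a first item, so the join yields the single candidate (2, 3, 5).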
The Apriori Algorithm — Example
Database D
TID   Items
100   1 3 4
200   2 3 5
300   1 2 3 5
400   2 5

Scan D: C1
itemset   sup.
{1}       2
{2}       3
{3}       3
{4}       1
{5}       3

L1
itemset   sup.
{1}       2
{2}       3
{3}       3
{5}       3

C2
itemset
{1 2}
{1 3}
{1 5}
{2 3}
{2 5}
{3 5}

Scan D: C2
itemset   sup
{1 2}     1
{1 3}     2
{1 5}     1
{2 3}     2
{2 5}     3
{3 5}     2

L2
itemset   sup
{1 3}     2
{2 3}     2
{2 5}     3
{3 5}     2

C3
itemset
{2 3 5}

Scan D: L3
itemset   sup
{2 3 5}   2
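The example can be reproduced with a short level-wise loop. This is a minimal sketch under stated assumptions, not the slides' own implementation: the helper names (`count_support`, `join`, `apriori`) are hypothetical, the minimum support count of 2 is inferred from which itemsets survive in the tables, and the candidate step is a plain join with no further pruning.

```python
def count_support(transactions, candidates):
    """Scan D once and count how many transactions contain each candidate."""
    return {c: sum(1 for t in transactions if set(c) <= t) for c in candidates}


def join(prev_frequent):
    """Join sorted-tuple itemsets that agree on all but their last item."""
    prev = sorted(prev_frequent)
    return [a + (b[-1],)
            for i, a in enumerate(prev)
            for b in prev[i + 1:]
            if a[:-1] == b[:-1]]


def apriori(transactions, min_count):
    """Return [L1, L2, ...], each a dict mapping itemset -> support count."""
    items = sorted({item for t in transactions for item in t})
    candidates = [(item,) for item in items]  # C1
    levels = []
    while candidates:
        counts = count_support(transactions, candidates)
        frequent = {c: n for c, n in counts.items() if n >= min_count}
        if not frequent:
            break
        levels.append(frequent)
        candidates = join(frequent)  # C_{k+1} from L_k
    return levels


# Database D from the example (TID 100, 200, 300, 400).
D = [{1, 3, 4}, {2, 3, 5}, {1, 2, 3, 5}, {2, 5}]
levels = apriori(D, min_count=2)  # min_count=2 is an inferred assumption
for k, level in enumerate(levels, start=1):
    print(f"L{k}:", level)
```

With these inputs the loop stops after three levels, ending with {2 3 5} at support count 2, which matches the tables.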
Interestingness Measurements
Objective measures
Two popular measurements:
support; and
confidence
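Both measures can be computed directly from a transaction list. The sketch below assumes the usual definitions (support as the fraction of transactions containing every item of the rule, confidence as that support divided by the support of the left-hand side). The function names are hypothetical, and the data are the four transactions of Database D from the Apriori example in these slides.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(1 for t in transactions if itemset <= t) / len(transactions)


def confidence(transactions, lhs, rhs):
    """Support of lhs and rhs together, divided by the support of lhs."""
    return support(transactions, set(lhs) | set(rhs)) / support(transactions, lhs)


D = [{1, 3, 4}, {2, 3, 5}, {1, 2, 3, 5}, {2, 5}]

# Rule {2} => {5}: items 2 and 5 appear together in 3 of 4 transactions,
# and every transaction containing 2 also contains 5.
print(support(D, {2, 5}))       # 0.75
print(confidence(D, {2}, {5}))  # 1.0
```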
Criticism of Support and Confidence
Example 1: (Aggarwal & Yu, PODS98)
Among 5000 students
3000 play basketball
3750 eat cereal
2000 both play basketball and eat cereal
play basketball ⇒ eat cereal [40%, 66.7%] is misleading, because 75% of all students eat cereal (3750 of 5000), which is higher than 66.7%
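The figures in this example can be checked with a few lines of arithmetic. The sketch below uses only the counts given above; the variable names are illustrative.

```python
total = 5000        # students surveyed
basketball = 3000   # play basketball
cereal = 3750       # eat cereal
both = 2000         # play basketball and eat cereal

support = both / total          # P(basketball and cereal)
confidence = both / basketball  # P(cereal | basketball)
baseline = cereal / total       # P(cereal) among all students

print(f"support    = {support:.1%}")     # 40.0%
print(f"confidence = {confidence:.1%}")  # 66.7%
print(f"P(cereal)  = {baseline:.1%}")    # 75.0%

# The rule's confidence is below the share of cereal eaters overall,
# so playing basketball makes eating cereal less likely, not more.
print(confidence < baseline)  # True
```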
Summary