Constraint-Based Rule Mining in Large, Dense Databases
Authors: Roberto J. Bayardo Jr, Rakesh Agrawal, Dimitrios Gunopulos
Abstract
Constraint-based rule miners find all rules in a given data set that meet user-specified constraints such as minimum support and confidence. We describe a new algorithm that directly exploits all user-specified constraints, including minimum support, minimum confidence, and a new constraint that ensures every mined rule offers a predictive advantage over any of its simplifications. Our algorithm maintains efficiency even at low supports on data that is dense (e.g., relational tables). Previous approaches such as Apriori and its variants exploit only the minimum support constraint, and as a result are ineffective on dense data due to a combinatorial explosion of “frequent itemsets”.
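To make the three constraints concrete, the following is a minimal brute-force sketch and not the algorithm the paper describes. It assumes that a "simplification" of a rule is the same consequent with any proper subset of the antecedent (including the empty antecedent), and that "predictive advantage" is measured as the smallest gain in confidence over those simplifications. The function names `support`, `confidence`, and `improvement`, and the toy transactions, are illustrative assumptions.

```python
from itertools import combinations

def support(transactions, items):
    """Fraction of transactions that contain every item in `items`."""
    items = frozenset(items)
    return sum(1 for t in transactions if items <= t) / len(transactions)

def confidence(transactions, antecedent, consequent):
    """Fraction of transactions matching the antecedent that also contain the consequent."""
    antecedent = frozenset(antecedent)
    covered = [t for t in transactions if antecedent <= t]
    if not covered:
        return 0.0
    return sum(1 for t in covered if consequent in t) / len(covered)

def improvement(transactions, antecedent, consequent):
    """Smallest confidence gain of the rule over any proper simplification.

    Assumption: a simplification keeps the consequent and uses a proper
    subset of the antecedent. The antecedent must be non-empty.
    """
    antecedent = tuple(antecedent)
    if not antecedent:
        raise ValueError("antecedent must be non-empty")
    conf = confidence(transactions, antecedent, consequent)
    gains = [
        conf - confidence(transactions, subset, consequent)
        for size in range(len(antecedent))          # sizes 0 .. len-1
        for subset in combinations(antecedent, size)
    ]
    return min(gains)

# Hypothetical toy data: each transaction is a set of items.
transactions = [
    frozenset({"a", "b", "c"}),
    frozenset({"a", "b", "c"}),
    frozenset({"a", "c"}),
    frozenset({"a"}),
    frozenset({"b", "c"}),
    frozenset({"b"}),
]

rule_support = support(transactions, {"a", "b", "c"})
rule_confidence = confidence(transactions, ("a", "b"), "c")
rule_improvement = improvement(transactions, ("a", "b"), "c")
```

A miner in this style would keep the rule {a, b} → c only if all three values clear their user-chosen thresholds. Enumerating every subset is exponential in the antecedent size, which is why this is only an illustration of the constraints and not a practical method for dense data.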
Keywords: data mining, association rules, rule induction
Paper URL: https://doi.org/10.1023/A:1009895914772