Efficient Rule-Based Attribute-Oriented Induction for Data Mining

Authors: David W. Cheung, H.Y. Hwang, Ada W. Fu, Jiawei Han

Abstract

Data mining has become an important technique with tremendous potential in many commercial and industrial applications. Attribute-oriented induction is a powerful mining technique that has been successfully implemented in the data mining system DBMiner (Han et al. Proc. 1996 Int'l Conf. on Data Mining and Knowledge Discovery (KDD'96), Portland, Oregon, 1996). However, its induction capability is limited by its reliance on unconditional concept generalization. In this paper, we extend concept generalization to rule-based concept hierarchies, which greatly enhances its induction power. When the previously proposed induction algorithm is applied to the more general rule-based case, an induction anomaly occurs that degrades its efficiency. We have developed an efficient algorithm for induction in the rule-based case that avoids the anomaly. Performance studies show that the algorithm is superior to a previously proposed algorithm based on backtracking.
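
To make the distinction between unconditional and rule-based concept generalization concrete, the following is a minimal illustrative sketch, not the algorithm from the paper. The toy relation, the attribute names (`status`, `gpa`), the thresholds, and the function names are all hypothetical assumptions chosen for illustration. It shows only the basic idea of attribute-oriented induction (replace values with higher-level concepts, then merge identical tuples with a count) and does not address the induction anomaly or the backtracking algorithm mentioned above.

```python
from collections import Counter

# Hypothetical toy relation: (status, gpa) tuples. Not data from the paper.
STUDENTS = [
    ("undergrad", 3.6),
    ("undergrad", 3.7),
    ("grad", 3.6),
    ("grad", 3.9),
]


def generalize_unconditional(status, gpa):
    """Unconditional generalization: the GPA value alone picks the concept."""
    return (status, "excellent" if gpa >= 3.5 else "good")


def generalize_rule_based(status, gpa):
    """Rule-based generalization: the concept chosen for GPA also depends
    on another attribute (here, an assumed stricter cutoff for graduates)."""
    threshold = 3.8 if status == "grad" else 3.5
    return (status, "excellent" if gpa >= threshold else "good")


def induce(rows, generalize):
    """Generalize every tuple, then merge identical tuples and keep a count."""
    return Counter(generalize(*row) for row in rows)


if __name__ == "__main__":
    print(induce(STUDENTS, generalize_unconditional))
    print(induce(STUDENTS, generalize_rule_based))
```

Under the assumed rules, the unconditional version maps every tuple's GPA to the same concept, while the rule-based version separates the graduate student with a 3.6 GPA from the one with a 3.9 GPA, which is the kind of added discriminating power the abstract attributes to rule-based concept hierarchies.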

Keywords: data mining, knowledge discovery in databases, rule-based concept generalization, rule-based concept hierarchy, attribute-oriented induction, inductive learning, learning and adaptive systems

Paper URL: https://doi.org/10.1023/A:1008778107391