Efficient redundancy reduced subgroup discovery via quadratic programming

作者:Rui Li, Robert Perneczky, Alexander Drzezga, Stefan Kramer

摘要

Subgroup discovery is a task at the intersection of predictive and descriptive induction, aiming at identifying subgroups that have the most unusual statistical (distributional) characteristics with respect to a property of interest. Although a great deal of work has been devoted to the topic, one remaining problem concerns the redundancy of subgroup descriptions, which often effectively convey very similar information. In this paper, we propose a quadratic programming based approach to reduce the amount of redundancy in the subgroup rules. Experimental results on 12 datasets show that the resulting subgroups are in fact less redundant compared to standard methods. In addition, our experiments show that the computational costs are significantly lower than the costs of other methods compared in the paper.

论文关键词:Subgroup discovery, Mutual information, Quadratic programming, Rule learning, Redundancy

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10844-013-0284-1