Bootstrapping rule induction to achieve rule stability and reduction

作者:Lemuel R. Waitman, Douglas H. Fisher, Paul H. King

摘要

Most rule learning systems posit hard decision boundaries for continuous attributes and point estimates of rule accuracy, with no measures of variance, which may seem arbitrary to a domain expert. These hard boundaries/points change with small perturbations to the training data due to algorithm instability. Moreover, rule induction typically produces a large number of rules that must be filtered and interpreted by an analyst. This paper describes a method of combining rules over multiple bootstrap replications of rule induction so as to reduce the total number of rules presented to an analyst, to measure and increase the stability of the rule induction process, and to provide a measure of variance to continuous attribute decision boundaries and accuracy point estimates. A measure of similarity between rules is also introduced as a basis of multidimensional scaling to visualize rule similarity. The method was applied to perioperative data and to the UCI (University of California, Irvine) thyroid dataset.

论文关键词:Decision rules, Rule induction, Bootstrapping, Multidimensional scaling, Perioperative medicine, Rule similarity, Rule visualization, Rule abstraction, Algorithm stability

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10844-006-1626-z