Decision tree induction with a constrained number of leaf nodes

作者:Chia-Chi Wu, Yen-Liang Chen, Yi-Hung Liu, Xiang-Yu Yang

摘要

With the advantages of being easy to understand and efficient to compute, the decision tree method has long been one of the most popular classifiers. Decision trees constructed with existing approaches, however, tend to be huge and complex, and consequently are difficult to use in practical applications. In this study, we deal with the problem of tree complexity by allowing users to specify the number of leaf nodes, and then construct a decision tree that allows maximum classification accuracy with the given number of leaf nodes. A new algorithm, the Size Constrained Decision Tree (SCDT), is proposed with which to construct a decision tree, paying close attention on how to efficiently use the limited number of leaf nodes. Experimental results show that the SCDT method can successfully generate a simpler decision tree and offers better accuracy.

论文关键词:Classification, Data mining, Decision tree, Constraint tree

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10489-016-0785-z