Coding Decision Trees

作者:C.S. Wallace, J.D. Patrick

摘要

Quinlan and Rivest have suggested a decision-tree inference method using the Minimum Description Length idea. We show that there is an error in their derivation of message lengths, which fortunately has no effect on the final inference. We further suggest two improvements to their coding techniques, one removing an inefficiency in the description of non-binary trees, and one improving the coding of leaves. We argue that these improvements are superior to similarly motivated proposals in the original paper.

论文关键词:decision trees, supervised learning, minimum message length, minimum description length, information theory

论文评审过程:

论文官网地址:https://doi.org/10.1023/A:1022646101185