Histogram distance-based Bayesian Network structure learning: A supervised classification specific approach

作者:

Highlights:

摘要

In this work we introduce a methodology based on histogram distances for the automatic induction of Bayesian Networks (BN) from a file containing cases and variables related to a supervised classification problem. The main idea consists of learning the Bayesian Network structure for classification purposes taking into account the classification itself, by comparing the class distribution histogram distances obtained by the Bayesian Network after classifying each case. The structure is learned by applying eight different measures or metrics: the Cooper and Herskovits metric for a general Bayesian Network and seven different statistical distances between pairs of histograms.The results obtained confirm the hypothesis of the authors about the convenience of having a BN structure learning method which takes into account the existence of the special variable (the one corresponding to the class) in supervised classification problems.

论文关键词:Bayesian Network,Histogram distance,Supervised classification,Machine learning,Structure learning

论文评审过程:Received 16 May 2008, Revised 12 June 2009, Accepted 22 July 2009, Available online 6 August 2009.

论文官网地址:https://doi.org/10.1016/j.dss.2009.07.010