Extracting qualitative relations from categorical data

作者:

摘要

Qualitative modeling is traditionally concerned with the abstraction of numerical data. In numerical domains, partial derivatives describe the relation between the independent and dependent variable; qualitatively, they tell us the trend of the dependent variable. In this paper, we address the problem of extracting qualitative relations in categorical domains. We generalize the notion of partial derivative by defining the probabilistic discrete qualitative partial derivative (PDQ PD). PDQ PD is a qualitative relation between the target class c and the discrete attribute; the derivative corresponds to ordering the attribute's values, ai, by P(c|ai) in a local neighborhood of the reference point, respecting the ceteris paribus principle. We present an algorithm for computation of PDQ PD from labeled attribute-based training data. Machine learning algorithms can then be used to induce models that explain the influence of the attribute's values on the target class in different subspaces of the attribute space.

论文关键词:Qualitative modeling,Machine learning,Ceteris paribus

论文评审过程:Received 4 September 2014, Revised 23 June 2016, Accepted 28 June 2016, Available online 5 July 2016, Version of Record 16 August 2016.

论文官网地址:https://doi.org/10.1016/j.artint.2016.06.007