A scalable decision-tree-based method to explain interactions in dyadic data

作者:

Highlights:

• Scalable method that obtains an easily interpretable high-level summary of the relationship between entities on dyadic data.

• Approach based on the entropy of value of the learnt utility function.

• Increased accuracy and model interpretability with respect to alternatives.

• Meaningful and actionable insights retrieved from dyadic data.

摘要

Gaining relevant insight from a dyadic dataset, which describes interactions between two entities, is an open problem that has sparked the interest of researchers and industry data scientists alike. However, the existing methods have poor explainability, a quality that is becoming essential in certain applications. We describe an explainable and scalable method that, operating on dyadic datasets, obtains an easily interpretable high-level summary of the relationship between entities. To do this, we propose a quality measure, which can be configured to a level that suits the user, that factors in the explainability of the model. We report experiments that confirm better results for the proposed method over alternatives, in terms of both explainability and accuracy. We also analyse the method's capacity to extract relevant actionable information and to handle large datasets.

论文关键词:Dyadic data,Machine learning,Interpretable machine learning,Explainable artificial intelligence,Scalable machine learning

论文评审过程:Received 23 April 2019, Revised 14 August 2019, Accepted 19 August 2019, Available online 27 August 2019, Version of Record 15 November 2019.

论文官网地址:https://doi.org/10.1016/j.dss.2019.113141