Predicting missing values with biclustering: A coherence-based approach

作者:

Highlights:

摘要

In this work, a novel biclustering-based approach to data imputation is proposed. This approach is based on the Mean Squared Residue metric, used to evaluate the degree of coherence among objects of a dataset, and presents an algebraic development that allows the modeling of the predictor as a quadratic programming problem. The proposed methodology is positioned in the field of missing data, its theoretical aspects are discussed and artificial and real-case scenarios are simulated to evaluate the performance of the technique. Additionally, relevant properties introduced by the biclustering process are also explored in post-imputation analysis, to highlight other advantages of the proposed methodology, more specifically confidence estimation and interpretability of the imputation process.

论文关键词:Biclustering,Missing data imputation,Knowledge discovery,Quadratic programming

论文评审过程:Received 17 February 2012, Revised 3 August 2012, Accepted 31 October 2012, Available online 8 November 2012.

论文官网地址:https://doi.org/10.1016/j.patcog.2012.10.022