A robust missing value imputation method for noisy data

作者:Bing Zhu, Changzheng He, Panos Liatsis

摘要

Missing data imputation is an important research topic in data mining. The impact of noise is seldom considered in previous works while real-world data often contain much noise. In this paper, we systematically investigate the impact of noise on imputation methods and propose a new imputation approach by introducing the mechanism of Group Method of Data Handling (GMDH) to deal with incomplete data with noise. The performance of four commonly used imputation methods is compared with ours, called RIBG (robust imputation based on GMDH), on nine benchmark datasets. The experimental result demonstrates that noise has a great impact on the effectiveness of imputation techniques and our method RIBG is more robust to noise than the other four imputation methods used as benchmark.

论文关键词:Missing data imputation, Noise, Group method of data handling (GMDH)

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10489-010-0244-1