Robust Neighborhood Covering Reduction with Determinantal Point Process sampling

作者:

Highlights:

摘要

Neighborhood Covering Reduction (NCR) methods are too rigid to filter out redundant neighborhoods and sensitive to noise. To tackle this problem, we propose a flexible neighborhood reduction method based on Determinantal Point Process (DPP) sampling in this paper. Sampling from a DPP, the probabilities of subset selection are computed from the correlation matrix of items and the subsets of diverse items will be assigned high probabilities. We model the process of neighborhood selection as a DPP and thereby implement a NCR algorithm with DPP neighborhood sampling (NCRDPP). NCRDPP selects significant neighborhoods of both high quality and diversity to form concise coverings as the approximations of data distributions, which facilitate the model generalization and robustness of neighborhood-based learning. Experimental results verify the superiority of DPP sampling for neighborhood selection and the robustness of NCRDPP method for noisy data classification.

论文关键词:Neighborhood covering reduction,Determinantal point process

论文评审过程:Received 27 August 2018, Revised 20 September 2019, Accepted 22 September 2019, Available online 24 September 2019, Version of Record 20 January 2020.

论文官网地址:https://doi.org/10.1016/j.knosys.2019.105063