Feature assessment and ranking for classification with nonlinear sparse representation and approximate dependence analysis

作者：

Highlights：

• A nonlinear sparse representation method is applied to find salient feature clusters.

• An approximate feature dependence analysis strategy is proposed.

• Salient and interpretable features can be obtained by the proposed method.

摘要

Feature selection has received significant attention in knowledge management and decision support systems in the past decades. In this study, kernel-based sparse representation and feature dependence analysis are integrated into a feature assessment and ranking framework. The proposed method utilizes the advantages of the kernel-based sparse representation technique and of the information theoretic metric to iteratively obtain the salient feature cluster. Then, a novel approximate dependence analysis is applied to further maintain complementarity while eliminating redundancy among the features selected by nonlinear orthogonal matching pursuit (NOMP). This can effectively prevent the significant bias caused by the pairwise correlation analysis for a large-scale feature set. To illustrate the effectiveness of the proposed method, classification experiments are conducted with three representative classifiers, on nine well-known datasets. The experimental results show the superiority of the proposed method compared with the representative information theoretic and model-based methods in classification for data-driven decision support systems.

论文关键词：Feature selection,Dimensionality reduction,Classification,Sparse representation,Dependence analysis

论文评审过程：Received 10 December 2018, Revised 27 April 2019, Accepted 17 May 2019, Available online 31 May 2019, Version of Record 4 July 2019.

论文官网地址：https://doi.org/10.1016/j.dss.2019.05.004