Fast feature selection aimed at high-dimensional data via hybrid-sequential-ranked searches

作者:

Highlights:

摘要

We address the feature subset selection problem for classification tasks. We examine the performance of two hybrid strategies that directly search on a ranked list of features and compare them with two widely used algorithms, the fast correlation based filter (FCBF) and sequential forward selection (SFS). The proposed hybrid approaches provide the possibility of efficiently applying any subset evaluator, with a wrapper model included, to large and high-dimensional domains. The experiments performed show that our two strategies are competitive and can select a small subset of features without degrading the classification error or the advantages of the strategies under study.

论文关键词:Feature selection,Feature ranking,Classification,Data mining

论文评审过程:Available online 1 April 2012.

论文官网地址:https://doi.org/10.1016/j.eswa.2012.03.061