Performance comparison of feature selection and extraction methods with random instance selection

作者：

Highlights：

• We evaluate a framework for reducing run time of big data feature reduction methods.

• This framework selects a small random subset of instances prior to feature reduction.

• We present comprehensive computational experiments using large public datasets.

• Execution time can be reduced by a factor of 90 with minimal impact on performance.

• We provide recommendations on which feature reduction method to use with a classifier.

摘要

•We evaluate a framework for reducing run time of big data feature reduction methods.•This framework selects a small random subset of instances prior to feature reduction.•We present comprehensive computational experiments using large public datasets.•Execution time can be reduced by a factor of 90 with minimal impact on performance.•We provide recommendations on which feature reduction method to use with a classifier.

论文关键词：Explainable artificial intelligence,Dimension reduction,Feature selection,Feature extraction,Instance selection,Data preprocessing

论文评审过程：Received 10 February 2020, Revised 10 December 2020, Accepted 15 April 2021, Available online 23 April 2021, Version of Record 5 May 2021.

论文官网地址：https://doi.org/10.1016/j.eswa.2021.115072