Application of global optimization methods to model and feature selection

作者:

Highlights:

摘要

Many data mining applications involve the task of building a model for predictive classification. The goal of this model is to classify data instances into classes or categories of the same type. The use of variables not related to the classes can reduce the accuracy and reliability of classification or prediction model. Superfluous variables can also increase the costs of building a model particularly on large datasets. The feature selection and hyper-parameters optimization problem can be solved by either an exhaustive search over all parameter values or an optimization procedure that explores only a finite subset of the possible values. The objective of this research is to simultaneously optimize the hyper-parameters and feature subset without degrading the generalization performances of the induction algorithm. We present a global optimization approach based on the use of Cross-Entropy Method to solve this kind of problem.

论文关键词:Cross-Entropy Method,Feature selection,Particle swarm optimization,Hyper-parameters optimization,Support vector machines

论文评审过程:Received 2 November 2011, Revised 30 March 2012, Accepted 13 April 2012, Available online 23 April 2012.

论文官网地址:https://doi.org/10.1016/j.patcog.2012.04.015