Comparative analysis of statistical pattern recognition methods in high dimensional settings

作者:

Highlights:

摘要

An extensive simulation study is reported comparing eight statistical classification methods, focusing on problems where the number of observations is less than the number of variables. Using a wide range of artificial and real data sets, two types of classifiers are contrasted; methods that classify using all variables, and methods that first reduce the number of dimensions to two or three. The simulations identified regularized discriminant analysis as the overall clearly most powerful classifier, and show that in most cases, a reduction of the dimensionality to two or three dimensions prior to classification increases the error in allocating test observations.

论文关键词:Discriminant analysis,High dimensionality,Classifier evaluation,Simulation,Dimensionality,Reduction

论文评审过程:Received 8 April 1993, Revised 7 December 1993, Accepted 20 December 1993, Available online 19 May 2003.

论文官网地址:https://doi.org/10.1016/0031-3203(94)90145-7