Various dimension reduction techniques for high dimensional data analysis: a review

作者:Papia Ray, S. Surender Reddy, Tuhina Banerjee

摘要

In the era of healthcare, and its related research fields, the dimensionality problem of high dimensional data is a massive challenge as it contains a huge number of variables forming complex data matrices. The demand for dimension reduction of complex data is growing immensely to improvise data prediction, analysis and visualization. In general, dimension reduction techniques are defined as a compression of dataset from higher dimensional matrix to lower dimensional matrix. Several computational techniques have been implemented for data dimension reduction, which is further segregated into two categories such as feature extraction and feature selection. In this review, a detailed investigation of various feature extraction and feature selection methods has been carried out with a systematic comparison of several dimension reduction techniques for the analysis of high dimensional data and to overcome the problem of data loss. Then, some case studies are also cited to verify the better approach for data dimension reduction by considering few advances described in the technical literature. This review paper may guide researchers to choose the most effective method for satisfactory analysis of high dimensional data.

论文关键词:Canonical correlation analysis, Feature extraction, Feature selection, Local Fisher’s discriminate analysis, Locally linear embedding, Principle component analysis

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10462-020-09928-0