Genetic algorithm-based feature selection in high-resolution NMR spectra

作者:

Highlights:

摘要

High-resolution nuclear magnetic resonance (NMR) spectroscopy has provided a new means for detection and recognition of metabolic changes in biological systems in response to pathophysiological stimuli and to the intake of toxins or nutrition. To identify meaningful patterns from NMR spectra, various statistical pattern recognition methods have been applied to reduce their complexity and uncover implicit metabolic patterns. In this paper, we present a genetic algorithm (GA)-based feature selection method to determine major metabolite features to play a significant role in discrimination of samples among different conditions in high-resolution NMR spectra. In addition, an orthogonal signal filter was employed as a preprocessor of NMR spectra in order to remove any unwanted variation of the data that is unrelated to the discrimination of different conditions. The results of k-nearest neighbors and the partial least squares discriminant analysis of the experimental NMR spectra from human plasma showed the potential advantage of the features obtained from GA-based feature selection combined with an orthogonal signal filter.

论文关键词:Metabolomics,Nuclear magnetic resonance (NMR),Feature selection,Discrimination,Genetic algorithm (GA),Orthogonal signal correction filter

论文评审过程:Available online 15 August 2007.

论文官网地址:https://doi.org/10.1016/j.eswa.2007.08.050