Cluster analysis of genome-wide expression data for feature extraction

作者:

Highlights:

摘要

Bio-chip data that consists of high-dimensional attributes have more attributes than specimens. Thus, it is difficult to obtain covariance matrix from tens thousands of genes within a number of samples. Feature selection and extraction is critical to remove noisy features and reduce the dimensionality in microarray analysis. This study aims to fill the gap by developing a data mining framework with a proposed algorithm for cluster analysis of gene expression data, in which coefficient correlation is employed to arrange genes. Indeed, cluster analysis of microarray data can find coherent patterns of gene expression. The output is displayed as table list for convenient survey. We adopt the breast cancer microarray dataset to demonstrate practical viability of this approach.

论文关键词:Bio-chip,Microarray,Gene expression,Cluster analysis,Data mining,Feature extraction

论文评审过程:Available online 15 February 2008.

论文官网地址:https://doi.org/10.1016/j.eswa.2008.01.068