Knowledge-assisted recognition of cluster boundaries in gene expression data

作者:

Highlights:

摘要

Background and motivation:DNA microarray technology has made it possible to determine the expression levels of thousands of genes in parallel under multiple experimental conditions. Genome-wide analyses using DNA microarrays make a great contribution to the exploration of the dynamic state of genetic networks, and further lead to the development of new disease diagnosis technologies. An important step in the analysis of gene expression data is to classify genes with similar expression patterns into the same groups. To this end, hierarchical clustering algorithms have been widely used. Major advantages of hierarchical clustering algorithms are that investigators do not need to specify the number of clusters in advance and results are presented visually in the form of a dendrogram. However, since traditional hierarchical clustering methods simply provide results on the statistical characteristics of expression data, biological interpretations of the resulting clusters are not easy, and it requires laborious tasks to unveil hidden biological processes regulated by members in the clusters. Therefore, it has been a very difficult routine for experts.

论文关键词:Gene expression data,Hierarchical clustering,Cluster boundaries,Gene annotation,Genome database

论文评审过程:Received 30 October 2004, Revised 18 January 2005, Accepted 22 February 2005, Available online 27 July 2005.

论文官网地址:https://doi.org/10.1016/j.artmed.2005.02.007