An efficient greedy K-means algorithm for global gene trajectory clustering

作者:

Highlights:

摘要

Optimal clustering of co-regulated genes is critical for reliable inference of the underlying biological processes in gene expression analysis, for which the K-means algorithm have been widely employed for its efficiency. However, given that the solution space is large and multimodal, which is typical of gene expression data, K-means is prone to produce inconsistent and sub-optimal cluster solutions that may be unreliable and misleading for biological interpretation.This paper applies a novel global clustering method called the greedy elimination method (GEM) to alleviate these problems. GEM is simple to implement, yet very effective in improving the global optimality of the solutions. Experiments over two sets of gene expression data show that the GEM scores significantly lower clustering errors than the standard K-means and the greedy incremental method.

论文关键词:K-means clustering,Gene expression data,Greedy elimination method

论文评审过程:Available online 10 October 2005.

论文官网地址:https://doi.org/10.1016/j.eswa.2005.09.049