Measuring semantic similarity between Gene Ontology terms

Authors:

Highlights:

Abstract

Many bioinformatics applications would benefit from comparing proteins by their biological role rather than by their sequence. This paper makes two contributions. First, a study of the correlation between Gene Ontology (GO) terms and family similarity demonstrates that protein families constitute an appropriate baseline for validating GO similarity. Second, we introduce GraSM, a novel method that uses all the information in the graph structure of the Gene Ontology instead of treating it as a hierarchical tree. GraSM yields a consistently higher family similarity correlation on all aspects of GO than the original semantic similarity measures.
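
The contrast between a tree view and a graph view of an ontology can be illustrated with a small sketch. This is not GraSM itself, whose details the abstract does not give; it only shows, under assumed toy data, how following every parent of a term in a directed acyclic graph can reveal shared ancestors that a single-parent (tree-style) walk misses. The term names, the `PARENTS` table, and all function names are hypothetical.

```python
# Toy ontology as a DAG: each term maps to its direct parents.
# Term names are hypothetical placeholders, not real GO identifiers.
PARENTS = {
    "root": [],
    "a": ["root"],
    "b": ["root"],
    "c": ["a", "b"],  # two parents: this is what makes the structure a graph, not a tree
    "d": ["a"],
    "e": ["c"],
}


def ancestors(term, parents=PARENTS):
    """Return the term itself plus every ancestor reachable through any parent."""
    seen = set()
    stack = [term]
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(parents[current])
    return seen


def common_ancestors(t1, t2, parents=PARENTS):
    """Graph view: shared ancestors found by following all parents of both terms."""
    return ancestors(t1, parents) & ancestors(t2, parents)


def tree_ancestors(term, parents=PARENTS):
    """Tree-style view (a simplifying assumption): follow only the first listed parent."""
    path = [term]
    while parents[path[-1]]:
        path.append(parents[path[-1]][0])
    return set(path)


graph_shared = common_ancestors("e", "b")
tree_shared = tree_ancestors("e") & tree_ancestors("b")
print(sorted(graph_shared))  # shared ancestors when all parents are followed
print(sorted(tree_shared))   # shared ancestors when only one parent is followed
```

In this toy example the graph view finds that `b` is itself an ancestor of `e` (through `c`), while the single-parent walk only finds `root`. A similarity measure built on shared ancestors would therefore see the two terms as more closely related under the graph view.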

Keywords: Knowledge manipulation technique, Semantic similarity, Gene Ontology, Bioinformatics

Article history: Received 17 December 2005, Revised 14 April 2006, Accepted 16 May 2006, Available online 16 June 2006.

Official paper URL: https://doi.org/10.1016/j.datak.2006.05.003