An efficient strategy for generating all descendant subtree patterns from phylogenetic trees with its implementation

作者:

Highlights:

摘要

Phylogenetic trees are an important tool to help in the understanding of relationships between objects that evolve through time, in particular molecular sequences. In this paper, we efficiently solve two subtree-comparison problems on a set of phylogenetic trees which have practical applications to analyze the evolution and co-evolution genes clustering of genomic sequences. Let T1,T2,…,Tk be a set of k phylogenetic trees such that the leaves of each tree are drawn from {1,2,…,n} and the leaves for two arbitrary trees are not necessary the same, where n is the maximum number of the leaves among the k trees. We present a linear-time algorithm to find all the leaf-agreement descendant subtrees. By further extending this result, we present a linear-time algorithm to find all the leaf-agreement isomorphic descendant subtrees. Based on our algorithms, a web-based system using input tree files from TreeBASE is also implemented.

论文关键词:Knowledge discovery,Tree pattern retrieval,Phylogenetic trees,Descendant subtrees,Web-based system,TreeBASE,Design and analysis of algorithms

论文评审过程:Available online 2 April 2007.

论文官网地址:https://doi.org/10.1016/j.amc.2007.03.077