On the complexity of the spaced seeds

作者:

Highlights:

摘要

Optimal spaced seeds were introduced by the theoretical computer science community to bioinformatics to effectively increase homology search sensitivity. These seeds are serving many homology queries daily. However the computational complexity of finding the optimal spaced seeds remains to be open. In this paper, we prove that computing hit probability of a spaced seed in a uniform homology region is NP-hard, but it admits a probabilistic PTAS. We also show that the asymptotic hit probability is computable in exponential time in seed length, independent of the homologous region length.

论文关键词:Optimal spaced seeds,Homology search

论文评审过程:Received 17 June 2006, Revised 6 February 2007, Available online 14 March 2007.

论文官网地址:https://doi.org/10.1016/j.jcss.2007.03.008