Linear-time algorithms for computing maximum-density sequence segments with bioinformatics applications

作者:

Highlights:

摘要

We study an abstract optimization problem arising from biomolecular sequence analysis. For a sequence A of pairs (ai,wi) for i=1,…,n and wi>0, a segment A(i,j) is a consecutive subsequence of A starting with index i and ending with index j. The width of A(i,j) is w(i,j)=∑i⩽k⩽jwk, and the density is (∑i⩽k⩽jak)/w(i,j). The maximum-density segment problem takes A and two values L and U as input and asks for a segment of A with the largest possible density among those of width at least L and at most U. When U is unbounded, we provide a relatively simple, O(n)-time algorithm, improving upon the O(nlogL)-time algorithm by Lin, Jiang and Chao. We then extend this result, providing an O(n)-time algorithm for the case when both L and U are specified.

论文关键词:Bioinformatics,Sequences,Density

论文评审过程:Received 3 October 2002, Revised 3 August 2004, Available online 18 November 2004.

论文官网地址:https://doi.org/10.1016/j.jcss.2004.08.001