On pseudoknot-bordered words and their properties

作者:

Highlights:

摘要

We study a generalization of the classical notions of bordered and unbordered words, motivated by biomolecular computing. DNA strands can be viewed as finite strings over the alphabet {A, G, C, T}, and are used in biomolecular computing to encode information. Due to the fact that A is Watson–Crick complementary to T and G to C, DNA single strands that are Watson–Crick complementary can bind to each other or to themselves forming so-called secondary structures. Most of these secondary structures are undesirable for biomolecular computational purposes since the strands they involve cannot further interact with other strands. This paper studies pseudoknot-bordered words, a mathematical formalization of pseudoknot-like inter- and intra-molecular structures. In this context, pseudoknot-unbordered words model DNA or RNA strands that will be free of such secondary structures. We obtain several properties of pseudoknot-bordered and -unbordered words. We also address following problem: Given a pseudoknot-unbordered word u, does {u}+ consist of pseudoknot-unbordered words only? We show that this is not generally true. We find that a sufficient condition for {u}+ to consist of pseudoknot-unbordered words only is that u be not primitive. All of our results hold for arbitrary antimorphic involutions, of which the DNA Watson–Crick complementarity function is a particular case.

论文关键词:DNA computing,Bordered words,Unbordered words,RNA pseudoknots,Pseudoknot-bordered words

论文评审过程:Received 21 September 2007, Revised 28 July 2008, Available online 14 August 2008.

论文官网地址:https://doi.org/10.1016/j.jcss.2008.08.002