Segmentation of DNA using simple recurrent neural network

作者:

Highlights:

摘要

We report the discovery of strong correlations between protein coding regions and the prediction errors when using the simple recurrent network to segment genome sequences. We are going to use SARS genome to demonstrate how we conduct training and derive corresponding results. The distribution of prediction error indicates how the underlying hidden regularity of the genome sequences and the results are consistent with the finding of biologists: predicated protein coding features of SARS genome. This implies that the simple recurrent network is capable of providing new features for further biological studies when applied on genome studies. The HA gene of influenza A subtype H1N1 is also analyzed in a similar way.

论文关键词:Quasi-regular structure,Elman network,Segmentation of DNA,SARS,H1N1

论文评审过程:Received 17 June 2011, Revised 1 September 2011, Accepted 2 September 2011, Available online 17 September 2011.

论文官网地址:https://doi.org/10.1016/j.knosys.2011.09.001