Protein secondary structure prediction by using deep learning method

作者:

Highlights:

摘要

The prediction of protein structures directly from amino acid sequences is one of the biggest challenges in computational biology. It can be divided into several independent sub-problems in which protein secondary structure (SS) prediction is fundamental. Many computational methods have been proposed for SS prediction problem. Few of them can model well both the sequence-structure mapping relationship between input protein features and SS, and the interaction relationship among residues which are both important for SS prediction. In this paper, we proposed a deep recurrent encoder–decoder networks called Secondary Structure Recurrent Encoder–Decoder Networks (SSREDNs) to solve this SS prediction problem. Deep architecture and recurrent structures are employed in the SSREDNs to model both the complex nonlinear mapping relationship between input protein features and SS, and the mutual interaction among continuous residues of the protein chain. A series of techniques are also used in this paper to refine the model’s performance. The proposed model is applied to the open dataset CullPDB and CB513. Experimental results demonstrate that our method can improve both Q3 and Q8 accuracy compared with some public available methods. For Q8 prediction problem, it achieves 68.20% and 73.1% accuracy on CB513 and CullPDB dataset in fewer epochs better than the previous state-of-art method.

论文关键词:Deep learning,Secondary structure prediction,Encoder–decoder networks,Recurrent neural networks

论文评审过程:Received 9 March 2016, Revised 16 November 2016, Accepted 16 November 2016, Available online 17 November 2016, Version of Record 12 January 2017.

论文官网地址:https://doi.org/10.1016/j.knosys.2016.11.015