Highly accurate and consistent method for prediction of helix and strand content from primary protein sequences

Authors:

Highlights:

Abstract

Objective: Prediction of protein secondary structure is one of the interesting computational topics in bioinformatics. More than 30 years of research have been devoted to it, but reliable prediction methods are still far off. A critical piece of information for accurate secondary structure prediction is the helix and strand content of a given protein sequence; the ability to predict the content of these two secondary structures accurately has good potential to improve secondary structure prediction itself. Most existing methods use the composition vector to predict the content. Their underlying assumption is that this vector provides a functional mapping between primary sequence and helix/strand content. While this holds for small sets of proteins, we show that for larger protein sets such mappings are inconsistent, i.e. the same composition vectors correspond to different contents. To address this, we propose a method for predicting helix/strand content from primary protein sequences that is fundamentally different from currently available methods.
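The inconsistency described in the abstract can be illustrated with a small sketch. It assumes the common definition of a composition vector as the relative frequency of each of the 20 standard amino acids; the abstract does not spell out the definition, and the function name `composition_vector` and both example sequences are hypothetical, chosen only for illustration. The sketch shows that two different sequences can share one composition vector, so a predictor that sees only that vector cannot tell them apart.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumed alphabet).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition_vector(sequence):
    """Return the relative frequency of each standard amino acid.

    Assumption: this is the usual textbook definition, not necessarily
    the exact formulation used by the paper.
    """
    counts = Counter(sequence.upper())
    # Only standard residues contribute to the total length.
    total = sum(counts[aa] for aa in AMINO_ACIDS)
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return [counts[aa] / total for aa in AMINO_ACIDS]


# Two hypothetical sequences with the same residues in a different order.
seq_a = "AAEELLKKGGVV"
seq_b = "AGVELKAGVELK"

# Residue order is discarded, so both map to the same vector.
same_vector = composition_vector(seq_a) == composition_vector(seq_b)
print(same_vector)
```

Because the vector discards residue order, any two sequences that are permutations of each other are indistinguishable to a composition-only predictor, even though their helix and strand contents may differ.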

Keywords: Protein content prediction, Composition vector, Composition moment vector, Primary protein sequence, Secondary protein structure, Proteomics, Bioinformatics

Review history: Received 14 November 2004, Revised 22 January 2005, Accepted 22 February 2005, Available online 2 August 2005.

Official paper URL: https://doi.org/10.1016/j.artmed.2005.02.006