PSSP with dynamic weighted kernel fusion based on SVM-PHGS

作者:

Highlights:

摘要

Since 1960s, researchers have proposed several prediction methods, for protein secondary structure prediction (PSSP), whereas the accuracy of them is no more than 80%. In this case, there is an urgent need to introduce a high accuracy prediction method. One learning method called support vector machines (SVMs) has shown comparable or better results than neural networks on bioinformatics applications. This research proposes a method based on SVM which has been improved by a new parallel multi class (PMC) method, parallel hierarchical grid search (PHGS), cross validation (CV) technique and weighted kernel fusion (WKF) method. The presented PHGS has been applied to regularize parameters of SVM’s kernel function which have an important impact on the accuracy. Using a suitable input data and kernel function for a particular problem can improve the prediction results remarkably. Also to improve our method, Position Scoring Matrix (PSSM) profiles are used as the input information to it. The goals of this study are to calibrate kernel function parameters and fusion of different kernel functions’ result in order to determine protein secondary structure classes accurately. The right choice of a fusion method is an important issue in creating a supreme performance so we propose a dynamic weight allocation method based on a non-linear analysis system. The obtained classification accuracies of our method are 84.65% and 83.94% on RS126 and CB513 datasets respectively and they are very promising with regard to other classification methods in the literature for this problem. Also for evaluating our method behavior in comparison to other state of arts methods, an independent dataset is used. The results show that the comprehensibility of WKF based on SVM-PHGS is much better than other methods.

论文关键词:Protein secondary structure prediction,Machine learning approach,Support vector machines,Parallel hierarchical grid search,Weighted kernel fusion

论文评审过程:Received 9 May 2011, Revised 2 November 2011, Accepted 3 November 2011, Available online 13 November 2011.

论文官网地址:https://doi.org/10.1016/j.knosys.2011.11.002