Browse > Article
http://dx.doi.org/10.6109/jkiice.2016.20.9.1816

Prediction of Protein Secondary Structure Using the Weighted Combination of Homology Information of Protein Sequences  

Chi, Sang-mun (Department of Computer Science and Engineering, Kyungsung University)
Abstract
Protein secondary structure is important for the study of protein evolution, structure and function of proteins which play crucial roles in most of biological processes. This paper try to effectively extract protein secondary structure information from the large protein structure database in order to predict the protein secondary structure of a query protein sequence. To find more remote homologous sequences of a query sequence in the protein database, we used PSI-BLAST which can perform gapped iterative searches and use profiles consisting of homologous protein sequences of a query protein. The secondary structures of the homologous sequences are weighed combined to the secondary structure prediction according to their relative degree of similarity to the query sequence. When homologous sequences with a neural network predictor were used, the accuracies were higher than those of current state-of-art techniques, achieving a Q3 accuracy of 92.28% and a Q8 accuracy of 88.79%.
Keywords
Protein secondary structure; Homologous protein search; PSI-BLAST; Sequence similarity;
Citations & Related Records
연도 인용수 순위
  • Reference
1 C. N. Magnan and P. Baldi, "SSpro/ACCpro 5: almost perfect prediction of protein secondary structure and relative solvent accessibility using profiles, machine learning and structural similarity," Bioinformatics, vol. 30, no. 18, pp, 2592-2597, Sep. 2014.   DOI
2 C. Mirabello and G. Pollastri, "Porter, PaleAle 4.0: high-accuracy prediction of protein secondary structure and relative solvent accessibility," Bioinformatics, vol. 29, no. 16, pp. 2056-2058, Aug. 2013.   DOI
3 R. Yan, et al, "A comparative assessment and analysis of 20 representative sequence alignment methods for protein structure prediction," Scientific Reports 3, Article number: 2619, Sep. 2013.
4 J. Zhou and O. Troyanskaya, "Deep supervised convolutional generative stochastic network for protein secondary structure prediction," in JMLR Proceedings, 32, pp. 745-753, Beijing, China, 2014.
5 M. Spencer, J. Eickholt and J. Cheng, "A deep learning network approach to ab initio protein secondary structure prediction," IEEE/ACM Trans. Comput. Biol. Bioinform., vol. 12, no. 1, pp. 103-112, Jan.-Feb. 2015.   DOI
6 R. Heffernan, et al, "Improving prediction of secondary structure, local backbone angles, and solvent accessible surface area of proteins by iterative deep learning," Scientific Reports 5, Article number: 11476, June 2015.
7 S. Wang, J. Peng, J. Ma, and J. Xu, "Protein secondary structure prediction using deep convolutional neural fields," Scientific Reports 6, Article number: 18962, Jan. 2016.
8 S. F. Altschul, et al, "Gapped blast and PSI-BLAST: a new generation of protein database search programs," Nucleic Acids Res., vol. 25, no. 17, pp. 3389-3402, Sep. 1997.   DOI
9 H. M. Berman, et al, "The protein data bank," Nucleic Acids Res. vol. 28, no. 1, pp. 235-242, Jan. 2000.   DOI
10 B. E. Suzek, et al, "Uniref: comprehensive and nonreduncant uniprot reference clusters," Bioinformatics, vol. 23, no. 10, pp. 1282-1288, May 2007.   DOI
11 H. W. Buchan, et al., "Scalable web services for the PSIPRED protein analysis workbench," Nucleic Acids Res., vol. 41, W72-W76, Jul. 2013.   DOI
12 W. Kabsch and C. Sander, "Dictionary of protein secondary structure: pattern recognition of hydrohen-bonded and geometrical features," Biopolymers, vol. 22, no. 12, pp. 2577-2637, Dec. 1983.   DOI
13 H. Lodish, A. Berk, C.A. Kaiser, et al., Molecular Cell Biology, 6th Ed. New York, NY: W. H. Freeman and Company, 2007.