Browse > Article
http://dx.doi.org/10.6109/jkiice.2018.22.4.728

Architectures of Convolutional Neural Networks for the Prediction of Protein Secondary Structures  

Chi, Sang-Mun (Department of Computer Science, Kyungsung University)
Abstract
Deep learning has been actively studied for predicting protein secondary structure based only on the sequence information of the amino acids constituting the protein. In this paper, we compared the performances of the convolutional neural networks of various structures to predict the protein secondary structure. To investigate the optimal depth of the layer of neural network for the prediction of protein secondary structure, the performance according to the number of layers was investigated. We also applied the structure of GoogLeNet and ResNet which constitute building blocks of many image classification methods. These methods extract various features from input data, and smooth the gradient transmission in the learning process even using the deep layer. These architectures of convolutional neural networks were modified to suit the characteristics of protein data to improve performance.
Keywords
Protein secondary structure; Deep learning; Convolutional neural networks; GoogLeNet; ResNet;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Theano Development Team. "Theano: A Python framework for fast computation of mathematical expressions," [Online]. arXiv:1605.02688, May. 2016.
2 S.. Dieleman, et al., "Lasagne: First release," [Internet]. Available: http://dx.doi.org/10.5281/zenodo.27878.
3 J. Duchi, E. Hazan, and Y. Singer, "Adaptive subgradient methods for online learning and stochastic optimization," Journal of Machine Learning Research, vol. 12, pp. 2121-2159, Jul. 2011.
4 W. Li, et al., "Regularization of neural networks using dropconnect," Proceedings of the 30th International Conference on Machine Learning, Atlanta, USA, vol. 28, no. 3, pp. 1058-1066, Jun. 2013.
5 D. Ciresan, U. Meier, and J. Schmidhuber, "Multi-column deep neural networks for image classification," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3642-3649, Washington, DC, USA, Jun. 2012.
6 Shin-Hye, et al., "A Comparison of Predicting Movie Success between Artificial Neural Network and Decision Tree", Asia-pacific Journal of Multimedia, vol.7, no.4, pp. 593-602, 2017.
7 S. Chi, "A Performance Comparison of Protein Profiles for the Prediction of Protein Secondary Structures," Journal of the Korea Institute of Information and Communication Engineering, vol. 22, no. 1, pp. 26-32 Jan. 2018.   DOI
8 D. Baker and A. Sali., "Protein structure prediction and structural genomics," Science, vol. 294 no. 5, pp. 93-96, Oct. 2001.   DOI
9 H. Lodish, et al., Molecular Cell Biology, 6th ed. New York, NY: W.H. Freeman and Company, 2007
10 H. W. Buchan, et al., "Scalable web services for the PSIPRED protein analysis workbench," Nucleic Acids Research, vol. 41, W72-W76, Jul. 2013.   DOI
11 S. Wang, et al., "Protein secondary structure prediction using deep convolutional neural fields," Scientific Reports 6, Article number: 18962, Jan. 2016.
12 C. N. Magnan and P. Baldi, "SSpro/ACCpro 5: almost perfect prediction of protein secondary structure and relative solvent accessibility using profiles, machine learning and structural similarity," Bioinformatics, vol. 30, no. 18, pp, 2592-2597, Sep. 2014.   DOI
13 J. Zhou, and O. Troyanskaya, "Deep supervised convolutional generative stochastic network for protein secondary structure prediction," Proceedings of Machine Learning Research, vol. 32, no. 1, pp. 745-753, Jun. 2014.
14 M. Spencer, J. Eickholt, and J. Cheng, "A deep learning network approach to ab initio protein secondary structure prediction," IEEE/ACM Transactions on Computational Biology Bioinformatics, vol. 12, no. 1, pp. 103-112, Jan/Feb. 2015.   DOI
15 Olga Russakovsky, et al., "ImageNet Large Scale Visual Recognition Challenge," International Journal of Computer Vision, vol. 115, no. 3, pp. 211-252, 2015.   DOI
16 C. Szegedy, et al., "Going deeper with convolution," IEEE Conference on Computer Vision and Pattern Recognition, pp. 1-9, Jun. 2015.
17 K. He, et al., "Deep residual learning for image recognition," IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778, Jun. 2016.
18 K. He, et al., "Identity mapping in deep residual networks," European Conference on Computer Vision, pp. 630-645, Sep. 2016.
19 G. Wang and R.L. Dunbrack "PISCES: a protein sequence culling server," Bioinformatics, vol. 19, no. 12, pp. 1589-1591, 2003.   DOI
20 W. Kabsch and C. Sander, "Dictionary of protein secondary structure: pattern recognition of hydrohen-bonded and geometrical features," Biopolymers, vol. 22, no. 12, pp. 2577-2637, Dec. 1983.   DOI
21 G. E. Hinton, et al., "Improving neural networks by preventing co-adaptation of feature detectors," [Online]. arXiv:1207.0580, Jul. 2012.
22 S. F. Altschul, et al., "Gapped blast and PSI-BLAST: a new generation of protein database search programs," Nucleic Acids Research, vol. 25, no. 17, pp. 3389-3402, Sep. 1997.   DOI
23 B. E. Suzek, et al., "Uniref: comprehensive and non- reduncant uniprot reference clusters," Bioinformatics, vol. 23, no. 10, pp. 1282-1288, May. 2007.   DOI