Improved Deep Learning Algorithm

Kim, Byung Joo;

doi:10.14801/JAITC.2018.8.2.119

Journal of Advanced Information Technology and Convergence (한국정보기술학회 영문논문지)

Volume 8 Issue 2 / Pages 119-127 / 2018 / 2234-1072 (pISSN) / 2234-0963 (eISSN)

Korean Institute of Information Technology (한국정보기술학회)

Kim, Byung Joo (School of Computer Engineering, Youngsan University)

Received : 2018.11.15
Accepted : 2018.12.27
Published : 2018.12.31

https://doi.org/10.14801/JAITC.2018.8.2.119

Abstract

Training a very large deep neural network can be painfully slow and is prone to overfitting. Much research has been done to overcome these problems. This paper presents a deep neural network that combines early stopping with the ADAM optimizer. This form of deep network is useful for handling big data because it automatically stops training before overfitting occurs. Its generalization ability is also better than that of a plain deep neural network model.
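The combination described in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the toy linear model, the synthetic data, the patience-based stopping rule, and all names and hyperparameters (`adam_step`, `train`, `patience`, `lr`, and so on) are assumptions chosen only to show how ADAM updates and validation-based early stopping fit together in one training loop.

```python
import math
import random


def adam_step(params, grads, m, v, t, lr=0.05, beta1=0.9, beta2=0.999, eps=1e-8):
    """Apply one Adam update at step t (t starts at 1) and return the new state."""
    new_params, new_m, new_v = [], [], []
    for p, g, m_i, v_i in zip(params, grads, m, v):
        m_i = beta1 * m_i + (1 - beta1) * g        # first-moment estimate
        v_i = beta2 * v_i + (1 - beta2) * g * g    # second-moment estimate
        m_hat = m_i / (1 - beta1 ** t)             # bias-corrected moments
        v_hat = v_i / (1 - beta2 ** t)
        new_params.append(p - lr * m_hat / (math.sqrt(v_hat) + eps))
        new_m.append(m_i)
        new_v.append(v_i)
    return new_params, new_m, new_v


def mse_and_grads(params, data):
    """Mean squared error of y = w*x + b and its gradient with respect to (w, b)."""
    w, b = params
    n = len(data)
    loss = grad_w = grad_b = 0.0
    for x, y in data:
        err = w * x + b - y
        loss += err * err
        grad_w += 2 * err * x
        grad_b += 2 * err
    return loss / n, [grad_w / n, grad_b / n]


def make_data(n, seed):
    """Hypothetical toy data: y = 3x + 1 plus Gaussian noise."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        x = rng.uniform(-1.0, 1.0)
        data.append((x, 3.0 * x + 1.0 + rng.gauss(0.0, 0.3)))
    return data


def train(train_data, val_data, max_epochs=2000, patience=20, lr=0.05):
    """Train with Adam; stop once validation loss has not improved for `patience` epochs."""
    params, m, v = [0.0, 0.0], [0.0, 0.0], [0.0, 0.0]
    best_val, best_params, best_epoch = float("inf"), list(params), 0
    epoch = 0
    for epoch in range(1, max_epochs + 1):
        _, grads = mse_and_grads(params, train_data)
        params, m, v = adam_step(params, grads, m, v, epoch, lr=lr)
        val_loss, _ = mse_and_grads(params, val_data)
        if val_loss < best_val:
            # Keep a copy of the best parameters seen so far.
            best_val, best_params, best_epoch = val_loss, list(params), epoch
        elif epoch - best_epoch >= patience:
            break  # early stopping: no recent improvement on held-out data
    return best_params, best_val, best_epoch, epoch


train_data = make_data(80, seed=0)
val_data = make_data(40, seed=1)
best_params, best_val, best_epoch, stopped_epoch = train(train_data, val_data)
print(f"best epoch {best_epoch}, stopped at {stopped_epoch}, val MSE {best_val:.4f}")
```

Returning the parameters from the best validation epoch, rather than the last one, is one common design choice; the patience value trades off extra training time against the risk of stopping on a temporary plateau.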

Acknowledgement

Supported by : Youngsan University

