http://dx.doi.org/10.14400/JDC.2021.19.10.253

An efficient machine learning for digital data using a cost function and parameters  

Ji, Sangmin (Department of Mathematics, Chungnam National University)
Park, Jieun (Seongsan Liberal Arts College, Daegu University)
Publication Information
Journal of Digital Convergence, v.19, no.10, 2021, pp. 253-263
Abstract
In machine learning, a cost function is constructed from the training data and an artificial neural network that predicts the data, and the parameters that minimize this cost function are sought. The parameters are updated using gradient-based methods applied to the cost function. The more complex the digital signal and the problem to be learned, the more complex and deeper the structure of the artificial neural network becomes. Such a complex, deep network structure can cause over-fitting. To avoid over-fitting, weight-decay regularization of the parameters is used; in this paper, we additionally incorporate the value of the cost function into this regularization. In this way, the accuracy of machine learning is improved, and the advantage of the proposed method is confirmed through numerical experiments. These results make it possible to obtain accurate predictions for a wide range of artificial intelligence data through machine learning.
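The abstract describes weight-decay regularization in which the current value of the cost function is also used. Below is a minimal sketch of that idea in Python, assuming, purely for illustration, that the cost value scales the weight-decay coefficient during a gradient-descent update; the paper's exact formulation is not stated in the abstract and may differ.

import numpy as np

# Minimal sketch: gradient descent with a cost-aware weight-decay term.
# ASSUMPTION: the current cost value J(w) scales the decay coefficient lam;
# this is an illustration, not the paper's exact method.

def cost(w, X, y):
    # Mean-squared-error cost of a linear model (illustrative stand-in
    # for the neural-network cost function).
    return 0.5 * np.mean((X @ w - y) ** 2)

def grad(w, X, y):
    # Gradient of the MSE cost with respect to the parameters w.
    return X.T @ (X @ w - y) / len(y)

def train(X, y, lr=0.1, lam=1e-3, steps=1000):
    w = np.zeros(X.shape[1])
    for _ in range(steps):
        J = cost(w, X, y)
        # Plain weight decay would subtract lr * lam * w;
        # here the decay strength is additionally modulated by J.
        w -= lr * (grad(w, X, y) + lam * J * w)
    return w

# Usage on synthetic data: recover the coefficients of a noisy linear model.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + 0.1 * rng.normal(size=200)
print(train(X, y))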
Keywords
Optimization; Digital signal; Machine learning; Classification; Regularization;