Browse > Article
http://dx.doi.org/10.5370/KIEE.2017.66.12.1772

A Study on Polynomial Neural Networks for Stabilized Deep Networks Structure  

Jeon, Pil-Han (School of Electrical and Electronic Engineering, the University of Suwon)
Kim, Eun-Hu (School of Electrical and Electronic Engineering, the University of Suwon)
Oh, Sung-Kwun (School of Electrical and Electronic Engineering, the University of Suwon)
Publication Information
The Transactions of The Korean Institute of Electrical Engineers / v.66, no.12, 2017 , pp. 1772-1781 More about this Journal
Abstract
In this study, the design methodology for alleviating the overfitting problem of Polynomial Neural Networks(PNN) is realized with the aid of two kinds techniques such as L2 regularization and Sum of Squared Coefficients (SSC). The PNN is widely used as a kind of mathematical modeling methods such as the identification of linear system by input/output data and the regression analysis modeling method for prediction problem. PNN is an algorithm that obtains preferred network structure by generating consecutive layers as well as nodes by using a multivariate polynomial subexpression. It has much fewer nodes and more flexible adaptability than existing neural network algorithms. However, such algorithms lead to overfitting problems due to noise sensitivity as well as excessive trainning while generation of successive network layers. To alleviate such overfitting problem and also effectively design its ensuing deep network structure, two techniques are introduced. That is we use the two techniques of both SSC(Sum of Squared Coefficients) and $L_2$ regularization for consecutive generation of each layer's nodes as well as each layer in order to construct the deep PNN structure. The technique of $L_2$ regularization is used for the minimum coefficient estimation by adding penalty term to cost function. $L_2$ regularization is a kind of representative methods of reducing the influence of noise by flattening the solution space and also lessening coefficient size. The technique for the SSC is implemented for the minimization of Sum of Squared Coefficients of polynomial instead of using the square of errors. In the sequel, the overfitting problem of the deep PNN structure is stabilized by the proposed method. This study leads to the possibility of deep network structure design as well as big data processing and also the superiority of the network performance through experiments is shown.
Keywords
Polynomial neural networks; L2 regularization; Least squard estimation; Sum of Squared Coefficients(SSC); Deep network structure;
Citations & Related Records
Times Cited By KSCI : 4  (Citation Analysis)
연도 인용수 순위
1 V. Rouss, W. Charon, and G. Cirrincione, "Neural model of the dynamic behaviour of a non-linear mechanical system," Mechanical Systems and Signal Processing, Vol 23, pp. 1145-1159, 2009.   DOI
2 A. G. Ivahnenko, "Polynomial theory of complex systems," IEEE Trans. on Systems, Man and Cybernetics, Vol. SMC-1, pp. 364-378, 1971.   DOI
3 S. K. Oh and W. Pedrycz, "The Design of Self-Organizing Polynomial Neural Networks,"Information Sciences, Vol. 141, pp. 237-258, 2002.   DOI
4 S. K. Oh, T. C. Ahn, and W. Pedrycz, "Fuzzy Polynomial Neural Networks-Based Structure and Its Application to Nonlinear Process Systems," 7th IFSA World Conference, Vol. 2, pp. 495-499, 1997.
5 Ho-Sung Park, Ki-Sang Kim, Sung-Kwun Oh "Design of Particle Swarm Optimization-based Polynomial Neural Networks"THE TRANSACTION OF THE KOREAN INSTITUTE OF ELECTRICAL ENGINEERS 60(2), 2011.2, 398-406   DOI
6 AE Hoerl, RW Kennard. "Ridge regression: Biased estimation for nonorthogonal problems", Technometrics, Vol. 12, No. 1, p. 55-67, 1970   DOI
7 A. N. Tikhonov, and V. Y. "Arsenin, solution of ill-posed problems.", Washington: Winston & Sons, 1977
8 Q. Fan, J,. M. Zurada, W. Wu, "Convergence of online gradient method for feedforward neural networks with smoothing L1/L2 regylarization penalty", Neurocomputing, Vol 131, pp. 208-216, 2014   DOI
9 Y. H. Pyo, K. H. Lee, K. H. You, "Earthquake magnitude estimation using Recursive Least Squares", IEEE International Conference on Cloud Computing and Big Data, pp. 394-397, 2016
10 H. S. Park, Y, H. Jin, S. K Oh, "Evolutionary Design of Radial Basis Function-based Polynomial Neural Network with the aid of Information Granulation", The Tr.nsactions of Korean Institute of Electrical Engineers, Vol. 60, No. 4, pp. 862-870, 2011.   DOI
11 S. K. Oh, W. D. Kim, H. S. Park, M. H. Son, "Identification Methodology of FCM-based Fuzzy Model Using Particle Swarm Optimization", The Transactions of Korean Institute of Electrical Engineers, Vol. 60, No. 1, pp. 184-192, 2011.   DOI
12 S. K. Oh, Y. H. Kim, H. S. Park, J. T. Kim, "Design of Data-centroid Radial Basis Function Neural Network with Extended Polynomial Type and Its Optimization", The Transactions of Korean Institute of Electrical Engineers, Vol. 60, No. 3, pp. 639-647, 2011.   DOI
13 L. P. Maguire, B. Roche, T. M. McGinnity, L. J. McDaid, "Predicting a chaotic time series using a fuzzy neural network", Information Science, Vol. 112, pp. 125-136, 1998.   DOI