Abstract
Neural networks have been revived as deep learning thanks to big data, faster processors, and refinements to training methods. Early networks often suffered from poorly chosen weight initializations and ill-suited non-linear activation functions. Weight initialization is a significant factor in both the final quality of a trained network and its convergence rate. This paper discusses different approaches to weight initialization and compares them experimentally on the MNIST dataset to identify the technique that achieves the highest accuracy in the shortest training time.