Browse > Article
http://dx.doi.org/10.6109/jkiice.2017.21.6.1143

Efficient Implementation of Convolutional Neural Network Using CUDA  

Ki, Cheol-Min (Department of Computer Engineering, Korea University of Technology and Education)
Cho, Tai-Hoon (School of Computer Science and Engineering, Korea University of Technology and Education)
Abstract
Currently, Artificial Intelligence and Deep Learning are rising as hot social issues, and these technologies are applied to various fields. A good method among the various algorithms in Artificial Intelligence is Convolutional Neural Networks. Convolutional Neural Network is a form that adds Convolution Layers to Multi Layer Neural Network. If you use Convolutional Neural Networks for small amount of data, or if the structure of layers is not complicated, you don't have to pay attention to speed. But the learning should take long time when the size of the learning data is large and the structure of layers is complicated. In these cases, GPU-based parallel processing is frequently needed. In this paper, we developed Convolutional Neural Networks using CUDA, and show that its learning is faster and more efficient than learning using some other frameworks or programs.
Keywords
Convolutional Neural Network; Machine Learning; Parallel Processing; CUDA; Fast Speed;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Y. Huang, K. Li, G. Wang, M. Cao, P. Li, Y. Zhang. (2015, May). Recognition of convolutional neural network based on CUDA technology. arXiv preprint arXiv:1506.00074 [Online]. Available: https://arxiv.org/abs/1506.00074.
2 Dan C. Ciresan, U. Meier, J. Masci, Luca M. gambardella, J, Schmidhuber. (2011, February). High-Performance Neural Networks for Visual Object Classification. Arxiv preprint arXiv:1102.0183 [Online]. Available: https://arxiv.org/abs/1102.0183.
3 P. Sermanet, D. Eigen, X. Zhang, M. Mathieu, R. Fergus, and Y. LeCun, "Overfeat: Integrated recognition, localization and detection using convolutional networks," Proceedings of the ICLR, 2014.
4 Kumar Chellapila, Sidd Puri, and Patrice Simard, "High performance convolutional neural network for document processing", International Workshop on Frontiers in Handwriting Recognition, 2006.
5 Joseph Chet Redmon's Homepage. Darknet Framework [Internet]. Available: https://pjreddie.com/darknet
6 Dan Ciresan's Homepage. Net CPU Version Program [Internet]. Available: https://people.idsia.ch/-ciresan/index.htm.
7 University of Science and Technology of China. CUDA-CNN program [Internet]. Available: https://github.com/zhxfl/CUDA-CNN.
8 Berkeley AI Research. Caffe framework [Internet]. Available: http://caffe.berkeleyvision.org.
9 Y.jia, E. Shelhamer, J. Donahue, S. Karayev, J.Long, R. Girshick, S. Guadarrama, and T. Darrell. (2014, June). Caffe: Convolutional architecture for fast feature embedding. arXiv preprint arXiv:1408.5093 [Online]. Available: https://arxiv.org/abs/1408.5093.
10 Y. LeCun, L. Bottou, Y. Bengio, and P.Haffner, "Gradient-Based Learning Applied to Document Recognition," Proceedings of the IEEE, vol. 86, no. 11, pp. 2278-2324, Nov. 1998.   DOI
11 Patrice Y. Simard, Dave Steinkraus, John Platt, "Best Practices for Convolutional Neural Networks Applied to Visual Document Analysis," Proceedings of the ICDAR, pp. 958-962, 2003.
12 Yann Lecun's Homepage. The MNIST DATABASE of Handwritten Digits [Internet], Available: https://yann.lecun.com/exdb/mnist.
13 Wikipedia. Description of GPGPU [Internet]. Available: https://ko.wikipedia. org/wiki/GPGPU.
14 Wikipedia. CUDA Processing Flow [Internet]. Available: https://upload.wikimedia.org/wikipedia/commons/5/59/CUDA_processing_flow_%28En%29.PNG.
15 J. Long, E. Shelhamer, T. Darrell, "Fully Convolutional Networks for Semantic Segmentation," Proceeding of the CVPR, pp. 3431-3440, 2015.