Browse > Article
http://dx.doi.org/10.17661/jkiiect.2019.12.5.467

YOLO Model FPS Enhancement Method for Determining Human Facial Expression based on NVIDIA Jetson TX1  

Bae, Seung-Ju (Electronic Engineering, Kookmin University)
Choi, Hyeon-Jun (Electronic Engineering, Kookmin University)
Jeong, Gu-Min (Electronic Engineering, Kookmin University)
Publication Information
The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.12, no.5, 2019 , pp. 467-474 More about this Journal
Abstract
In this paper, we propose a novel method to improve FPS while maintaining the accuracy of YOLO v2 model in NVIDIA Jetson TX1. In general, in order to reduce the amount of computation, a conversion to an integer operation or reducing the depth of a network have been used. However, the accuracy of recognition can be deteriorated. So, we use methods to reduce computation and memory consumption through adjustment of the filter size and integrated computation of the network The first method is to replace the $3{\times}3$ filter with a $1{\times}1$ filter, which reduces the number of parameters to one-ninth. The second method is to reduce the amount of computation through CBR (Convolution-Add Bias-Relu) among the inference acceleration functions of TensorRT, and the last method is to reduce memory consumption by integrating repeated layers using TensorRT. For the simulation results, although the accuracy is decreased by 1% compared to the existing YOLO v2 model, the FPS has been improved from the existing 3.9 FPS to 11 FPS.
Keywords
Deep Learning; Embedded system; Facial expression recognition; TensorRT; YOLO;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Large Scale Visual Recognition Challenge (2018). http://www.image-net.org/challenges/LSVRC/ (accessed Oct., 15, 2018).
2 Ren, S., He, K., Girshick, R., Sun. J, "Faster r-cnn: Towards real-time object detection with region proposal networks. In Advances in neural," information processing systems, pp. 91-99, 2015
3 Redmon, J., Divvala, S., Girshick, R., Farhadi, A, "You only look once: Unified, real-time object detection," Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 779-788, 2016
4 Redmon, J., Farhadi, A., "YOLO9000: better, faster, stronger," arXiv preprint, 2017
5 M.H Lee, W.C Kang, "Performance Enhancement and Evaluation of a Deep Learning Framework on Embedded Systems using Unified Memory," KIISE Transactions on Computing Practices, Vol.23, No.7, 2017
6 Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Rabinovich, A., "Going deeper with convolutions," Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 1-9, 2015
7 Gong, Y., Liu, L., Yang, M., Bourdev, L., "Compressing deep convolutional networks using vector quantization," arXiv preprint arXiv:1412.6115, 2014
8 Han, S., Mao, H., & Dally, W. J., "Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding,"arXiv preprint arXiv:1510.00149, 2015