[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.3745/JIPS.04.0246

Development of a Real-Time Automatic Passenger Counting System using Head Detection Based on Deep Learning

Kim, Hyunduk (Division of Automotive Technology, Daegu Gyeongbuk Institute of Science & Technology)
Sohn, Myoung-Kyu (Division of Automotive Technology, Daegu Gyeongbuk Institute of Science & Technology)
Lee, Sang-Heon (Division of Automotive Technology, Daegu Gyeongbuk Institute of Science & Technology)

Publication Information

Journal of Information Processing Systems / v.18, no.3, 2022 , pp. 428-442 More about this Journal

Abstract

A reliable automatic passenger counting (APC) system is a key point in transportation related to the efficient scheduling and management of transport routes. In this study, we introduce a lightweight head detection network using deep learning applicable to an embedded system. Currently, object detection algorithms using deep learning have been found to be successful. However, these algorithms essentially need a graphics processing unit (GPU) to make them performable in real-time. So, we modify a Tiny-YOLOv3 network using certain techniques to speed up the proposed network and to make it more accurate in a non-GPU environment. Finally, we introduce an APC system, which is performable in real-time on embedded systems, using the proposed head detection algorithm. We implement and test the proposed APC system on a Samsung ARTIK 710 board. The experimental results on three public head datasets reflect the detection accuracy and efficiency of the proposed head detection network against Tiny-YOLOv3. Moreover, to test the proposed APC system, we measured the accuracy and recognition speed by repeating 50 instances of entering and 50 instances of exiting. These experimental results showed 99% accuracy and a 0.041-second recognition speed despite the fact that only the CPU was used.

Keywords

Automatic Passenger Counting; Deep Learning; Embedded System; Head Detection;

Citations & Related Records

Times Cited By KSCI : 1 (Citation Analysis)

Reference
Cited By KSCI

1	J. Redmon, "DarkNet: open source neural networks in C," 2013 [Online]. Available: https://pjreddie.com/darknet/.
2	C. H. Chen, T. Y. Chen, D. J. Wang, and T. J. Chen, "A cost-effective people-counter for a crowd of moving people based on two-stage segmentation," Journal of Information Hiding and Multimedia Signal Processing, vol. 3, no. 1, pp. 12-23, 2012.
3	X. X. Wang an Y. Shen, "A video traffic flow detection system based on machine vision," Journal of Information Processing Systems, vol. 15, no. 5, pp. 1218-1230, 2019. DOI
4	N. Bernini, L. Bombini, M. Buzzoni, P. Cerri, and P. Grisleri, "An embedded system for counting passengers in public transportation vehicles," in Proceedings of 2014 IEEE/ASME 10th International Conference on Mechatronic and Embedded Systems and Applications (MESA), Senigallia, Italy, 2014, pp. 1-6.
5	P. Lengvenis, R. Simutis, V. Vaitkus, and R. Maskeliunas, "Application of computer vision systems for passenger counting in public transport," Elektronika ir Elektrotechnika, vol. 19, no. 3, pp. 69-72, 2013.
6	M. Saqib, S. D. Khan, N. Sharma, and M. Blumenstein, "Person head detection in multiple scales using deep convolutional neural networks," in Proceedings of 2018 International Joint Conference on Neural Networks (IJCNN), Rio de Janeiro, Brazil, 2018, pp. 1-7.
7	J. Redmon and A. Farhadi, "Yolov3: an incremental improvement," 2018 [Online]. Available: https://arxiv.org/abs/1804.02767.
8	J. F. Henriques, R. Caseiro, P. Martins, and J. Batista, "High-speed tracking with kernelized correlation filters," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 37, no. 3, pp. 583-596, 2014. DOI
9	F. Zhang, T. Y. Wu, J. S. Pan, G. Ding, and Z. Li, "Human motion recognition based on SVM in VR art media interaction environment," Human-centric Computing and Information Sciences, vol. 9, article no. 40, 2019. https://doi.org/10.1186/s13673-019-0203-8 DOI
10	H. F. Nweke, Y. W. The, G. Mujtaba, U. R. Alo, and M. A. Al-garadi, "Multi-sensor fusion based on multiple classifier systems for human activity identification," Human-centric Computing and Information Sciences, vol. 9, article no. 34, 2019. https://doi.org/10.1186/s13673-019-0194-5 DOI
11	A. S. A. Nasir, N. K. A. Gharib, and H. Jaafar, "Automatic passenger counting system using image processing based on skin colour detection approach," in Proceedings of 2018 International Conference on Computational Approach in Smart Systems Design and Applications (ICASSDA), Kuching, Malaysia, 2018, pp. 1-8.
12	D. Lefloch, F. A. Cheikh, J. Y. Hardeberg, P. Gouton, and R. Picot-Clemente, "Real-time people counting system using a single video camera," in Proceedings of SPIE 6811: Real-Time Image Processing 2008. Bellingham, WA: International Society for Optics and Photonics, 2008, pp. 71-82.
13	D. Cao, Z. Chen, and L. Gao, "An improved object detection algorithm based on multi-scaled and deformable convolutional neural networks," Human-centric Computing and Information Sciences, vol. 10, article no. 14, 2020. https://doi.org/10.1186/s13673-020-00219-9 DOI
14	R. Girshick, "Fast R-CNN," in Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile, 2015, pp. 1440-1448.
15	M. T. N. Truong and S. Kim, "A tracking-by-detection system for pedestrian tracking using deep learning technique and color information," Journal of Information Processing Systems, vol. 15, no. 4, pp. 1017-1028, 2019. DOI
16	G. Chen, X. Cai, H. Han, S. Shan, and X. Chen, "HeadNet: pedestrian head detection utilizing body in context," in Proceedings of 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG), Xi'an, China, 2018, pp. 556-563.
17	R. Girshick, J. Donahue, T. Darrell, and J. Malik, "Rich feature hierarchies for accurate object detection and semantic segmentation," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, 2014, pp. 580-587.
18	J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You only look once: unified, real-time object detection," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, 2016, pp. 779-788.
19	S. Ren, K. He, R. Girshick, and J. Sun, "Faster R-CNN: towards real-time object detection with region proposal networks," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 39, no. 6, pp. 1137-1149, 2017. DOI
20	W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C. Y. Fu, and A. C. Berg, "SSD: single shot multibox detector," in Computer Vision - ECCV 2016. Cham, Switzerland: Springer, 2016, pp. 21-37.
21	J. Redmon and A. Farhadi, "YOLO9000: better, faster, stronger," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, 2017, pp. 6517-6525.
22	A. G. Howard, M. Zhu, B. Chen, D. Kalenichenko, W. Wang, T. Weyand, M. Andreetto, and H. Adam, "MobileNets: efficient convolutional neural networks for mobile vision applications," 2017 [Online], Available: https://arxiv.org/abs/1704.04861.
23	R. Huang, J. Pedoeem, and C. Chen, "YOLO-LITE: a real-time object detection algorithm optimized for nonGPU computers," in Proceedings of 2018 IEEE International Conference on Big Data (Big Data), Seattle, WA, 2018, pp. 2503-2510.
24	T. H. Vu, A. Osokin, and I. Laptev, "Context-aware CNNs for person head detection," in Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile, 2015, pp. 2893-2901.
25	S. Shao, Z. Zhao, B. Li, T. Xiao, G. Yu, X. Zhang, and J. Sun, "Crowdhuman: a benchmark for detecting human in a crowd," 2018 [Online]. Available: https://arxiv.org/abs/1805.00123.
26	D. Peng, Z. Sun, Z. Chen, Z. Cai, L. Xie, and L. Jin, "Detecting heads using feature refine net and cascaded multi-scale architecture," in Proceedings of 2018 24th International Conference on Pattern Recognition (ICPR), Beijing, China, 2018, pp. 2528-2533.