Browse > Article
http://dx.doi.org/10.9717/kmms.2019.22.2.167

Real-time Multiple Pedestrians Tracking for Embedded Smart Visual Systems  

Nguyen, Van Ngoc Nghia (Dept. of Information and Telecommunication Eng., Graduate School, Soongsil University)
Nguyen, Thanh Binh (CublickDigital, Inc.)
Chung, Sun-Tae (Dept. of Intelligent Systems, Graduate School, Soongsil University)
Publication Information
Abstract
Even though so much progresses have been achieved in Multiple Object Tracking (MOT), most of reported MOT methods are not still satisfactory for commercial embedded products like Pan-Tilt-Zoom (PTZ) camera. In this paper, we propose a real-time multiple pedestrians tracking method for embedded environments. First, we design a new light weight convolutional neural network(CNN)-based pedestrian detector, which is constructed to detect even small size pedestrians, as well. For further saving of processing time, the designed detector is applied for every other frame, and Kalman filter is employed to predict pedestrians' positions in frames where the designed CNN-based detector is not applied. The pose orientation information is incorporated to enhance object association for tracking pedestrians without further computational cost. Through experiments on Nvidia's embedded computing board, Jetson TX2, it is verified that the designed pedestrian detector detects even small size pedestrians fast and well, compared to many state-of-the-art detectors, and that the proposed tracking method can track pedestrians in real-time and show accuracy performance comparably to performances of many state-of-the-art tracking methods, which do not target for operation in embedded systems.
Keywords
Pedestrian Tracking; Object Detection; Deep Learning; Object Association;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Hungarian Algorithm, https://en.wikipedia.org/wiki/Hungarian_algorithm (accessed Feb., 12, 2019).
2 S. Ren, K. He, R. Girshick, and J. Sun, "Faster R-CNN: Towards Real-time Object Detection with Region Proposal Networks," Proceedings of the 28th International Conference on Neural Information Processing Systems - Vol. 1, pp. 91-99, 2015.
3 J. Redmon and A. Farhadi, "YOLO9000: Better, Faster, Stronger," Proceeding of 2017 IEEE Conference on Computer Vision and Pattern Recognition, pp. 6517-6525, 2017.
4 W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.Y. Fu, et al., "SSD: Single Shot MultiBox Detector," Proceeding of European Conference on Computer Vision, pp. 1-17, 2016.
5 J. Redmon and A. Farhadi, "YOLOv3: An Incremental Improvement," arXiv:1804.02767, 2018.
6 S. Bell, C.L. Zitnick, K. Bala, and R. Girshick. "Inside-outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2874-2883, 2016.
7 J. Li, X. Liang, Y. Wei, T. Xu, J. Feng, S. Yan, et al., "Perceptual Generative Adversarial Networks for Small Object Detection," Proceeding of 2017 IEEE Conference on Computer Vision and Pattern Recognition, pp. 1222-1230, 2017.
8 J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You Only Look Once Unified, Realtime Object Detection," Proceeding of 2016 IEEE Conference on Computer Vision and Pattern Recognition, pp. 779-788, 2016.
9 Jetson TX2, https://elinux.org/Jetson_TX2 (accessed Feb., 12, 2019).
10 H. Yanga, L. Shaoa, F. Zhenga, L. Wangd, and Z. Songa, "Recent Advances and Trends in Visual Tracking: A Review," Journal of Neu rocomputing, Vol. 74, No. 18, pp. 3823-3831, 2011.
11 A. Bewley, Z. Ge, L. Ott, F. Ramos, and B. Upcroft, "Simple Online and Realtime Tracking," Proceeding of IEEE International Conference on Image Processing, pp. 3464-3468, 2016.
12 N. Wojke, A. Bewley, and D. Paulus, "Simple Online and Realtime Tracking with a Deep Association Metric," Proceeding of IEEE International Conference on Image Processing, pp. 3645-3649, 2017.
13 E. Bochinski, V. Eiselein, and T. Sikora. "High-Speed Tracking-by-Detection Without Using Image Information," Proceeding of International Workshop on Traffic and Street Surveillance for Safety and Security at IEEE Advanced Video and Signal-based Sureillance, pp. 1-8, 2017.
14 Q.D. Vu, T.B. Nguyen, and S.T. Chung, “Simple Online Multiple Pedestrian Tracking Based on LK Feature Tracker and Detection for Embedded Surveillance,” Journal of Korea Multimedia Society, Vol. 20, No. 6, pp. 893-910, 2017.   DOI
15 T.Y Lin, P. Dollar, R. Girshick, K. He, B. Hariharan, S. Belongie, et al., "Feature Pyramid Networks for Object Detection," Proceeding of 2017 IEEE Conference on Computer Vision and Pattern Recognition, pp. 936-944, 2017.
16 L. Leal-Taix, C.C. Ferrer, and K. Schindler, "Learning by Tracking: Siamese CNN for Robust Target Association," Proceeding of 2016 IEEE Computer Vision and Pattern Recognition Conference Workshops, pp. 418-425, 2016.
17 D. Kim, J. Park, and C. Lee "Object-tracking System Using Combination of CAMshift and Kalman Filter Algorithm," Journal of Korea Multimedia Society, Vol. 16, No. 5, pp. 619-628, 2013.   DOI
18 A.G. Howard, M. Zhu, B. Chen, D. Kalenichenko, W. Wang, T. Weyand, et al., "Mobile Nets: Efficient Convolutional Neural Networks for Mobile Vision Applications," arXiv: 1704.04861, 2017.
19 INRIA Person Dataset, http://pascal.inrialpes.fr/data/pedestrian/ (accessed Feb., 12, 2019).
20 MOT Challenge, https://motchallenge.net/ (accessed Feb., 12, 2019).
21 P. Felzenszwalb, R. Girshick, D. McAllester, and D. Ramanan, “Object Detection with Discriminatively Trained Part Based Models,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 32, No. 9, pp. 1627-1645, 2010.   DOI
22 M. Everingham, L. Gool, C.K. Williams, J. Winn, and A. Zisserman, "The PASCAL Visual Object Classes (VOC) Challenge," Journal of Computer Vision. Vol. 88, Issue 2, pp. 303-338, 2010.   DOI
23 Multiple Object Tracking Challenge Development Kit, https://bitbucket.org/amilan/motchallenge-devkit/ (accessed Feb., 12, 2019).