DOI QR코드

DOI QR Code

A Real-time People Counting Algorithm Using Background Modeling and CNN

배경모델링과 CNN을 이용한 실시간 피플 카운팅 알고리즘

  • Yang, HunJun (Dept. of Electronic Engineering, Inha University) ;
  • Jang, Hyeok (ETRI (Electronics and Telecommunications Research Institute)) ;
  • Jeong, JaeHyup (Dept. of Electronic Engineering, Inha University) ;
  • Lee, Bowon (Dept. of Electronic Engineering, Inha University) ;
  • Jeong, DongSeok (Dept. of Electronic Engineering, Inha University)
  • Received : 2016.09.29
  • Accepted : 2017.02.28
  • Published : 2017.03.25

Abstract

Recently, Internet of Things (IoT) and deep learning techniques have affected video surveillance systems in various ways. The surveillance features that perform detection, tracking, and classification of specific objects in Closed Circuit Television (CCTV) video are becoming more intelligent. This paper presents real-time algorithm that can run in a PC environment using only a low power CPU. Traditional tracking algorithms combine background modeling using the Gaussian Mixture Model (GMM), Hungarian algorithm, and a Kalman filter; they have relatively low complexity but high detection errors. To supplement this, deep learning technology was used, which can be trained from a large amounts of data. In particular, an SRGB(Sequential RGB)-3 Layer CNN was used on tracked objects to emphasize the features of moving people. Performance evaluation comparing the proposed algorithm with existing ones using HOG and SVM showed move-in and move-out error rate reductions by 7.6 % and 9.0 %, respectively.

최근 IoT 및 딥러닝 관련 기술요소들이 영상보안감시시스템에서도 다양하게 응용되고 있다. 그 중 CCTV를 통해 촬영된 동영상에서 자동으로 특정 객체를 검출, 추적, 분류 하는 감시 기능이 점점 지능화되고 있다. 본 논문에서는 보급형 CPU만 사용하는 PC 환경에서도 실시간 처리가 가능한 알고리즘을 목표로 하였다. GMM(Gaussian Mixture Model)을 이용한 배경 모델링과 헝가리안 알고리즘, 그리고 칼만 필터를 조합한 추적 알고리즘은 전통적이며 복잡도가 비교적 적지만 검출 오류가 높다. 이를 보강하기 위해 대용량 데이터 학습에 적합한 딥러닝을 기술을 적용하였다. 특히 움직임이 있는 사람의 특징을 강조하기 위해 추적된 객체에 대해 SRGB-3 Layer CNN을 사용하였다. 성능 평가를 위해 기존의 HOG와 SVM을 이용한 시스템과 비교했을 때 Move-in은 7.6%, Move-out은 9.0%의 오류율 감소가 있었다.

Keywords

References

  1. K. Shvachko, H. Kuang, S. Radia, and R. Chansler, "The Hadoop Distributed File System," IEEE 26th Symposium on Mass Storage Systems and Technologies(MSST), pp. 1-10, 3-7 May 2010.
  2. C. Stauffer and W. Grimson, "Learning Patterns of Activity Using Real-Time Tracking," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 22, pp. 747-757, 2000. https://doi.org/10.1109/34.868677
  3. P. Viola, M.J. Jones, and D. Snow, "Detecting pedestrians using patterns of motion and appearance," IJCV, vol. 63, pp. 153-161, 2005. https://doi.org/10.1007/s11263-005-6644-8
  4. N. Dalal and B. Triggs, "Histograms of oriented gradients for human detection," In CVPR, pp. 886-893, 2005.
  5. X. Wang, T.X. Han, S. Yan, "An HOG-LBP human detector with partial occlusion handling," ICCV, pp. 32-39, 2009.
  6. P. Dollar, Z. Tu, P. Perona, and S. Belongie, "Integral channel features," In BMVC, pp. 1-11, 2009.
  7. P.F. Felzenszwalb, R.B. Girshick, D. McAllester, and D. Ramanan, "Object detection with discriminatively trained part-based models," TPAMI, vol. 32, pp. 1627-1645, 2010. https://doi.org/10.1109/TPAMI.2009.167
  8. Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," Proceedings of the IEEE, vol. 86, pp. 2278-2324, 1998. https://doi.org/10.1109/5.726791
  9. A. Krizhevsky, I. Sutskever, and G.E. Hinton, "Imagenet classification with deep convolutional neural networks," NIPS 2012, pp. 1097-1105, 2012.
  10. C. Szegedy et al, "Going deeper with convolutions," CoRR, vol. abs/1409.4842, 2014.
  11. K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," arXiv preprint arXiv:1512.03385, 2015.
  12. J. Munkres, "Algorithms for the Assignment and Transportation Problems," Journal of the Society for Industrial and Applied Mathematics, vol. 5, no. 1, pp. 32-38, 1957. https://doi.org/10.1137/0105003
  13. F. Lutteke, X. Zhang, and J. Franke, "Implementation of the Hungarian Method for object tracking on a camera monitored transportation system," ROBOTIK 2012, pp. 1-6, 2012.
  14. R. Rad and M. Jamzad, "Real-time classification and tracking of multiple vehicles in highways," Pattern Recognition Letters, vol. 26, pp. 1597-1607, 2005. https://doi.org/10.1016/j.patrec.2005.01.010