Browse > Article
http://dx.doi.org/10.7472/jksii.2017.18.4.61

A Robust Object Detection and Tracking Method using RGB-D Model  

Park, Seohee (Department of Computer Science, Kyonggi University)
Chun, Junchul (Department of Computer Science, Kyonggi University)
Publication Information
Journal of Internet Computing and Services / v.18, no.4, 2017 , pp. 61-67 More about this Journal
Abstract
Recently, CCTV has been combined with areas such as big data, artificial intelligence, and image analysis to detect various abnormal behaviors and to detect and analyze the overall situation of objects such as people. Image analysis research for this intelligent video surveillance function is progressing actively. However, CCTV images using 2D information generally have limitations such as object misrecognition due to lack of topological information. This problem can be solved by adding the depth information of the object created by using two cameras to the image. In this paper, we perform background modeling using Mixture of Gaussian technique and detect whether there are moving objects by segmenting the foreground from the modeled background. In order to perform the depth information-based segmentation using the RGB information-based segmentation results, stereo-based depth maps are generated using two cameras. Next, the RGB-based segmented region is set as a domain for extracting depth information, and depth-based segmentation is performed within the domain. In order to detect the center point of a robustly segmented object and to track the direction, the movement of the object is tracked by applying the CAMShift technique, which is the most basic object tracking method. From the experiments, we prove the efficiency of the proposed object detection and tracking method using the RGB-D model.
Keywords
3D CCTV; Video Surveillance; Object Detection; Object Tracking; Depth Map; Image Segmentation;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Dalal, Navneet, and Bill Triggs, "Histograms of oriented gradients for human detection," Computer Vision and Pattern Recognition (CVPR), Vol 1, pp. 886-893, 2005. https://doi.org/10.1109/cvpr.2005.177
2 Xia, Lu, Chia-Chih Chen, and Jake K. Aggarwal, "Human detection using depth information by kinect," Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 15-22, 2011. https://doi.org/10.1109/cvpr.2005.177
3 Hirschmuller, Heiko, "Stereo processing by semiglobal matching and mutual information," IEEE Transactions on pattern analysis and machine intelligence, Vol 30, No. 2, pp. 328-341, 2008. https://doi.org/10.1109/tpami.2007.1166   DOI
4 Bradski, Gary R, "Computer vision face tracking for use in a perceptual user interface," Intel Technology Journal, pp. 214-219, 1998. http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.14.7673
5 Nguyen, Duc Thanh, Wanqing Li, and Philip O. Ogunbona, "Human detection from images and videos: A survey," Pattern Recognition, pp. 148-175, 2016. https://doi.org/10.1016/j.patcog.2015.08.027
6 Shotton, Jamie, et al., "Real-time human pose recognition in parts from single depth images," Communications of the ACM, Vol 56, No. 1, pp. 116-124, 2013. https://doi.org/10.1145/2398356.2398381   DOI
7 Wei, Shih-En, et al., "Convolutional pose machines," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4724-4732, 2016. https://doi.org/10.1109/cvpr.2016.511
8 Cao, Zhe, et al., "Realtime multi-person 2d pose estimation using part affinity fields," Computer Vision and Pattern Recognition (CVPR), 2017. https://arxiv.org/abs/1611.08050
9 Grant, Jason M., and Patrick J. Flynn., "Crowd Scene Understanding from Video: A Survey," ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Vol 13, No. 2, pp. 19, 2017. https://doi.org/10.1145/3052930
10 Dixit, Astha, Manoj Verma, and Kailash Patidar., "Survey on Video Object Detection & Tracking," International Journal of Current Trends in Engineering & Technology, Vol 2, No. 2, pp. 264-268, 2016. http://ijctet.org/issuedetail.php?id=317
11 Zivkovic, Zoran, "Improved adaptive Gaussian mixture model for background subtraction," Proceedings of the 17th International Conference on Pattern Recognition, pp. 28-31, 2004. https://doi.org/10.1109/icpr.2004.1333992
12 Viola, Paul, and Michael Jones, "Rapid object detection using a boosted cascade of simple features," Computer Vision and Pattern Recognition (CVPR), Vol 1, pp. 511-518, 2001. https://doi.org/10.1109/cvpr.2001.990517