DOI QR코드

DOI QR Code

향상된 캠쉬프트를 사용한 실시간 얼굴추적 방법

Real-time Face Tracking Method using Improved CamShift

  • Lee, Jun-Hwan (Department of Electrical Engineering, KwangWoon University) ;
  • Yoo, Jisang (Department of Electrical Engineering, KwangWoon University)
  • 투고 : 2016.09.07
  • 심사 : 2016.11.07
  • 발행 : 2016.11.30

초록

본 논문에서는 실시간 얼굴 추적을 위하여 기존의 CamShift 알고리즘의 단점을 보완한 새로운 CamShift 알고리즘을 제안한다. 배경 내 추적 객체와 색상이 유사한 객체가 존재할 경우 기존 CamShift 알고리즘은 불안정한 추적을 보여준다. 이러한 문제점을 화소 단위로 거리정보를 획득할 수 있는 Kinect의 깊이 정보와 HSV 색공간 기반의 피부색 후보영역을 추출하는 Skin Detection 알고리즘을 이용하여 색상분포만 이용하는 기존의 CamShift의 단점을 보완한다. 또한 추적하던 객체가 사라지거나 가려짐이 발생할 경우에도 다시 추적할 수 있는 특징점 기반의 매칭 알고리즘을 통하여 차폐영역에 강인한 특성을 가지게 한다. 이러한 향상된 CamShift 알고리즘을 사람의 얼굴 추적에 적용함으로써 다양한 분야에 활용 가능한 강인한 얼굴추적 알고리즘을 제안하고자 한다. 실험결과 제안하는 알고리즘은 기존의 추적 알고리즘인 TLD보다 월등히 빠른 처리속도와 더 우수한 추적성능을 보여주었고, CamShift 보다 조금 느리지만 기존의 CamShift가 가지고 있는 문제점들을 해결하였다.

This paper first discusses the disadvantages of the existing CamShift Algorithm for real time face tracking, and then proposes a new Camshift Algorithm that performs better than the existing algorithm. The existing CamShift Algorithm shows unstable tracking when tracing similar colors in the background of objects. This drawback of the existing CamShift is resolved by using Kinect’s pixel-by-pixel depth information and the Skin Detection algorithm to extract candidate skin regions based on HSV color space. Additionally, even when the tracking object is not found, or when occlusion occurs, the feature point-based matching algorithm makes it robust to occlusion. By applying the improved CamShift algorithm to face tracking, the proposed real-time face tracking algorithm can be applied to various fields. The results from the experiment prove that the proposed algorithm is superior in tracking performance to that of existing TLD tracking algorithm, and offers faster processing speed. Also, while the proposed algorithm has a slower processing speed than CamShift, it overcomes all the existing shortfalls of the existing CamShift.

키워드

참고문헌

  1. Viola, Paul, and Michael J. Jones. "Robust real-time face detection." International journal of computer vision 57.2 (2004): 137-154. https://doi.org/10.1023/B:VISI.0000013087.49260.fb
  2. Rowley, Henry A., Shumeet Baluja, and Takeo Kanade. "Neural network-based face detection." IEEE Transactions on pattern analysis and machine intelligence 20.1 (1998): 23-38. https://doi.org/10.1109/34.655647
  3. Osuna, Edgar, Robert Freund, and Federico Girosit. "Training support vector machines: an application to face detection." Computer vision and pattern recognition, 1997. Proceedings., 1997 IEEE computer society conference on. IEEE, 1997.
  4. Hsu, Rein-Lien, Mohamed Abdel-Mottaleb, and Anil K. Jain. "Face detection in color images." IEEE transactions on pattern analysis and machine intelligence 24.5 (2002): 696-706. https://doi.org/10.1109/34.1000242
  5. Hjelmas, Erik, and Boon Kee Low. "Face detection: A survey." Computer vision and image understanding 83.3 (2001): 236-274. https://doi.org/10.1006/cviu.2001.0921
  6. Kalal, Zdenek, Krystian Mikolajczyk, and Jiri Matas. "Face-tld: Tracking-learning-detection applied to faces." Image Processing (ICIP), 2010 17th IEEE International Conference on. IEEE, 2010.
  7. Kalal, Zdenek, Krystian Mikolajczyk, and Jiri Matas. "Tracking-learning-detection." Pattern Analysis and Machine Intelligence, IEEE Transactions on vol. 34, no. 7, pp. 1409-1422, 2012. https://doi.org/10.1109/TPAMI.2011.239
  8. Young-Gon Kim, Rae-Hong Park, and Seong-Su Mun "Face Detection Using Adaboost and Template Matching of Depth Map based Block Rank Patterns",JBE, Vol. 17, No. 3, pp. 437-446, May 2012.
  9. Hoo Hyun Kim, Dong-Chan Cho, Jong Yeop Bae, Whoi-Yul Kim. "Rotation Invariant Face Detection with Boosted Random Ferns." The Korean Institute of Broadcast and Media Engineers Summer Conference, (2013.6): 52-55.
  10. Kyong-Ho Lee. "Face Tracking Using Face Feature and Color Information." Journal of the Korea Society of Computer and Information , 18.11 (2013.11): 167-174.
  11. Viola, Paul, and Michael Jones. "Rapid object detection using a boosted cascade of simple features." Computer Vision and Pattern Recognition, 2001. CVPR 2001. Proceedings of the 2001 IEEE Computer Society Conference on. vol. 1, pp. 511, 2001
  12. Viola, Paul, and Michael Jones. "Fast and robust classification using asymmetric adaboost and a detector cascade." Advances in Neural Information Processing System 14 (2001).
  13. Jones, Michael J., and James M. Rehg. "Statistical color models with application to skin detection." International Journal of Computer Vision 46.1 (2002): 81-96. https://doi.org/10.1023/A:1013200319198
  14. Vezhnevets, Vladimir, Vassili Sazonov, and Alla Andreeva. "A survey on pixel-based skin color detection techniques." Proc. Graphicon. vol. 3, pp.85-92, September, 2003.
  15. Bradski, Gary R. "Computer vision face tracking for use in a perceptual user interface." (1998).
  16. Allen, John G., Richard YD Xu, and Jesse S. Jin. "Object tracking using camshift algorithm and multiple quantized feature spaces." Proceedings of the Pan-Sydney area workshop on Visual information processing. Australian Computer Society, Inc., 2004.
  17. Wang, Zhaowen, et al. "CamShift guided particle filter for visual tracking." Pattern Recognition Letters 30.4 (2009): 407-413. https://doi.org/10.1016/j.patrec.2008.10.017
  18. Zhang, Zhengyou. "A flexible new technique for camera calibration." IEEE Transactions on pattern analysis and machine intelligence 22.11 (2000): 1330-1334. https://doi.org/10.1109/34.888718
  19. Tsai, Roger. "A versatile camera calibration technique for high-accuracy 3D machine vision metrology using off-the-shelf TV cameras and lenses." IEEE Journal on Robotics and Automation 3.4 (1987): 323-344. https://doi.org/10.1109/JRA.1987.1087109
  20. Weng, Juyang, Paul Cohen, and Marc Herniou. "Camera calibration with distortion models and accuracy evaluation." IEEE Transactions on pattern analysis and machine intelligence 14.10 (1992): 965-980. https://doi.org/10.1109/34.159901
  21. Muhlmann, Karsten, et al. "Calculating dense disparity maps from color stereo images, an efficient implementation." International Journal of Computer Vision 47.1-3 (2002): 79-88. https://doi.org/10.1023/A:1014581421794
  22. Zhang, Zhengyou. "Microsoft kinect sensor and its effect." IEEE multimedia 19.2 (2012): 4-10. https://doi.org/10.1109/MMUL.2012.24
  23. Pagliari, Diana, and Livio Pinto. "Calibration of kinect for xbox one and comparison between the two generations of Microsoft sensors." Sensors 15.11 (2015): 27569-27589. https://doi.org/10.3390/s151127569
  24. Rosten, Edward, and Tom Drummond. "Machine learning for high-speed corner detection." Computer Vision-ECCV 2006. Springer Berlin Heidelberg. pp. 430-443, 2006.
  25. Calonder, Michael, et al. "Brief: Binary robust independent elementary features." Computer Vision-ECCV 2010 pp. 778-792, 2010.
  26. https://www.flickr.com/photos/unavoidablegrain/6884354772/in/photostream/ (Image by Greg Borenstein)
  27. Comaniciu, Dorin, and Peter Meer. "Mean shift: A robust approach toward feature space analysis." IEEE Transactions on pattern analysis and machine intelligence 24.5 (2002): 603-619. https://doi.org/10.1109/34.1000236
  28. Bhattacharyya, Anil. "On a measure of divergence between two multinomial populations." Sankhya: the indian journal of statistics (1946): 401-406.
  29. Trzcinski, Tomasz, and Vincent Lepetit. "Efficient discriminative projections for compact binary descriptors." Computer Vision-ECCV 2012. Springer Berlin Heidelberg. pp. 228-242, 2012.
  30. Danielsson, Per-Erik. "Euclidean distance mapping." Computer Graphics and image processing 14.3 (1980): 227-248. https://doi.org/10.1016/0146-664X(80)90054-4
  31. Muller, Meinard. "Dynamic time warping." Information retrieval for music and motion (2007): 69-84.
  32. Lowe, David G. "Distinctive image features from scale-invariant keypoints."International journal of computer vision vol. 60, no. 2, pp. 91-110, 2004. https://doi.org/10.1023/B:VISI.0000029664.99615.94
  33. Harris, Chris, and Mike Stephens. "A combined corner and edge detector."Alvey vision conference. vol. 15. 1988.
  34. Bay, Herbert, et al. "Speeded-up robust features (SURF)." Computer vision and image understanding vol. 110, no. 3, pp. 346-359, 2008. https://doi.org/10.1016/j.cviu.2007.09.014
  35. Hamming, Richard W. "Error detecting and error correcting codes." Bell System technical journal 29.2 (1950): 147-160. https://doi.org/10.1002/j.1538-7305.1950.tb00463.x
  36. Fischler, Martin A., and Robert C. Bolles. "Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography." Communications of the ACM 24.6 (1981): 381-395. https://doi.org/10.1145/358669.358692
  37. http://tracking.cs.princeton.edu/index.html
  38. Song, Shuran, and Jianxiong Xiao. "Tracking revisited using rgbd camera: Unified benchmark and baselines." Proceedings of the IEEE international conference on computer vision. 2013.
  39. http://darkpgmr.tistory.com/80