Template-Matching-based High-Speed Face Tracking Method using Depth Information

Kim, Wooyoul;Seo, Youngho;Kim, Dongwook;

doi:10.5909/JBE.2013.18.3.349

Journal of Broadcast Engineering (방송공학회논문지)

Volume 18 Issue 3
/
Pages.349-361
/
2013
/
1226-7953(pISSN)
/
2287-9137(eISSN)

The Korean Institute of Broadcast and Media Engineers (한국방송∙미디어공학회)

DOI QR Code

Template-Matching-based High-Speed Face Tracking Method using Depth Information

깊이 정보를 이용한 템플릿 매칭 기반의 고속 얼굴 추적 방법

Kim, Wooyoul (Dept. Electronic Materials Eng., Kwangwoon University) ;
Seo, Youngho (College of Liberal Arts, Kwangwoon University) ;
Kim, Dongwook (Dept. Electronic Materials Eng., Kwangwoon University)

김우열 (광운대학교 전자재료공학과) ;
서영호 (광운대학교 교양학부) ;
김동욱 (광운대학교 전자재료공학과)

Received : 2013.03.29
Accepted : 2013.05.22
Published : 2013.05.30

https://doi.org/10.5909/JBE.2013.18.3.349 Citation PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

This paper proposes a fast face tracking method with only depth information. It is basically a template matching method, but it uses a early termination scheme and a sparse search scheme to reduce the execution time to solve the problem of a template matching method, large execution time. Also a refinement process with the neighboring pixels is incorporated to alleviate the tracking error. The depth change of the face being tracked is compensated by predicting the depth of the face and resizing the template. Also the search area is adjusted on the basis of the resized template. With home-made test sequences, the parameters to be used in face tracking are determined empirically. Then the proposed algorithm and the extracted parameters are applied to the other home-made test sequences and a MPEG multi-view test sequence. The experimental results showed that the average tracking error and the execution time for the home-made sequences by Kinect ($640{\times}480$) were about 3% and 2.45ms, while the MPEG test sequence ($1024{\times}768$) showed about 1% of tracking error and 7.46ms of execution time.

본 논문에서는 깊이 정보만을 이용하여 얼굴을 고속으로 추적하는 방법을 제안하다. 그 방법으로는 템플릿 매칭 방법을 사용하며, 템플릿 매칭 방법의 문제점인 과다한 수행시간의 문제를 해결하여 고속으로 얼굴을 추적하기 위하여 조기종료 기법과 sparse 탐색 기법을 적용하고, 그에 따른 추적오류를 보정하고자 주변 화소들을 대상으로 매칭보정을 수행한다. 얼굴의 움직임에 따른 깊이의 변화를 보정하기 위해 추적할 얼굴의 깊이 값을 추정하고 그 결과에 따라 템플릿의 크기를 조정한다. 또한 조정된 템플릿의 크기에 따라 템플릿 매칭을 수행할 탐색영역을 조정한다. 자체 제작한 테스트 시퀀스들을 사용하여 추적에 필요한 파리미터들을 결정하였으며, 또 다른 자체 제작한 테스트 시퀀스들과 MPEG에서 제공한 다시점 테스트 시퀀스를 제안한 방법에 적용하는 실험을 수행하였다. 실험결과 Kinect을 이용하여 자체제작($640{\times}480$) 시퀀스에서는 약 3%의 추적오류와 2.45ms의 수행시간을 보였으며, Lovebird1($1024{\times}768$) 시퀀스에서는 약 1%의 추적 오류와 7.46ms의 수행시간을 보였다.

Keywords

References

G, Q, Zhao, et al., "A Simple 3D face Tracking Method based on Depth Information," Int'l Conf. on Machine Learning and Cybernetics, pp. 5022-5027, Aug. 2005.
C. X. Wang and Z. Y. Li, "A New Face Tracking Algorithm Based on Local Binary Pattern and Skin Color Information," ISCSCT, Vol. 2, pp. 20-22, Dec. 2008.
K. Hariharakrishnan and D. Schonfeld, "Fast object tracking using adaptive block matching," IEEE Trans. Multimedia, vol. 7, no. 5, 2005.
M. Lievin and F. Luthon; "Nonlinear Color Space and Spatiotemporal MRF for Hierarchical Segmentation of Face Features in Video," IEEE Trans. Image Processing, vol. 13, No. 1, Jan. 2004.
Y. Lin et al., "Real-time Face Tracking and Pose Estimation with Partitioned Sampling and Relevance Vector Machine," IEEE Intl. Conf. Robotics and Automation, pp. 453-458, 2009.
A. An and M. Chung, "Robust Real-time 3D Head Tracking based on Online Illumination Modeling and its Application to Face Recognition," IEEE Intl. Conf. Intelligent Robots and Systems, pp. 1466-1471, 2009.
R. Okada, Y. Shirai, and J. Miura, "Tracking a Person with 3-D Motion by Integrating Optical Flow and Depth," Proc. Fourth IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp. 336-341, 2000.
G. Zhao, et al., "A Simple 3D Face Tracking Method based on Depth Information," Intl Conf. Machine Learning and Cybernetics, pp. 5022-5027, 2005.
Y. H. Lee et al., "A Robust Face Tracking using Stereo Camera," SICE Annual Conf., pp. 1985-1989, Sept. 2007.
S. Kosov et al., "Rapid Stereo-vision Enhanced Face Recognition," IEEE Intl. Conf. Image Processing, pp. 2437-2440, Sept. 2010.
Mesa Imaging, SR4000 user manual v2.0, May 2011.
M. Hacker, et al., "Geometric Invariants for Facial Feature Tracking with 3D TOF Cameras," Int'l Symposium on Signals, Circuits and Systems, Vol. 1, pp. 1-4, 2007.
J. L. Wilson, Microsoft kinect for Xbox 360, PC Mag. Com, Nov. 10, 2010.
M. Hacker, et al., "Geometric Invariants for Facial Feature Tracking with 3D TOF Cameras," Int'l Symposium on Signals, Circuits and Systems, Vol. 1, pp. 1-4, 2007.
X. Suau, J. Ruiz-Hidalgo and J. Casas, "Real-Time Head and Hand Tracking Based on 2.5D Data", IEEE Trans. Multimedia, Vol. 14, No. 3, pp. 575-585, June 2012. https://doi.org/10.1109/TMM.2012.2189853
Y.-J. Bae, H.-J. Choi, Y.-H Seo and D.-W. Kim, "A Fast and Accurate Face Detection and Tracking Method by using Depth Information," J. Korean Institute of Communications and Information Science (KICS), Vol. 37A, No. 07, pp. 586-599 https://doi.org/10.7840/KICS.2012.37.7A.586
P. Viola and M. J. Jones, "Robust Real-Time Face Detection," Computer Vision, Vol. 52, No. 2, pp. 137-154, 2004.
S.-K. Kwon and S.-W. Kim, "Motion Estimation Method by Using Depth Camera," J. Broadcast Engineering, Vol. 17, No. 4, pp. 676-683, Jul. 2012. https://doi.org/10.5909/JBE.2012.17.4.676
S.-K. Kwon, Y.-H. Park and K.-R. Kwon, "Zoom Motion Estimation Method by Using Depth Information," J. Korea Multimedia Society, Vol. 16, No. 2, pp. 131-137, Feb. 2013. https://doi.org/10.9717/kmms.2013.16.2.131