Browse > Article
http://dx.doi.org/10.3837/tiis.2015.08.023

An Improved Approach for 3D Hand Pose Estimation Based on a Single Depth Image and Haar Random Forest  

Kim, Wonggi (Department of Computer Science, Kyonggi University)
Chun, Junchul (Department of Computer Science, Kyonggi University)
Publication Information
KSII Transactions on Internet and Information Systems (TIIS) / v.9, no.8, 2015 , pp. 3136-3150 More about this Journal
Abstract
A vision-based 3D tracking of articulated human hand is one of the major issues in the applications of human computer interactions and understanding the control of robot hand. This paper presents an improved approach for tracking and recovering the 3D position and orientation of a human hand using the Kinect sensor. The basic idea of the proposed method is to solve an optimization problem that minimizes the discrepancy in 3D shape between an actual hand observed by Kinect and a hypothesized 3D hand model. Since each of the 3D hand pose has 23 degrees of freedom, the hand articulation tracking needs computational excessive burden in minimizing the 3D shape discrepancy between an observed hand and a 3D hand model. For this, we first created a 3D hand model which represents the hand with 17 different parts. Secondly, Random Forest classifier was trained on the synthetic depth images generated by animating the developed 3D hand model, which was then used for Haar-like feature-based classification rather than performing per-pixel classification. Classification results were used for estimating the joint positions for the hand skeleton. Through the experiment, we were able to prove that the proposed method showed improvement rates in hand part recognition and a performance of 20-30 fps. The results confirmed its practical use in classifying hand area and successfully tracked and recovered the 3D hand pose in a real time fashion.
Keywords
3D hand-pose estimation; Random Forest algorithm; Depth map; Kinect Sensor;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Fanelli G., Gall J. and Van Gool L.,"Real time head pose estimation with random regression forests," in Proc. of IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2011, pp. 617-624, 2011. Article(CrossRef Link)
2 Wenkai Xu and Eung-Joo Lee, “A Novel Method for Hand Posture Recognition Based on Depth Information Descriptor,” KSII Transcation on Internet and Information Systems, vol. 9, no. 2, pp. 763-774, 2015. Article(CrossRef Link)   DOI
3 T. Dang, T,-H. Yu and T.-K. Kim, "Real-time articulated hand pose estimation using semi-supervised transductive regression forests," in Proc. of 15th IEEE International Conference on Computer Vision, ICCV 2013, pp. 3225-3231, 2013. Article(CrossRef Link)
4 L. Ballan, A. Taneja, J. Gall et al, "Motion capture of hands in action using discriminative salient points," in Proc. of 12th European Conference on Computer Vision, ECCV 2012, Lecture Notes in Computer Science 7577, pp. 640-653, 2012. Article(CrossRef Link)
5 M. de La Gorce, D. Fleet, and N. Paragios, “Model-based 2D hand pose estimation from monocular video,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 33, no. 9, pp. 1793-1805, 2011. Article(CrossRef Link)   DOI
6 S. Malassiotis and M. Strintzis, “Real-time hand posture recognition using range data,” Image and Vision Computing, vol. 26, no. 7, pp. 1027-1-37, 2008. Article(CrossRef Link)   DOI
7 Z. Mo and U. Neumann, "Real-time Hand Pose Recognition Using Low-Resolution Depth Images," in Proc. of IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2006, vol. 2 pp. 1499-1505, 2006. Article(CrossRef Link)
8 A. Erol, G. Bebis, M. Nicolescu et al, “Vision-based hand pose estimation: A review,” Computer Vision and Image Understanding, vol. 108, pp. 52-73, 2007. Article(CrossRef Link)   DOI
9 Cheng, Yizong, “Mean Shift, Mode Seeking, and Clustering,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 17, no. 8, pp. 790–799, 1995. Article(CrossRef Link)   DOI
10 I. Oikonomidis, N. Kyriazis, and A. Argyros, "Markerless and efficient 26-DOF hand pose recovery," in Proc. of 10th Asian conference on Computer Vision-Vol. Part III, pp. 744-757, 2011. Article(CrossRef Link)
11 C. Keskin, F. Kirac, Y. Kara, L. Akarun, "Real Time Hand Pose Estimation using Depth Sensors," in Proc. of 13th IEEE International Conference on Computer Vision, ICCV 2011, pp. 1228-1234, 2011. Article(CrossRef Link)
12 J. Taylor, J. Shotton, T. Sharp, and A. Fitzgibbon, "The vitruvian manifold: Inferring dense correspondence for one-shot human pose estimation," in Proc. of 12th IEEE International Conference on Computer Vision and Pattern Recognition, CVPR 2012, pp. 103-110, 2012. Article(CrossRef Link)
13 J. Shotton, A. Fitzgibbon, M. Cook et al, “Real-time human pose recognition in parts from single depth image,” Communication of the ACM, vol. 56, no. 1, pp. 116-124, 2013. Article(CrossRef Link)   DOI
14 R. Girshick, J. Shotton, P. Kohli, A. Criminisi, and A. Fitzgibbon, "Efficient regression of general-activity human pose from depth images," in 13th Proc. of International Conference on Computer Vision, ICCV 2011, pp. 415-422, 2011. Article(CrossRef Link)
15 G. Pons-Moll, J. Taylor, J. Shotton, A. Hertzmann, and A. Fizgibbon, "Metric regression forest for human pose estimation," in Proc. of 24th British Machine Vision Conference, BMVC 2013, pp. 4.1-4.11, 2013. Article(CrossRef Link)
16 M. Sun, P. Kohil, and J. Sotton, "Conditional regression forest for human pose estimation," in Proc. of 12th IEEE International Conference on Computer Vision and pattern Recognition, CVPR 2012, pp. 3394-3401, 2012. Article(CrossRef Link)
17 Microsoft Corp. Redmond WA. Kinect for xbox 360.
18 L. Breiman, “Random Forests,” Machine Learning, vol. 45, no.1, pp. 5-32, 2001. Article(Cross Ref Link)   DOI