Browse > Article
http://dx.doi.org/10.4218/etrij.13.2013.0063

Fast Random-Forest-Based Human Pose Estimation Using a Multi-scale and Cascade Approach  

Chang, Ju Yong (Creative Content Research Laboratory, ETRI)
Nam, Seung Woo (Creative Content Research Laboratory, ETRI)
Publication Information
ETRI Journal / v.35, no.6, 2013 , pp. 949-959 More about this Journal
Abstract
Since the recent launch of Microsoft Xbox Kinect, research on 3D human pose estimation has attracted a lot of attention in the computer vision community. Kinect shows impressive estimation accuracy and real-time performance on massive graphics processing unit hardware. In this paper, we focus on further reducing the computation complexity of the existing state-of-the-art method to make the real-time 3D human pose estimation functionality applicable to devices with lower computing power. As a result, we propose two simple approaches to speed up the random-forest-based human pose estimation method. In the original algorithm, the random forest classifier is applied to all pixels of the segmented human depth image. We first use a multi-scale approach to reduce the number of such calculations. Second, the complexity of the random forest classification itself is decreased by the proposed cascade approach. Experiment results for real data show that our method is effective and works in real time (30 fps) without any parallelization efforts.
Keywords
Human pose estimation; depth data; random forest; interactive digital contents;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 S. Knoop, S. Vacek, and R. Dillamnn, "Sensor Fusion for 3D Human Body Tracking with an Articulated 3D Body Model," IEEE Int. Conf. Robotics Autom., May 2006, pp. 1686-1691.
2 A. Baak et al., "A Data-Driven Approach for Real-Time Full Body Pose Reconstruction from a Depth Camera," Int. Conf. Comput. Vis., 2011, pp. 1092-1099.
3 V. Ganapathi, "Real Time Motion Capture Using a Single Timeof- Flight Camera," Comput. Vis. Pattern Recog., 2010, pp. 755- 762.
4 R. Girshick et al., "Efficient Regression of General-Activity Human Poses from Depth Images," Int. Conf. Comput. Vis., 2011, pp. 415-422.
5 D. Grest, J. Woetzel, and R. Koch, "Nonlinear Body Pose Estimation from Depth Images," DAGM, 2005, pp. 285-292.
6 C. Plagemann et al., "Real-Time Identification and Localization of Body Parts from Depth Images," IEEE Int. Conf. Robotics Autom., May 2010, pp. 3108-3113.
7 J. Shotton et al., "Real-Time Human Pose Recognition in Parts from Single Depth Images," Comput. Vis. Pattern Recog., June 2011, pp. 1297-1304.
8 J. Shotton et al., "Efficient Human Pose Estimation from Single Depth Images," IEEE Trans. Pattern Anal. Mach. Intell., vol. 35, 2012, pp. 2821-2840.
9 M. Siddiqui and G. Medioni, "Human Pose Estimation from a Single View Point, Real-Time Range Sensor," CVCG CVPR, San Francisco, CA, USA, June 2010.
10 M. Sun, P. Kohli, and J. Shotton, "Conditional Regression Forests for Human Pose Estimation," Comput. Vis. Pattern Recog., 2012, pp. 3394-3401.
11 J. Taylor et al., "The Vitruvian Manifold: Inferring Dense Correspondences for One-Shot Human Pose Estimation," Comput. Vis. Pattern Recog., 2012, pp. 103-110.
12 M. Ye and X. Wang, "Accurate 3D Pose Estimation from a Single Depth Image," Int. Conf. Comput. Vis., 2011, pp. 731-738.
13 Y. Zhu, B. Dariush, and K. Fujimura, "Controlled Human Pose Estimation from Depth Image Streams," Comput. Vis. Pattern Recog., Anchorage, AK, USA, June 2008.
14 Y. Zhu and K. Fujimura, "Constrained Optimization for Human Pose Estimation from Depth Sequences," Asian Conf. Comput. Vis., Nov. 2007, pp. 408-418.
15 Microsoft Corp, "Kinect for XBOX 360," Redmond, WA, USA.
16 J. Ziegler, K. Nickel, and R. Stiefelhagen, "Tracking of the Articulated Upper Body on Multi-view Stereo Image Sequences," Comput. Vis. Pattern Recog., 2006, pp. 774-781.
17 D. Comaniciu and P. Meer, "Mean Shift: A Robust Approach toward Feature Space Analysis," IEEE Trans. Pattern Anal. Mach. Intell., vol. 24, 2002, pp. 603-619.   DOI   ScienceOn
18 A. Hernández-Vela et al., "Graph Cuts Optimization for Multilimb Human Segmentation in Depth Maps," Comput. Vis. Pattern Recog., June 2012, pp. 726-732.
19 D. Jo et al., "Tracking and Interaction Based on Hybrid Sensing for Virtual Environments," ETRI J., vol. 35, no. 2, Apr. 2013, pp. 356-359.   DOI   ScienceOn
20 L. Breiman, "Random Forests," Mach. Learning, vol. 45, 2001, pp. 5-32.   DOI   ScienceOn
21 S. Escalera, "Human Behavior Analysis from Depth Maps," AMDO, LNCS, vol. 7378, 2012, pp. 282-292.
22 T.B. Moeslund, A. Hilton, and V. Krüger, "A Survey of Advances in Vision-Based Human Motion Capture and Analysis," Comput. Vis. Image Understanding, vol. 104, 2006, pp. 90-126.   DOI   ScienceOn
23 R. Poppe, "Vision-Based Human Motion Analysis: An Overview," Comput. Vis. Image Understanding, vol. 108, 2007, pp. 4-18.   DOI   ScienceOn
24 V. Lepetit, P. Lagger, and P. Fua, "Randomized Trees for Real- Time Keypoint Recognition," Comput. Vis. Pattern Recog., 2005, pp. 775-781.
25 F. Moosmann, B. Triggs, and F. Jurie, "Fast Discriminative Visual Codebooks Using Randomized Clustering Forests," Neural Inf. Process. Syst., 2006, pp. 985-992.
26 J. Shotton, M. Johnson, and R. Cipolla, "Semantic Texton Forests for Image Categorization and Segmentation," Comput. Vis. Pattern Recog., Anchorage, AK, USA, June 2008.
27 L. Breiman, "Bagging Predictors," Mach. Learning, vol. 24, 1996, pp. 123-140.
28 T.K. Ho, "The Random Subspace Method for Constructing Decision Forests," IEEE Trans. Pattern Anal. Mach. Intell., vol. 20, 1998, pp. 832-844.   DOI   ScienceOn
29 Y. Amit and D. Geman, "Shape Quantization and Recognition with Randomized Trees," Neural Comput., vol. 9, 1997, pp. 1545-1588.   DOI   ScienceOn