Browse > Article
http://dx.doi.org/10.3745/JIPS.02.0110

A Spatial-Temporal Three-Dimensional Human Pose Reconstruction Framework  

Nguyen, Xuan Thanh (Ecole Superieure d'Ingenieurs en Electrotechnique et Electronique Paris (ESIEE Paris))
Ngo, Thi Duyen (The VNU University of Engineering and Technology (VNU-UET))
Le, Thanh Ha (The VNU University of Engineering and Technology (VNU-UET))
Publication Information
Journal of Information Processing Systems / v.15, no.2, 2019 , pp. 399-409 More about this Journal
Abstract
Three-dimensional (3D) human pose reconstruction from single-view image is a difficult and challenging topic. Existing approaches mostly process frame-by-frame independently while inter-frames are highly correlated in a sequence. In contrast, we introduce a novel spatial-temporal 3D human pose reconstruction framework that leverages both intra and inter-frame relationships in consecutive 2D pose sequences. Orthogonal matching pursuit (OMP) algorithm, pre-trained pose-angle limits and temporal models have been implemented. Several quantitative comparisons between our proposed framework and recent works have been studied on CMU motion capture dataset and Vietnamese traditional dance sequences. Our framework outperforms others by 10% lower of Euclidean reconstruction error and more robust against Gaussian noise. Additionally, it is also important to mention that our reconstructed 3D pose sequences are more natural and smoother than others.
Keywords
3D Human Pose; Reconstruction; Spatial-Temporal Model;
Citations & Related Records
연도 인용수 순위
  • Reference
1 X. T. Nguyen, T. H. Le, and H. Yu, "Motion style extraction based on sparse coding decomposition," 2018 [Online]. Available: https://arxiv.org/abs/1811.06616.
2 C. Sminchisescu and B. Triggs, "Building roadmaps of local minima of visual models," in Computer Vision-ECCV 2002. Heidelberg: Springer, 2002, pp. 566-582.
3 V. Ramakrishna, T. Kanade, and Y. Sheikh, "Reconstructing 3D human pose from 2D image landmarks," in Computer Vision-ECCV 2002. Heidelberg: Springer, 2012, pp. 573-586.
4 I. Akhter and M. J. Black, "Pose-conditioned joint angle limits for 3D human pose reconstruction," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, Mam 2015, pp. 1446-1455.
5 C. Sminchisescu and B. Triggs, "Estimating articulated human motion with covariance scaled sampling," The International Journal of Robotics Research, vol. 22, no. 6, pp. 371-391, 2003.   DOI
6 H. K. Kien, N. K. Hung, M. T. Chau, N. T. Duyen, and N. X. Thanh, "Single view image based-3D human pose reconstruction," in Proceedings of 2017 9th International Conference on Knowledge and Systems Engineering (KSE), Hue, Vietnam, 2017, pp. 118-123.
7 J. Chen, S. Nie, and Q. Ji, "Data-free prior model for upper body pose estimation and tracking," IEEE Transactions on Image Processing, vol. 22, no. 12, pp. 4627-4639, 2013.   DOI
8 J. M. Rehg, D. D. Morris, and T. Kanade, "Ambiguities in visual tracking of articulated objects using two- and three-dimensional models," The International Journal of Robotics Research, vol. 22, no. 6, pp. 393-418, 2003.   DOI
9 L. Pishchulin, E. Insafutdinov, S. Tang, B. Andres, M. Andriluka, P. V. Gehler, and B. Schiele, "Deepcut: joint subset partition and labeling for multi person pose estimation," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, 2016, pp. 4929-4937.
10 F. Bogo, A. Kanazawa, C. Lassner, P. Gehler, J. Romero, and M. J. Black, "Keep it SMPL: automatic estimation of 3D human pose and shape from a single image," in Computer Vision-ECCV 2016. Heidelberg: Springer, 2016, pp. 561-578.
11 B. Tekin, I. Katircioglu, M. Salzmann, V. Lepetit, and P. Fua, "Structured prediction of 3D human pose with deep neural networks," 2016 [Online]. Available: https://arxiv.org/abs/1605.05180.
12 J. Redmon and A. Farhadi, "YOLO9000: better, faster, stronger," 2016 [Online]. Available: https://arxiv.org/abs/1612.08242.
13 CMU Graphics Lab Motion Capture Database [Online]. Available: http://mocap.cs.cmu.edu/.
14 F. L. Bookstein, Morphometric Tools for Landmark Data: Geometry and Biology. Cambridge, UK: Cambridge University Press, 1991.