Browse > Article
http://dx.doi.org/10.9717/kmms.2022.25.4.575

Motion Recognition of Workers using Skeleton and LSTM  

Jeon, Wang Su (Dept. of Computer Engineering., Kyungnam University)
Rhee, Sang Yong (Dept. of Computer Engineering., Kyungnam University)
Publication Information
Abstract
In the manufacturing environment, research to minimize robot collisions with human beings have been widespread, but in order to interact with robots, it is important to precisely recognize and predict human actions. In this research, after enhancing performance by applying group normalization to the Hourglass model to detect the operator motion, the skeleton was estimated and data were created using this model. And then, three types of operator's movements were recognized using LSTM. As results of the experiment, the accuracy was enhanced by 1% using group normalization, and the recognition accuracy was 99.6%.
Keywords
Pose Estimation; Pose Classification; Hourglass; LSTM; CNN; Group Normalization;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 N, Alejandro, K. Yang, and J. Deng, "Stacked Hourglass Networks for Human Pose Estimation," European Conference on Computer Vision, pp. 483-499, 2016.
2 3D Human Pose Estimation Experiments and Analysis(2020), https://www.kdnuggets.com/2020/08/3d-human-pose-estimation-experiments-analysis.html (accessed March 14, 2022).
3 P.F. Felzenszwalb and D.P. Huttenlocher, "Pictorial Structures for Object Recognition," International Journal of Computer Vision, Vol. 61, pp. 55-79, 2005.   DOI
4 W. Umer, H. Li, G.P.Y. Szeto, and A.Y.L. Wong, "Identification of Biomechanical Risk Factors for The Development of Lower-Back Disorders During Manual Rebar Tying," Journal of Construction Engineering and Management, Vol. 143, No. 1, 2017.
5 Y. Wang and G. Mori, "Multiple Tree Models for Occlusion and Spatial Constraints in Human Pose Estimation," European Conference on Computer Vision, pp. 710-724, 2008.
6 Z. Cao, T. Simon, S.E. Wei, and Y. Sheikh, "Real Time Multi-Person 2D Pose Estimation using Part Affinity Fields," IEEE Conference on Computer Vision and Pattern Recognition, pp. 7291-7299, 2017.
7 B. Tekin, S.N. Sudipta, and P. Fua, "Real-Time Seamless Single Shot 6D Object Pose Prediction," IEEE Conference on Computer Vision and Pattern Recognition, pp. 292-301, 2018.
8 S. Peng, Y. Liu, Q. Huang, X. Zhou, and H. Bao, "PVNet: Pixel-Wise Voting Network for 6-DOF Pose Estimation," IEEE Conference on Computer Vision and Pattern Recognition, pp. 4561-4570, 2019.
9 C. Jiayu, Q. Jun, and A. Changbum, "Construction Worker's Awkward Posture Recognition through Supervised Motion Tensor Decomposition," Automation in Construction, Vol. 77, pp. 67-81, 2017.   DOI
10 K. Osmo, K. Pekka, and K. likka, "Correcting Working Postures in Industry a Practical Method for Analysis," Applied Ergonomics, Vol. 8, No. 4, pp. 199-201, 1977.   DOI
11 K. Adhikari, H. Bouchachia, and H.N. Charif, "Deep Learning Based Fall Detection Using Silplified Human Posture," International Journal of Computer and Systems Engineering, Vol. 13, No. 5, 2019.
12 M. Andriluka, S. Roth, and B. Schiele, "Monocular 3D Pose Estimation and Tracking by Detection," IEEE Conference on Computer Vision and Pattern Recognition, pp. 623-630, 2010.
13 D. Ramanan, D.A. Forsyth, and A. Zisserman, "Strike a Pose: Tracking People by Finding Stylized Poses," IEEE Conference on Computer Vision and Pattern Recognition, pp. 271-278, 2005.
14 L. Sigal and M.J. Black, "Measure Locally, Reason Globally: Occlusion-Sensitive Articulated Pose Estimation," IEEE Conference on Computer Vision and Pattern Recognition, pp. 2041-2048, 2006.
15 X. Lan and D. P. Huttenlocher, "Beyond Trees : Common-Factor Models for 2D Human Pose Recovery," International Conference on Computer Vision, pp. 470-477, 2005.
16 K. He, G. Gkioxari, P. Dollar, and R. Girshick, "Mask R-CNN," International Conference on Computer Vision, pp. 2961-2669, 2017.
17 L. Karlinsky and S. Ullman, "Using Linking Features in Learning Non-Parametric Part Models," European Conference on Computer Vision, pp. 326-339, 2012.
18 J.J. Tompson, A. Jain, Y. LeCun, and C. Bregler, "Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation," Advance in Neural Information Processing Systems, pp. 1799-1807, 2014.
19 T. Pfister, J. Charles, and A. Zisserman, "Flowing Convnets for Human Pose Estimation in Videos," International Conference on Computer Vision, pp. 1913-1921, 2015.
20 S.E. Wei, V. Ramakrishna, T. Kanade, and Y. Sheikh, "Convolutional Pose Machines," IEEE Conference on Computer Vision and Pattern Recognition, pp. 4724-4732, 2016.
21 J.Y. Kim and J.I. Park, "HOG-HOD Algorithm for Recognition of Multi-cultural Hand Gestures," Journal of Korea Multimedia Society, Vol. 20. No. 8, pp. 1187-1199, 2017.   DOI
22 M. Andriluka, L. Pishchulin, P. Gehler, and B. Schiele, "2D Human Pose Estimation: New Benchmark and State of the art Analysis," IEEE Conference on computer Vision and Pattern Recognition, pp. 3686-36893, 2014.
23 K.J. Cheoi and J.H. Han, "A Novel Door Security System using Hand Gesture Recognition," Journal of Korea Multimedia Society, Vol. 19, No. 8, pp. 1320-1328, 2016.   DOI
24 M. Sandler, A. Howard, M. Zhu, W. Yuxin, and K. He. "Group Normalization," European Conference on Computer Vision, pp. 3-19, 2018.
25 S. Johnson and M. Everingham, "Clustered Pose and Nonlinear Appearance Models for Human Pose Estimation," British Machine Vision Conference, Vol. 2, No. 4, pp. 1-11, 2010.
26 M. Sandler, A. Howard, M. Zhu, A. Zhmoginov, and L.C. Chen. "Mobilenet v2: Inverted Residuals and Linear Bottlenecks," IEEE Conference on Computer Vision and P attern Recognition, pp. 4510-4520, 2018.
27 S. Hochreiter and J. Schmidhuber, "Long Short-Term Memory," Neural Computation, No. 9. Vol. 8, pp. 1735-1780, 1997.   DOI