Browse > Article
http://dx.doi.org/10.9708/jksci.2022.27.05.047

A Study on Recognition of Dangerous Behaviors using Privacy Protection Video in Single-person Household Environments  

Lim, ChaeHyun (Dept. of Software, Soongsil University)
Kim, Myung Ho (Dept. of Software, Soongsil University)
Abstract
Recently, with the development of deep learning technology, research on recognizing human behavior is in progress. In this paper, a study was conducted to recognize risky behaviors that may occur in a single-person household environment using deep learning technology. Due to the nature of single-person households, personal privacy protection is necessary. In this paper, we recognize human dangerous behavior in privacy protection video with Gaussian blur filters for privacy protection of individuals. The dangerous behavior recognition method uses the YOLOv5 model to detect and preprocess human object from video, and then uses it as an input value for the behavior recognition model to recognize dangerous behavior. The experiments used ResNet3D, I3D, and SlowFast models, and the experimental results show that the SlowFast model achieved the highest accuracy of 95.7% in privacy-protected video. Through this, it is possible to recognize human dangerous behavior in a single-person household environment while protecting individual privacy.
Keywords
Deep Learning; Privacy; Action Recognition; YOLOv5; Single-person household;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 WonJong Kim, "Single-Person Households and Crime", Korean Journal of Law and Economics, Vol. 17, No. 1, pp.137-160, 2020. DOI: 10.46758/kjle.2020.04.17.1.137   DOI
2 JangHo Kim, "Lonely death, safety net urgently needed", https://www.kongje.or.kr/news/articleView.html?idxno=880
3 JunHwi Park, "Research on for Strengthening the Efficacy ofCriminal Policy for Public Safety (II) : Improving Safety in Single-Person Household Concentration Areas", korean institute of criminology, pp.1-938, 2017.
4 H. S. Fang, S. Xie, Y. W. Tai and C. Lu, "Rmpe: Regional multi-person pose estimation," Proceedings of the IEEE International Conference on Computer Vision, pp.2334-2343, 2017. DOI: 10.1109/ICCV.2017.256   DOI
5 YoonKyu Kang, HeeYong Kang, DalSoo Weon, "Human Skeleton Keypoints based Fall Detection using GRU", Journal of the Korea Academia-Industrial, Vol. 22, No. 2, pp.127-133, 2021. DOI: 10.5762/KAIS.2021.22.2.127   DOI
6 M. Li, S. Chen, X. Chen, Y. Zhang, Y. Wang, Q. Tian, "Actional-structural graph convolutional networks for skeleton-based action recognition," In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp.3595-3603, 2019. DOI: 10.48550/arXiv.1904.12659   DOI
7 K. Cho, B. Van Merrienboer, C. Gulcehre, D. Bahdanau, F. Bougares, H. Schwenk, Y. Bengio, "Learning phrase representations using RNN encoder-decoder for statistical machine translation," arXiv preprint arXiv:1406.1078, 2014. DOI: 10.3115/v1/D14-1179   DOI
8 HyunJae Bae, GyuJin Jang, YoungHun Kim, JinPyung Kim, "LSTM(Long Short-Term Memory)-Based Abnormal Behavior Recognition Using AlphaPose", KIPS Trans. Softw. and Data Eng, Vol. 10, No. 5, pp.187-194, 2021. DOI: 10.3745/KTSDE.2021.10.5.187   DOI
9 J. Carreira, and A. Zisserman, "Quo vadis, action recognition? a new model and the kinetics dataset," In proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.6299-6308, 2017. DOI: 10.1109/CVPR.2017.502   DOI
10 C. Feichtenhofer, H. Fan, J. Malik, K. He, "Slowfast networks for video recognition," In Proceedings of the IEEE/CVF international conference on computer vision, pp. 6202-6211, 2019. DOI: 10.1109/ICCV.2019.00630   DOI
11 G. Jocher, K. Nishimura, K. Mineeva, T. Vilarino, YOLOv5, https://github.com/ultralytics/yolov5.
12 Aihub, "Introduction of abnormal behavior CCTV video", https://www.aihub.or.kr/aidata/139.
13 A. Shahroudy, J. Liu, T. Ng and G. Wang, G, "Ntu rgb+ d: A large scale dataset for 3d human activity analysis," In Proceedings of the IEEE conference on computer vision and pattern recognition, pp.1010-1019, 2016. DOI: 10.1109/CVPR.2016.115   DOI
14 S. Yan, Y. Xiong, D. Lin, "Spatial temporal graph convolutional networks for skeleton-based action recognition," In Thirty-second AAAI conference on artificial intelligence, 2018. DOI: 10.1186/s13640-019-0476-x   DOI
15 J. Redmon, S. Divvala, R. Girshick and A. Farhadi, "You only look once: Unified, real-time object detection," Proceedings of the IEEE conference on computer vision and pattern recognition, pp779-788, 2016. DOI: 10.1109/CVPR.2016.91   DOI
16 Z. Cao, T. Simon, S. E. Wei, and Y. Sheikh, "Realtime multi-person 2d pose estimation using part affinity fields," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.7291-7299, 2017. DOI: 10.1109/CVPR.2017.143   DOI
17 F. Gers, J. Schmidhuber, F. Cummins, "Learning to forget: Continual prediction with LSTM," Neural computation, vol.12, no.10, pp.2451-2471, 2000. DOI: 10.1162/089976600300015015   DOI
18 R. Girshick, J. Donahue, T. Darrell and J. Malik. "Rich feature hierarchies for accurate object detection and semantic segmentation," Proceedings of the IEEE conference on computer vision and pattern recognition, pp.580-587, 2014. DOI: 10.1109/CVPR.2014.81   DOI