Browse > Article
http://dx.doi.org/10.9717/kmms.2020.24.3.416

Activity Object Detection Based on Improved Faster R-CNN  

Zhang, Ning (Dept. of Information and Communication Engineering, Tongmyong University)
Feng, Yiran (Dept. of Information and Communication Engineering, Tongmyong University)
Lee, Eung-Joo (Dept. of Information and Communication Engineering, Tongmyong University)
Publication Information
Abstract
Due to the large differences in human activity within classes, the large similarity between classes, and the problems of visual angle and occlusion, it is difficult to extract features manually, and the detection rate of human behavior is low. In order to better solve these problems, an improved Faster R-CNN-based detection algorithm is proposed in this paper. It achieves multi-object recognition and localization through a second-order detection network, and replaces the original feature extraction module with Dense-Net, which can fuse multi-level feature information, increase network depth and avoid disappearance of network gradients. Meanwhile, the proposal merging strategy is improved with Soft-NMS, where an attenuation function is designed to replace the conventional NMS algorithm, thereby avoiding missed detection of adjacent or overlapping objects, and enhancing the network detection accuracy under multiple objects. During the experiment, the improved Faster R-CNN method in this article has 84.7% target detection result, which is improved compared to other methods, which proves that the target recognition method has significant advantages and potential.
Keywords
Human activity object; Faster R-CNN; Dense-Net; Soft-NMS;
Citations & Related Records
연도 인용수 순위
  • Reference
1 R. Girshick, J. Donahue, T. Darrell, and J. Malik, "Region-Based Convolutional Networks for Accurate Object Detection and Segmentation," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 38, Issue 1, pp. 142-158, 2016.   DOI
2 L. Hang, Statistical learning method, Beijing, Tsinghua University Press, pp. 36-58, 2012.
3 Z. Wenda, X. Yuelei, and N. Jiacheng, "Image Target Recognition Method Based on Multi-Scale Block Convolutional Neural Network," Journal of Computer Applications, Vol. 36, No. 4, pp. 1033-1038, 2016.
4 R. Girshick, "Fast R-CNN," Proceedings of 2015 IEEE International Conference on Computer Vision. Santiago, Chile, pp. 1440-1448, 2015.
5 R. Shaoqing, H. Kaiming, and R. Girshick, "Faster R-CNN: Towards Real-time Object Detection with Region Proposal Networks," Proceedings of the 28th International Conference on Neural Information Processing Systems. Montreal, Canada, pp. 91-99, 2015.
6 H. Kaiming, Z. Xiangyu, and R. Shaoqing, "Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition," Proceedings of the 13th European Conference on Computer Vision. Zurich, Switzerland, pp. 346-361, 2014.
7 Q. Liu, X. Xiang, and J. Qin, "Coverless Image Steganography Based on DenseNet Feature Mapping," EURASIP Journal on Image and Video Processing, Article No. 39, 2020.
8 Y. Liu, X. Wu, and G. Xue, "Multi-target Real-time Detection for Road Traffic Signs Based on Deep Learning," Journal of Guangxi Normal University(Natural Science Edition), Vol. 38, No. 2, pp. 96-106, 2020.
9 X. Wu and X. Ling, "Facial expression recognition based on improved Faster RCNN," CAAI Transactions on Intelligent Systems, DOI:10.11992/tis.201910020.
10 B. Chen, T. Zhao, and J. Liu, "Multipath Feature Recalibration Densenet for Image Classification," International Conference of Machine Learning and Cybernetics, pp. 1-10, 2020.
11 Z. Li, Y. Lin, and A. Elofsson, "Protein Contact Map Prediction Based on ResNet and Dense Net," BioMed Research International, pp. 1-12, 2020.
12 B. Chen, W. Chen, and X. Wei, "Characterization of Elastic Parameters for Functionally Graded Material by a Meshfree Method Combined with the NMS Approach," Inverse Problems in Science and Engineering. Vol. 26, No. 4, pp. 601-617, 2018.   DOI
13 Y. Zhou, Y. Zhang, X. Xie, and S.Y. Kung, "Image Super-Resolution Based on Dense Convolutional Auto-Encoder Blocks," Neurocomputing. Vol. 6, pp. 98-109, 2020.
14 J. Xiao, J. Hays, K.A. Ehinger, A. Oliva, and A. Torralba, "SUN Database: Large-scale Scene Recognition from Abbey to Zoo," IEEE Conference on Computer Vision and Pattern Recognition, 2010, doi: 10.1109/CVPR.2010.55 39970.   DOI
15 J.H. Park and E.-J. Lee, "Human Activity Recognition Based on 3D Residual Dense Network," Journal of Korea Multimedia Society, Vol. 23, No. 12, pp. 1540-1551, 2020.   DOI
16 J. Deng, W. Dong, R. Socher, L.J. Li, K. Li, and L. Fei-Fei, "ImageNet: A Large-Scale Hierarchical Image Database," IEEE Conference on Computer Vision and Pattern Recognition, 2009, doi: 10.1109/CVPR.2009.5206848.   DOI
17 M. Everingham, L. Van Gool, C.K.I. Williams, J. Winn, and A. Zisserman, "The PASCAL Visual Object Classes (VOC) Challenge," International Journal of Computer Vision, Vol. 88, No. 2, pp. 303-338, 2010.   DOI