A Three-scale Pedestrian Detection Method based on Refinement Module

  • Received : 2023.04.06
  • Reviewed : 2023.07.16
  • Published : 2023.10.31


Deep-learning-based pedestrian detection aims to detect pedestrians reliably in a wide range of situations. Detection remains difficult because of factors such as camera performance, pedestrian appearance, height, and occlusion. Even for the same pedestrian, detection performance can differ according to the pedestrian's height. Pedestrian heights span a range of scales, such as those of infants, adolescents, and adults, so a model fitted to a single group extracts data inaccurately for the others. This study therefore proposes a pedestrian detection method that refines the pedestrian area using a Refining Layer and Feature Concatenation to account for the various heights of pedestrians. With these components, the score and location value of each pedestrian area are finely adjusted. Experiments on four types of test data show that the proposed model achieves 2-5% higher average precision (AP) than Faster R-CNN and DRPN.
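Since the comparison above is reported in average precision (AP), a minimal sketch of how AP can be computed from ranked detections may help in reading the figures. It assumes the all-point interpolated variant (area under the precision-recall envelope). The abstract does not state which AP variant or matching rule the study used, so the function name `average_precision`, its parameters, and the toy inputs below are illustrative assumptions rather than the authors' evaluation code.

```python
def average_precision(scores, is_true_positive, num_ground_truth):
    """All-point interpolated AP for one class (an assumed variant, see lead-in).

    scores            -- confidence of each detection
    is_true_positive  -- True if that detection matched a ground-truth pedestrian
    num_ground_truth  -- total number of ground-truth pedestrians
    """
    if num_ground_truth <= 0:
        raise ValueError("num_ground_truth must be positive")

    # Rank detections from most to least confident.
    ranked = sorted(zip(scores, is_true_positive), key=lambda pair: -pair[0])

    recalls, precisions = [], []
    tp = fp = 0
    for _, hit in ranked:
        if hit:
            tp += 1
        else:
            fp += 1
        recalls.append(tp / num_ground_truth)
        precisions.append(tp / (tp + fp))

    # Precision envelope: best precision achievable at this rank or any later one.
    for i in range(len(precisions) - 2, -1, -1):
        precisions[i] = max(precisions[i], precisions[i + 1])

    # Integrate the envelope over recall.
    ap, prev_recall = 0.0, 0.0
    for recall, precision in zip(recalls, precisions):
        ap += (recall - prev_recall) * precision
        prev_recall = recall
    return ap


# Toy, made-up rankings over 4 ground-truth pedestrians (not data from the paper).
scores = [0.9, 0.8, 0.7, 0.6]
ranking_a = [True, False, True, True]   # false positive ranked second
ranking_b = [True, True, False, True]   # false positive ranked third
print(average_precision(scores, ranking_a, 4))  # 0.625
print(average_precision(scores, ranking_b, 4))  # 0.6875
```

In this toy case, moving a single false positive one rank lower raises AP from 0.625 to 0.6875, which illustrates how finer adjustment of detection scores can show up as an AP gain even when the set of detected boxes is unchanged.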



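This work was carried out under the 2022 Culture, Sports and Tourism R&D Program of the Ministry of Culture, Sports and Tourism and the Korea Creative Content Agency (Project title: Development of Augmented Device Technology for Combined Cognitive and Physical Intervention Rehabilitation Exercise, Project number: SR202106002, Contribution rate: 100%).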

