Browse > Article
http://dx.doi.org/10.5909/JBE.2020.25.3.362

Fast Object Detection with DPM using Adaptive Bilinear Interpolated Image Pyramid  

Han, Gyu-Dong (Department of Electronics Engineering, Korea Polytechnic University)
Kim, Eung-Tae (Department of Electronics Engineering, Korea Polytechnic University)
Publication Information
Journal of Broadcast Engineering / v.25, no.3, 2020 , pp. 362-373 More about this Journal
Abstract
Recently, as autonomous vehicles and intelligent CCTV are growing more interest, the efficient object detection is essential technique. The DPM(Deformable Part Models) which is basis of this paper have used a typical object system that represents highly variable objects using mixtures of deformable part for object. Although it shows high detection performance by capturing part shape and configuration of object model, but it is limited to use in real application due to the complicated algorithm. In this paper, instead of image feature pyramid that takes up a large amount of computation in one part of the detector, we propose a method to reduce the computation speed by reconstructing a new image feature pyramid that uses adaptive bilinear interpolation of feature maps obtained on a specific image scale. As a result, the detection performance for object was lowered a little by 2.82%, however, the proposed detection method improved the speed performance by 10% in comparison with original DPM.
Keywords
object detection; deformable part models; image pyramid; bilinear interpolation;
Citations & Related Records
연도 인용수 순위
  • Reference
1 P. Viola and M. Jones, "Rapid object detection using a boosted cascade of simple features," in Proc. 2001 IEEE Comput. Soc. Conf. Comput. Vision and Pattern Recognition, vol. 1, pp. 511-518, Hawaii, U.S.A., Dec. 2001. http://dx.doi.org/10.1109/CVPR.2001.990517
2 Dalal, N and Triggs, B, "Histograms of oriented gradients for human detection". Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 886-893, 2005. http://dx.doi.org/10.1109/CVPR.2005.177
3 Qiang Zhu, Shi Avidan, Mei-Chen Yeh, Kwang-Ting Cheng, "Fast Human Detection Using a Cascade of Histograms of Oriented Gradients", Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol.2, pp.1491-1498, 2006. http://dx.doi.org/10.1109/CVPR.2006.119
4 P. Dollar and Z. Tu, P. Perona, S. Belongie, "Integral channel features", British Machine Vision Conference, pp. 1-11, 2009. doi:10.5244/C.23.91
5 P. F. Felzenszwalb, R. B. Girshick, D. McAllester, D. Ramanan, "Object Detection with Discriminatively Trained Part Based Models", Pattern Analysis and Machine Intelligence, Vol.32, No.9, pp.1627-1645, 2010. https://doi.org/10.1109/TPAMI.2009.167   DOI
6 R. Benenson, M. Mathias, R. Timofte, and L. Van Gool, "Pedestrian detection at 100 frames per second", In IEEE Conference on Computer Vision and Pattern Recognition, pp. 2903-2910, 16-21 June, 2012. https://doi.org/10.1109/CVPR.2012.6248017
7 M. Everinghan, L. Van Gool, C. K. I. Williams, J. Winn, A. Zisserman, "The PASCAL Visual Object Classes(VOC) Challenge", International Journal of Computer Vision, Vol.88, pp.303-338, 2010. https://doi.org/10.1007/s11263-009-0275-4   DOI
8 P. Felzenszwalb and D. Huttenlocher. Distance transforms of sampled functions. Cornell Computing and Information Science Technical Report TR2004-1963, September 2004. https://hdl.handle.net/1813/5663
9 P. Felzenszwalb and D. Huttenlocher. Pictorial structures for object recognition. Internat. J. Computer Vision, Vol.61, No.1, pp.55-79, Jan, 2005. https://doi.org/10.1023/B:VISI.0000042934.15159.49   DOI
10 P. Burt and E. Adelson. "The Laplacian Pyramid as a Compact Image Code," IEEE Transaction on Communications, Vol.31, No.4, pp.532-540, April 1983 https://doi.org/10.1109/TCOM.1983.1095851   DOI