Browse > Article
http://dx.doi.org/10.5909/JBE.2018.23.3.361

DNN Based Multi-spectrum Pedestrian Detection Method Using Color and Thermal Image  

Lee, Yongwoo (School of Electronic and Electrical Engineering, Sungkyunkwan University)
Shin, Jitae (School of Electronic and Electrical Engineering, Sungkyunkwan University)
Publication Information
Journal of Broadcast Engineering / v.23, no.3, 2018 , pp. 361-368 More about this Journal
Abstract
As autonomous driving research is rapidly developing, pedestrian detection study is also successfully investigated. However, most of the study utilizes color image datasets and those are relatively easy to detect the pedestrian. In case of color images, the scene should be exposed by enough light in order to capture the pedestrian and it is not easy for the conventional methods to detect the pedestrian if it is the other case. Therefore, in this paper, we propose deep neural network (DNN)-based multi-spectrum pedestrian detection method using color and thermal images. Based on single-shot multibox detector (SSD), we propose fusion network structures which simultaneously employ color and thermal images. In the experiment, we used KAIST dataset. We showed that proposed SSD-H (SSD-Halfway fusion) technique shows 18.18% lower miss rate compared to the KAIST pedestrian detection baseline. In addition, the proposed method shows at least 2.1% lower miss rate compared to the conventional halfway fusion method.
Keywords
CNN; pedestrian detection; multi-spectrum; network fusion;
Citations & Related Records
연도 인용수 순위
  • Reference
1 P. Dollar, R. Appel, S. Belongie, and P. Perona, “Fast feature pyramids for object detection,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 36, No. 8, pp. 1532-1545, Jan. 2014.   DOI
2 N. Dalal and B. Triggs, "Histograms of oriented gradients for human detection," IEEE Conference on Computer Vision and Pattern Recognition, San Diego, USA, pp. 886-893, 2005.
3 S. Zhang, R. Benenson, M. Omran, J. Hosang, and B. Schiele, "How far are we from solving pedestrian detection?," IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, USA, pp.1259-1267, 2016.
4 J. Wagner, V. Fischer, M. Herman, and S. Behnke, "Multispectral pedestrian detection using deep fusion convolutional neural networks," European Symposium on Artificial Neural Networks, Bruges, Belgium, pp. 509-514, 2016.
5 R. Girshick, J. Donahue, T. Darrell, and J. Malik, "Rich feature hierarchies for accurate object detection and semantic segmentation," IEEE Conference on Computer Vision and Pattern Recognition, Columbus, USA, pp. 580-587, 2014.
6 R. Girshick, "Fast r-cnn," arXiv preprint arXive:1504.08083, 2015.
7 S. Ren, K. He, R. Girshick, and J. Sun, "Faster R-CNN: Towards real- time object detection with region proposal networks," Neural Information Processing Systems, Montreal, Canada, pp. 91-99, 2015.
8 K. He, G. Gkioxari, P. Dollar, R. Girshick, "Mask R-CNN," IEEE International Conference on Computer Vision, Venice, Italy, pp. 2980-2988, 2017.
9 J. Liu, S. Zhang, S. Wang, D. N. Metaxas, "Multispectral deep neural networks for pedestrian detection," arXiv preprint arXiv:1611.02644, 2016.
10 W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C. -Y. Fu, and A. C. Berg, "SSD: Single Shot MultiBox Detector," European Conference on Computer Vision, Amsterdam, the Netherlands, pp. 21-37, 2016.
11 S. Hwang, J. Park, N. Kim, Y. Choi, and I. S. Kweon, "Multispectral pedestrian detection: benchmark dataset and baseline," IEEE Conference on Computer Vision and Pattern Recognition, Boston, USA, pp. 1037-1045, 2015.
12 K. Simonyan, A. Zisserman, "Very deep convolutional networks for large-scale image recognition," arXiv preprint arXiv:1409.1556, 2014.
13 M. Lin, Q. Chen, S. Yan, "Network in Network," arXiv preprint arXive:1312.4400, 2013.
14 P. Dollar, C. Wojek, B. Schiele, and P. Perona, "Pedestrian detection: A benchmark," IEEE Conference on Computer Vision and Pattern Recognition, Miami, USA, pp. 304-311, 2009.