Browse > Article
http://dx.doi.org/10.9717/kmms.2016.19.12.1909

Multi-spectral Vehicle Detection based on Convolutional Neural Network  

Choi, Sungil (Dept. of Electrical and Electronic Engineering, Yonsei University)
Kim, Seungryong (Dept. of Electrical and Electronic Engineering, Yonsei University)
Park, Kihong (Dept. of Electrical and Electronic Engineering, Yonsei University)
Sohn, Kwanghoon (Dept. of Electrical and Electronic Engineering, Yonsei University)
Publication Information
Abstract
This paper presents a unified framework for joint Convolutional Neural Network (CNN) based vehicle detection by leveraging multi-spectral image pairs. With the observation that under challenging environments such as night vision and limited light source, vehicle detection in a single color image can be more tractable by using additional far-infrared (FIR) image, we design joint CNN architecture for both RGB and FIR image pairs. We assume that a score map from joint CNN applied to overall image can be considered as confidence of vehicle existence. To deal with various scale ratios of vehicle candidates, multi-scale images are first generated scaling an image according to possible scale ratio of vehicles. The vehicle candidates are then detected on local maximal on each score maps. The generation of overlapped candidates is prevented with non-maximal suppression on multi-scale score maps. The experimental results show that our framework have superior performance than conventional methods with a joint framework of multi-spectral image pairs reducing false positive generated by conventional vehicle detection framework using only single color image.
Keywords
Vehicle Detection; Convolutional Neural Network; Multi-spectral Imaging;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 S. Zehang, B. George, and M. Ronald, "Onroad Vehicle Detection Using Gabor Filters and Support Vector Machines," Proceeding of International Conference on Digital Signal Processing, pp. 1019-1022, 2002.
2 J. Seo and K. Sohn, "Superpixel-based Vehicle Detection Using Plane Normal Vector in Disparity Space," Journal of Korea Multimedia Society, Vol. 19, No. 6, pp. 1003-1013, 2016.   DOI
3 Y. Lee, T. Kim, and J. Shim, "Two-wheeler Detection System Using Histogram of Oriented Gradients Based on Local Correlation Coefficients and Curvature," Journal of Korea Multimedia Society, Vol. 2, No. 4, pp. 303-310, 2015.
4 F. Pedro, M. David, and R. Deva, "A Discriminatively Trained, Multiscale, Deformable Part Model," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1-8, 2008.
5 P. Dollar, R. Appel, and S. Belongie, "Fast Feature Pyramids for Object Detection," Journal of Pattern Analysis and Machine Intelligence, Vol. 36, No. 8, pp. 1532-1545, 2014.   DOI
6 J. Deng, W. Dong, R. Socher, L.J. Li, K. Li, and L. Fei-Fei, "ImageNet: Large-scale Hierarchical Image Database," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 248-255, 2009.
7 G. Ross, D. Jeff, D. Trevor, and M. Jitendra, "Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580-587, 2014.
8 X. Glorot and Y. Bengio, "Understanding the Difficulty of Training Deep Feedforward Neural Networks," Proceeding of International Conference on Artificial Intelligence and Statistics, pp. 249-256, 2010.
9 M. Tarek, A. Nabil, S.A. Domingo, A. Cristhian, and T. Ricardo, "Multispectral Stereo Odometry," IEEE Transactions on Intelligent Transportation Systems, Vol. 16, No. 3, pp. 1210-1224, 2015.   DOI
10 J. Donahue, Y. Jia, O. Vinyals, J. Hoffman, N. Zhang, T. Darrell, et al., "DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition," Proceeding of the International Confidence on Machine Learning, pp. 647-655, 2014.
11 J. Long, E. Shelhamer, and T. Darrell, "Fully Convolutional Networks for Semantic Segmentation," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431-3440, 2015.
12 C. Papageorgiou and T. Poggio, "A Trainable System for Object Detection," International Journal of Computer Vision, Vol. 38, No. 1, pp. 15-33, 2000.   DOI
13 N. Srivastava, G. Hinton, A. Krizhevsky, Alex, I. Sutskever, and R. Salakhutdinov, "Dropout: A Simple Way to Prevent Neural Networks from Overfitting," The Journal of Machine Learning Research, Vol. 15, No. 1, pp. 1929-1958, 2014.
14 A. Vedaldi and K. Lenc, "MatConvNet-Convolutional Neural Networks for MATLAB," Proceedings of the 23rd ACM International Confidence on Multimedia, pp. 689-692, 2015.
15 [Available] P. Dollar, Piotr's Computer Vision Matlab Toolbox, http://vision.ucsd.edu/˜pdollar/toolbox/doc/index.html.
16 S. Hwang, J. Park, N. Kim, Y. Choi, and S. Kweon, "Multispectral Pedestrian Detection: Benchmark Dataset and Baseline," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1037-1045, 2015.
17 National Highway Traffic Safety Administration, Traffic Safety Facts, Annals of Emergency Medicine, 2013.
18 D.W. James and V. Sharma, "Background-Subtraction in Thermal Imagery Using Contour Saliency," International Journal of Computer Vision, Vol. 71, No. 2, pp. 161-181, 2007.   DOI
19 L. Walchshäusl, R. Lindl, K. Vogel, and T. Tatschke, Advanced Microsystems for Automotive Applications, Springer Publishers, Berlin Heidelberg, 2006.
20 P. Sermanet, D. Eigen, X. Zhang, M. Mathieu, R. Fergus, and Y. LeCun, Overfeat: Integrated Recognition, Localization and Detection Using Convolutional Networks, arXiv preprint arXiv:1312.6229, 2013.
21 LSI Far Infrared Pedestrian Dataset, http://www.uc3m.es/islab/repository (accessed July, 2013).
22 C.J. Burges, "A Tutorial on Support Vector Machines for Pattern Recognition," Data Mining and Knowledge Discovery, Vol. 2, No. 2, pp. 121-167, 1988.   DOI
23 F. Yoav and S.E. Robert, "A Decision-theoretic Generalization of On-line Learning and an Application to Boosting," Journal of Computer and System Sciences, Vol. 55, No. 1, pp. 119-139, 1997.   DOI
24 D. Navneet and T. Bill, "Histograms of Oriented Gradients for Human Detection," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 886-893, 2005.
25 L. Rainer and M. Jochen, "An Extended Set of Haar-like Features for Rapid Object Detection," Proceeding of International Conference on Image Processing, pp. 900-903, 2002.