DOI QR코드

DOI QR Code

A New Object Region Detection and Classification Method using Multiple Sensors on the Driving Environment

다중 센서를 사용한 주행 환경에서의 객체 검출 및 분류 방법

  • Kim, Jung-Un (Dept. of Media Eng., The Catholic University of Korea) ;
  • Kang, Hang-Bong (Dept. of Media Eng., The Catholic University of Korea)
  • Received : 2017.07.17
  • Accepted : 2017.07.31
  • Published : 2017.08.31

Abstract

It is essential to collect and analyze target information around the vehicle for autonomous driving of the vehicle. Based on the analysis, environmental information such as location and direction should be analyzed in real time to control the vehicle. In particular, obstruction or cutting of objects in the image must be handled to provide accurate information about the vehicle environment and to facilitate safe operation. In this paper, we propose a method to simultaneously generate 2D and 3D bounding box proposals using LiDAR Edge generated by filtering LiDAR sensor information. We classify the classes of each proposal by connecting them with Region-based Fully-Covolutional Networks (R-FCN), which is an object classifier based on Deep Learning, which uses two-dimensional images as inputs. Each 3D box is rearranged by using the class label and the subcategory information of each class to finally complete the 3D bounding box corresponding to the object. Because 3D bounding boxes are created in 3D space, object information such as space coordinates and object size can be obtained at once, and 2D bounding boxes associated with 3D boxes do not have problems such as occlusion.

Keywords

References

  1. A. Krizhevsky, I. Sutskever, and G.E. Hinton, "Imagenet Classification with Deep Convolutional Neural Networks," Advances in Neural Information Processing Systems, pp. 1097-1105, 2012. (NIPS)
  2. R. Girshick, J. Donahue, T. Darrell, and J. Malik, "Rich Feature Hierarchies for Accurate Object Setection Detection and Semantic Segmentation," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580-587, 2014.
  3. S. Ren, K. He, R. Girshick, and J. Sun, "Faster R-CNN: Towards Real-time Object Detection with Region Proposal Networks," Advances in Neural Information Processing Systems, pp. 91-99, 2015.
  4. T. Kong, A. Yao, Y. Chen, and F. Sun, "Hypernet: Towards Accurate Region Proposal Generation and Joint Object Detection," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 845-853, 2016.
  5. J. Dai, Y. Li, K. He, and J. Sun, "R-FCN: Object Detection Via Region-based Fully Convolutional Networks," Advances in Neural Information Processing Systems, pp. 379-387, 2016.
  6. J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You Only Look Once: Unified, Real-time Object Detection," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779-788, 2016.
  7. W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, and C.Y. Fu, et al., "SSD: Single Shot Multibox Detector," Proceeding of European Conference on Computer Vision, pp. 21-37, 2016.
  8. X. Chen, K. Kundu, Y. Zhu, A.G. Berneshawi, H. Ma, and S. Fidler, et al., "3D Object Proposals for Accurate Object Class Detection," Advances in Neural Information Processing Systems, pp. 424-432, 2015.
  9. G.R. Cross and A.K. Jain, "Markov Random Field Texture Models," IEEE Transactions on Pattern Analysis and Machine Intelligence, No. 1, pp. 25-39, 1983
  10. C. Cortes and V. Vapnik, “Support-vector Networks,” Machine Learning, Vol. 20, No. 3, pp. 273-297, 1995. https://doi.org/10.1007/BF00994018
  11. D.Z. Wang and I. Posner, "Voting for Voting in Online Point Cloud Object Detection," Proceeding of Robotics: Science and Systems, 2015.
  12. Y. Xiang, W. Choi, Y. Lin, and S. Savarese, "Data-driven 3D Voxel Patterns for Object Category Recognition," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1903-1911, 2015.
  13. Y. Xiang, W. Choi, Y. Lin, and S. Savarese, "Subcategory-aware Convolutional Neural Networks for Object Proposals and Detection," Proceeding of IEEE Winter Conference on Applications of Computer Vision, pp. 924-933, 2016.
  14. T. Huang, G. Yang, and G. Tang, “A Fast Two-dimensional Median Filtering Algorithm,” IEEE Transactions on Acoustics, Speech, and Signal Processing, Vol. 27, No. 1, pp. 13-18, 1979. https://doi.org/10.1109/TASSP.1979.1163188
  15. C.L. Zitnick and P. Dollar, "Edge Boxes: Locating Object Proposals from Edges," Proceeding of European Conference on Computer Vision, pp. 391-405, 2014.
  16. K. He, X. Zhang, S. Ren, and J. Sun, "Deep Residual Learning for Image Recognition," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778, 2016.
  17. Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, and R. Girshick, et al., "Caffe: Convolutional Architecture for Fast Feature Embedding," Proceeding of the 22nd Association for Computing Machinery International Conference on Multimedia, pp. 675-678, 2014.
  18. M. Everingham, L. Van Gool, C.K.I. Williams, J. Winn, and A. Zisserman, "The Pascal Visual Object Classes (voc) Challenge," International Journal of Computer Vision, Vol. 88, No. 2, pp. 303-338, 2010. https://doi.org/10.1007/s11263-009-0275-4
  19. A. Geiger, P. Lenz, and R. Urtasun, "Are We Ready for Autonomous Driving-the Kitti Vision Benchmark Suite," Proceeding of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3354-3361, 2012.
  20. J.R.R. Uijlings, K.E.A. Van De Sande, T. Gevers, and A.W.M. Smeulders, “Selective Search for Object Recognition,” International J ournal of Computer Vision, Vol. 104, No. 2, pp. 154-171, 2013. https://doi.org/10.1007/s11263-013-0620-5
  21. X. Wang, M. Yang, S. Zhu, and Y. Lin, "Regionlets for Generic Object Detection," Proceedings of the IEEE International Conference on Computer Vision, pp. 17-24, 2013.
  22. E. Ohn-Bar and M.M. Trivedi, “Learning to Detect Vehicles by Clustering Appearance Patterns,” IEEE Transactions on Intelligent Transportation Systems, Vol. 16, No. 5, pp. 2511-2521, 2015. https://doi.org/10.1109/TITS.2015.2409889
  23. F. Yang, W. Choi, and Y. Lin, "Exploit All the Layers: Fast and Accurate CNN Object Detector with Scale Dependent Pooling and Cascaded Rejection Classifiers," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2129-2137, 2016.
  24. D. Kim and G. Hyun, "Object Analysis on Outdoor Environment Using Multiple Features for Autonomous Navigation Robot". Journal of Korea Multimedia Society, Vol. 13, No. 5, pp. 651-662. 2010.

Cited by

  1. 자율주행 환경에서 이미지 객체 분할을 위한 강화된 DFCN 알고리즘 성능연구 vol.9, pp.4, 2017, https://doi.org/10.30693/smj.2020.9.4.9