Browse > Article
http://dx.doi.org/10.5392/JKCA.2018.18.11.447

Depth Map Estimation Model Using 3D Feature Volume  

Shin, Soo-Yeon (충북대학교)
Kim, Dong-Myung (충북대학교)
Suh, Jae-Won (충북대학교)
Publication Information
Abstract
This paper proposes a depth image generation algorithm of stereo images using a deep learning model composed of a CNN (convolutional neural network). The proposed algorithm consists of a feature extraction unit which extracts the main features of each parallax image and a depth learning unit which learns the parallax information using extracted features. First, the feature extraction unit extracts a feature map for each parallax image through the Xception module and the ASPP(Atrous spatial pyramid pooling) module, which are composed of 2D CNN layers. Then, the feature map for each parallax is accumulated in 3D form according to the time difference and the depth image is estimated after passing through the depth learning unit for learning the depth estimation weight through 3D CNN. The proposed algorithm estimates the depth of object region more accurately than other algorithms.
Keywords
Depth Map; Convolutional Neural Network; Deep Learning; Stereo Image;
Citations & Related Records
연도 인용수 순위
  • Reference
1 ISO/IEC JTC1/SC29/WG11, "Appolication and Requirements on FTV," N9466, 2007.
2 C. Stentoumis, L. Grammatikopulos, I. Kalisperakis, and G. Karras, "On accurate dense stereo-matching using a local adaptive multi-cost approach," ISPRS J. of Photogrammetry and Remote Sensing, Vol. 91, pp. 29-49, 2014.   DOI
3 A. Miron, S. Ainouz, A. Rogozan, and A. Bensrhair, "A robust cost function for stereo matching of road scenes," Pattern Recognition Letters, Vol. 38, pp. 70-77, 2014.   DOI
4 C. Stentoumis, L. Grammatikopoulos, I. Kalisperakis, and G. Karras, "Stereo matching based on census transformation of image gradients," in Proceeding of the SPIE Optical Metrology, International Society for Optics and Photonics, 2015.
5 J. Zbontar and Y. Lecun, "Stereo matching by training a convolutional neural network to compare image patches," International J. of Machine Learning Research, Vol. 17, pp. 1-32, 2016.
6 A. Kendall, H. Martirosyan, S. Dasgupta, P. Henry, R. Kennedy, A. Bachrach, and A. Bry, "End-to-end learning of geometry and context for deep stereo regression," in Proceeding of the IEEE International Conference on Computer Vision(ICCV), pp. 1-8, 2017.
7 J. R. Chang and Y. S. Chen, "Pyramid Stereo Matching Network," in Proceeding of the IEEE International Conference on Computer Vision and Pattern Recognition(CVPR), pp. 5410-5418, 2018.
8 K. He, X. Zhang, S. Ren, and J. Sun, "Spatial pyramid pooling in deep convolutional networks for visual recognition," in European Conference on Computer Vision, pp. 346-361, 2014.
9 F. Chollet, "Xception : Deep Learning with Depthwise Separable Convolutions," in Proceeding of the IEEE International Conference on Computer Vision and Pattern Recognition(CVPR), 2017.
10 L. C. Chen, G. Papandreou, L. Kokkinos, K. Murphy, and A. L. Yuille, "DeepLab : Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs," in Proceeding of the IEEE International Conference on Computer Vision and Pattern Recognition(CVPR), 2016.
11 M. Menze and A. Geiger, "Object Scene Flow for Autonomous Vehicles," in Proceeding of the IEEE Conference on Computer Vision and Pattern Recognition(CVPR), 2015.
12 A. Newell, K. Yang, and J. Deng, "Stacked hourglass networks for human pose estimation," in European Conference on Computer Vision, pp. 483-499, 2016.
13 N. Mayer, E. Ilg, P. Hausser, P. Fischer, D. Cremers, A. Dosovitskiy, and T. Brox, "A large dataset to train convolutional networks for disparity, optical flow, and scene flow estimation," in Proceeding of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 2016.