[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.5392/JKCA.2018.18.11.447

Depth Map Estimation Model Using 3D Feature Volume

Shin, Soo-Yeon (충북대학교)
Kim, Dong-Myung (충북대학교)
Suh, Jae-Won (충북대학교)

Publication Information

The Journal of the Korea Contents Association / v.18, no.11, 2018 , pp. 447-454 More about this Journal

Abstract

This paper proposes a depth image generation algorithm of stereo images using a deep learning model composed of a CNN (convolutional neural network). The proposed algorithm consists of a feature extraction unit which extracts the main features of each parallax image and a depth learning unit which learns the parallax information using extracted features. First, the feature extraction unit extracts a feature map for each parallax image through the Xception module and the ASPP(Atrous spatial pyramid pooling) module, which are composed of 2D CNN layers. Then, the feature map for each parallax is accumulated in 3D form according to the time difference and the depth image is estimated after passing through the depth learning unit for learning the depth estimation weight through 3D CNN. The proposed algorithm estimates the depth of object region more accurately than other algorithms.

Keywords

Depth Map; Convolutional Neural Network; Deep Learning; Stereo Image;

Citations & Related Records

Reference

1	ISO/IEC JTC1/SC29/WG11, "Appolication and Requirements on FTV," N9466, 2007.
2	C. Stentoumis, L. Grammatikopulos, I. Kalisperakis, and G. Karras, "On accurate dense stereo-matching using a local adaptive multi-cost approach," ISPRS J. of Photogrammetry and Remote Sensing, Vol. 91, pp. 29-49, 2014. DOI
3	A. Miron, S. Ainouz, A. Rogozan, and A. Bensrhair, "A robust cost function for stereo matching of road scenes," Pattern Recognition Letters, Vol. 38, pp. 70-77, 2014. DOI
4	C. Stentoumis, L. Grammatikopoulos, I. Kalisperakis, and G. Karras, "Stereo matching based on census transformation of image gradients," in Proceeding of the SPIE Optical Metrology, International Society for Optics and Photonics, 2015.
5	J. Zbontar and Y. Lecun, "Stereo matching by training a convolutional neural network to compare image patches," International J. of Machine Learning Research, Vol. 17, pp. 1-32, 2016.
6	A. Kendall, H. Martirosyan, S. Dasgupta, P. Henry, R. Kennedy, A. Bachrach, and A. Bry, "End-to-end learning of geometry and context for deep stereo regression," in Proceeding of the IEEE International Conference on Computer Vision(ICCV), pp. 1-8, 2017.
7	J. R. Chang and Y. S. Chen, "Pyramid Stereo Matching Network," in Proceeding of the IEEE International Conference on Computer Vision and Pattern Recognition(CVPR), pp. 5410-5418, 2018.
8	K. He, X. Zhang, S. Ren, and J. Sun, "Spatial pyramid pooling in deep convolutional networks for visual recognition," in European Conference on Computer Vision, pp. 346-361, 2014.
9	F. Chollet, "Xception : Deep Learning with Depthwise Separable Convolutions," in Proceeding of the IEEE International Conference on Computer Vision and Pattern Recognition(CVPR), 2017.
10	L. C. Chen, G. Papandreou, L. Kokkinos, K. Murphy, and A. L. Yuille, "DeepLab : Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs," in Proceeding of the IEEE International Conference on Computer Vision and Pattern Recognition(CVPR), 2016.
11	M. Menze and A. Geiger, "Object Scene Flow for Autonomous Vehicles," in Proceeding of the IEEE Conference on Computer Vision and Pattern Recognition(CVPR), 2015.
12	A. Newell, K. Yang, and J. Deng, "Stacked hourglass networks for human pose estimation," in European Conference on Computer Vision, pp. 483-499, 2016.
13	N. Mayer, E. Ilg, P. Hausser, P. Fischer, D. Cremers, A. Dosovitskiy, and T. Brox, "A large dataset to train convolutional networks for disparity, optical flow, and scene flow estimation," in Proceeding of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 2016.

KSCI

Depth Map Estimation Model Using 3D Feature Volume 3차원 특징볼륨을 이용한 깊이영상 생성 모델

Depth Map Estimation Model Using 3D Feature Volume