http://dx.doi.org/10.3837/tiis.2014.03.020

GPU-Accelerated Single Image Depth Estimation with Color-Filtered Aperture  

Hsu, Yueh-Teng (A-MTK Corp.)
Chen, Chun-Chieh (Department of Electronic Engineering, National Taipei University of Technology)
Tseng, Shu-Ming (Department of Electronic Engineering, National Taipei University of Technology)
Publication Information
KSII Transactions on Internet and Information Systems (TIIS) / v.8, no.3, 2014, pp. 1058-1070
Abstract
There are two major approaches to depth estimation: multiple-image and single-image. The former has a high hardware cost because it requires multiple cameras, but its software algorithm is simple. Conversely, the latter has a low hardware cost, but its software algorithm is complex. A recent trend in this field is to make the system compact, or even portable, and to simplify the optical elements attached to a conventional camera. In this paper, we present an implementation of single-image depth estimation on a graphics processing unit (GPU) in a desktop PC, and achieve real-time performance through our evolutional algorithm and a parallel processing technique that employs a compute shader. These methods greatly accelerate the compute-intensive single-image depth estimation, from 0.003 frames per second (fps) in a MATLAB implementation to 53 fps, which is almost twice the real-time standard of 30 fps. To the best of our knowledge, no previous paper discusses the optimization of depth estimation using a single image, and the frame rate of our final result is better than that of previous studies using multiple images, which is about 20 fps.
Keywords
Real time; graphics processing unit; parallel processing technique; compute shader