[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.3837/tiis.2015.03.016

Depth Map Coding Using Histogram-Based Segmentation and Depth Range Updating

Lin, Chunyu (Institute of Information Science, Beijing Jiaotong University Beijing Key Laboratory of Advanced Information Science and Network)
Zhao, Yao (Institute of Information Science, Beijing Jiaotong University Beijing Key Laboratory of Advanced Information Science and Network)
Xiao, Jimin (Department of Electrical and Electronic Engineering, Xian Jiaotong-Liverpool University)
Tillo, Tammam (Department of Electrical and Electronic Engineering, Xian Jiaotong-Liverpool University)

Publication Information

KSII Transactions on Internet and Information Systems (TIIS) / v.9, no.3, 2015 , pp. 1121-1139 More about this Journal

Abstract

In texture-plus-depth format, depth map compression is an important task. Different from normal texture images, depth maps have less texture information, while contain many homogeneous regions separated by sharp edges. This feature will be employed to form an efficient depth map coding scheme in this paper. Firstly, the histogram of the depth map will be analyzed to find an appropriate threshold that segments the depth map into the foreground and background regions, allowing the edge between these two kinds of regions to be obtained. Secondly, the two regions will be encoded through rate distortion optimization with a shape adaptive wavelet transform, while the edges are lossless encoded with JBIG2. Finally, a depth-updating algorithm based on the threshold and the depth range is applied to enhance the quality of the decoded depth maps. Experimental results demonstrate the effective performance on both the depth map quality and the synthesized view quality.

Keywords

Depth map coding; 3D coding; histogram-based segmentation;

Citations & Related Records

Reference

1	P. Merkle, A. Smolic, K. Muller, and T. Wiegand, "Multi-view video plus depth representation and coding," in Proc. of IEEE International Conference on Image Processing (ICIP), pp. 201-204, 2007.
2	C. Fehn, "Depth-image-based rendering (DIBR), compression, and transmission for a new approach on 3D-TV," in Proc. of SPIE 5291, pp. 93-104, 2004.
3	W. Hu, G. Cheung, X. Li, and O. Au, "Depth map compression using multi-resolution graph-based transform for depth-image-based rendering," in Proc. of IEEE International Conference on Image Processing (ICIP), pp. 1297-1300, 2012.
4	G. Shen, W.-S. Kim, A. Ortega, J. Lee, and H. Wey, "Edge-aware intra prediction for depth-map coding," in Proc. of IEEE International Conference on Image Processing (ICIP), pp. 3393-3396, 2010.
5	M. Maitre and M. N. Do, "Depth and depth-color coding using shape-adaptive wavelets," J. Vis. Comun. Image Represent., vol. 21, no. 5-6. pp. 513-522, 2010. DOI
6	Morvan Y., Farin, D. de With P.H.N., "Depth-image compression based on an R-D optimized quadtree decomposition for the transmission of multiview images," in Proc. of IEEE International Conference on Image Processing (ICIP), pp. 105-108, 2007.
7	Shinya Shimizu, Hideaki Kimata, Shiori Sugimoto and Norihiko Matsuura, "Block-adaptive palette-based prediction for depth map coding," in Proc. of IEEE International Conference on Image Processing (ICIP), pp. 117-120, 2011.
8	P. Merkle, C. Bartnik, K. Muller, D. Marpe, and T. Wiegand, "3D video: Depth coding based on inter-component prediction of block partitions," in Proc. of Picture Coding Symposium (PCS), pp. 149-152, 2012.
9	K. Muller, P. Merkle, G. Tech, and T. Wiegand, "3D video coding with depth modeling modes and view synthesis optimization," in Proc. of Asia-Pacific Signal Information Processing Association Annual Summit and Conference, pp. 1-4, 2012.
10	F. Jager, M. Wien, and P. Kosse, "Model-based intra coding for depth maps in 3D video using a depth lookup table," in Proc. of 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON), pp. 1-4, 2012.
11	W. Pearlman, A. Islam, N. Nagaraj, and A. Said, "Efficient, low-complexity image coding with a set-partitioning embedded block coder," IEEE Transactions on Circuits and Systems for Video Technology, vol. 14, no. 11, pp. 1219-1235, 2004. DOI
12	F. Ono, W. Rucklidge, R. Arps, and C. Constantinescu, "JBIG2-the ultimate bi-level image coding standard," in Proc. International Conference on Image Processing (ICIP), pp. 140-143, 2000.
13	R. Gonzalez and R. Woods, Digital image processing (2nd edition), Prentice-Hall Inc., 2002.
14	D. I. Cohen, A. and J.-C. Feauveau, "Biorthogonal bases of compactly supported wavelets," Communications on Pure and Applied Mathematics, vol. 45, no. 5, pp. 485-560, 1992. DOI
15	A. Said and W. Pearlman, "A new, fast, and efficient image codec based on set partitioning in hierarchical trees," IEEE Transactions on Circuits and Systems for Video Technology, vol. 6, no. 3, pp. 243-250, 1996. DOI
16	W. S. Kim, A. Ortega, P. Lai, D. Tian, and C. Gomila, "Depth map distortion analysis for view rendering and depth coding," in Proc. of IEEE International Conference on Image Processing (ICIP), pp. 721-724, 2009.
17	B. T. Oh, J. Lee, and D.-S. Park, "Depth map coding based on synthesized view distortion function," IEEE Journal of Selected Topics in Signal Processing, vol. 5, no. 7, pp. 1344-1352, 2011. DOI
18	M. Adams and F. Kossentini, "Jasper: a software-based JPEG-2000 codec implementation," in Proc. of International Conference on Image Processing (ICIP), pp. 53-56, 2000.
19	D. Scharstein and R. Szeliski, "A taxonomy and evaluation of dense two-frame stereo correspondence algorithms," Int. J. Comput. Vision, vol. 47, no. 1-3, pp. 7-42, 2002. DOI
20	C. L. Zitnick, S. B. Kang, M. Uyttendaele, S. Winder, and R. Szeliski, "High-quality video view interpolation using a layered representation," ACM Trans. Graph., vol. 23, no. 3, pp. 600-608, 2004. DOI
21	Oliver, J.; Malumbres, M.P., "Low-Complexity Multiresolution Image Compression Using Wavelet Lower Trees," IEEE Transactions on Circuits and Systems for Video Technology, vol. 16, no. 11, pp. 1437-1444, 2006. DOI
22	L. Zhang and W. J. Tam, "Stereoscopic image generation based on depth images for 3D TV," IEEE Transactions on Broadcasting, vol. 51, no. 2, pp. 191-199, 2005. DOI
23	I. JTC1/SC29/WG11, "View synthesis algorithm in view synthesis reference software 2.0 (VSRS2.0)," Doc. M16090, 2009.
24	G. Bjntegaard, "Improvements of the BD-PSNR model," ITU-T SG16 Q.6 Document, VCEG-AI11, 2008.
25	C. Lee and Y. S. Ho, "View synthesis tools for 3D video," ISO/IEC JTC1/SC29/WG11 MPEG2008/M15851, 2008.
26	S. Li and W. Li, "Shape-adaptive discrete wavelet transforms for arbitrarily shaped visual object coding," IEEE Transactions on Circuits and Systems for Video Technology, vol. 10, no. 5, pp. 725-743, 2000. DOI