Browse > Article
http://dx.doi.org/10.9723/jksiis.2022.27.2.035

Intra Prediction Method by Quadric Surface Modeling for Depth Video  

Lee, Dong-seok (동의대학교 인공지능그랜드ICT연구센터)
Kwon, Soon-kak (동의대학교 컴퓨터소프트웨어공학과)
Publication Information
Journal of Korea Society of Industrial Information Systems / v.27, no.2, 2022 , pp. 35-44 More about this Journal
Abstract
In this paper, we propose an intra-picture prediction method by a quadratic surface modeling method for depth video coding. The pixels of depth video are transformed to 3D coordinates using distance information. A quadratic surface with the smallest error is found by least square method for reference pixels. The reference pixel can be either the upper pixels or the left pixels. In the intra prediction using the quadratic surface, two predcition values are computed for one pixel. Two errors are computed as the square sums of differences between each prediction values and the pixel values of the reference pixels. The pixel sof the block are predicted by the reference pixels and prediction method that they have the lowest error. Comparing with the-state-of-art video coding method, simulation results show that the distortion and the bit rate are improved by up to 5.16% and 5.12%, respectively.
Keywords
Depth Video coding; Intra Prediction; Quadric Surface Modeling;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Lee, D. S., Kim, B. K., and Kwon, S. K. (2021). Efficient depth data coding method based on plane modeling for intra prediction, IEEE Access, 9, pp. 29153-29164.   DOI
2 Stankiewicz, O., Wegner, K., and Domanski, M. (2013). Nonlinear depth representation for 3D video coding, IEEE International Conference on Image Processing, Sep. 15-18, Melbourne, Australia, pp. 1752-1756.
3 Aguilar, W. G., Rodriguez, G. A., Alvarez, L., Sandoval, S., Quisaguano, F., and Limaico, A. (2017). Visual SLAM with a RGB-D camera on a quadrotor UAV using on-board processing, International Work-Conference on Artificial Neural Networks, Jun. 14-16, Cadiz, Spain, pp. 596-606.
4 Fu, J., Miao, D., Yu, W., Wang, S., Lu, Y., and Li, S. (2013). Kinect-like depth data compression, IEEE Transactions on Multimedia, 15(6), pp. 1340-1352.   DOI
5 Kwon, S. K., Tamhankar, A., and Rao, K. R. (2006). Overview of H.264/MPEG-4 part 10, Journal of Visual Communication and Image Representation, 17(2), pp. 186-216.   DOI
6 Wang, X., Sekercioglu, Y., Drummond, T., Natalizio, E., Fantoni, I., and Fremont, V. (2016). Fast depth video compression for mobile RGB-D sensors, IEEE Transactions on Circuits and Systems for Video Technology, 26(4), pp. 673-686.   DOI
7 Zhao, Y., Carraro, M., Munaro, M., and Menegatti, E. (2017). Robust Multiple Object Tracking in RGB-D Camera Networks, IEEE/RSJ International Conference on Intelligent Robots and Systems, Sep. 24-28, Vancouver, Canada, pp. 6625-6632.
8 Wang, X., Sekercioglu, Y., Drummond, T., Fremont, V., Natalizio, E., and Fantoni, I. (2018). Relative pose based redundancy removal: collaborative RGB-D data transmission in mobile visual sensor networks, Sensors, 18(8), pp. 1-23.   DOI
9 Sun, Y., Liu, M., and Meng, M. Q. H. (2017). Improving RGB-D SLAM in dynamic environments: A motion removal approach, Robotics and Autonomous Systems, 89, pp. 110-122.   DOI
10 Bross, B., Wang, Y. K., Ye, Y., Liu, S., Chen, J., Sullivan, G. J., and Ohm, J. R. (2021). Overview of the versatile video coding (VVC) standard and its applications, IEEE Transactions on Circuits and Systems for Video Technology, 31(10), pp. 3736-3764.   DOI
11 Shen, G., Kim, W. S., Ortega, A., Lee, J., and Wey, H. (2010). Edge-aware intra prediction for depth-map coding. IEEE International Conference on Image Processing, Sep. 26-29, Hong Kong, pp. 3393-3396.
12 Lee, D. S. and Kwon, S. K. (2019). Prediction method for depth picture through spherical modeling mode, Journal of Korea Multimedia Society, 22(12), pp. 1368-1375.   DOI
13 Liu, S., Lai, P., Tian, D., and Chen, C. W. (2011). New depth coding techniques with utilization of corresponding video, IEEE Transactions on Broadcasting, 57(2), pp. 551-561.   DOI
14 Nenci, F., Spinello, L., and Stachniss, C. (2014). Effective compression of range data streams for remote robot operations using H.264, IEEE/ RSJ International Conference on Intelligent Robots and Systems, Sep. 14-18, Chicago, USA, pp. 3794-3799.
15 Silberman, N., Hoiem, D., Kohli, P,, and Fergus, R. (2012), Indoor segmentation and support inference from rgbd images, European Conference on Computer Vision, Oct. 7-13 Firenze, Italy, pp. 746-760.
16 Ren, C. Y., Prisacariu, V. A., Kahler, O., Reid, I. D., and Murray, D. W. (2017). "Real-time Tracking of Single and Multiple Objects from Depth-colour Imagery Using 3D Signed Distance Functions, International Journal of Computer Vision, 124(1), pp. 80-95.   DOI
17 Hartley, R. and Zisserman, A. (2000). Multiple View Geometry in Computer Vision, New York, USA, Cambridge University Press.
18 Lai, K., Bo, L., Ren, X., and Fox, D. (2011). A large-scale hierarchical multi-view RGB-D object dataset, IEEE International Conference on Robotics and Automation, May 9-13, Shanghai, China, pp. 1817-1824.
19 Sullivan, G. J., Ohm, J., Han, W. J., and Wiegand, T. (2012). Overview of the high efficiency video coding (HEVC) standard, IEEE Transactions on Circuits and Systems for Video Technology, 22(12), pp. 1649-1668.   DOI
20 Fraunhofer Heinrich Hertz Institute (2022). V VC Test Model, https://vcgit.hhi.fraunhofer.de/jvet/VVCSoftware_VTM (Accessed on A pr. 14th, 2022)
21 Jiang, M., Luo, X., Hai, T., Wang, H., Yang, S., Abdalla, A. N. (2019). Visual Object Tracking in RGB-D Data via Genetic Feature Learning, Complexity, 2019(2019), pp. 1-8.