DOI QR코드

DOI QR Code

Multi-view Image Generation from Stereoscopic Image Features and the Occlusion Region Extraction

가려짐 영역 검출 및 스테레오 영상 내의 특징들을 이용한 다시점 영상 생성

  • Received : 2012.06.19
  • Accepted : 2012.08.30
  • Published : 2012.09.30

Abstract

In this paper, we propose a novel algorithm that generates multi-view images by using various image features obtained from the given stereoscopic images. In the proposed algorithm, we first create an intensity gradient saliency map from the given stereo images. And then we calculate a block-based optical flow that represents the relative movement(disparity) of each block with certain size between left and right images. And we also obtain the disparities of feature points that are extracted by SIFT(scale-invariant We then create a disparity saliency map by combining these extracted disparity features. Disparity saliency map is refined through the occlusion detection and removal of false disparities. Thirdly, we extract straight line segments in order to minimize the distortion of straight lines during the image warping. Finally, we generate multi-view images by grid mesh-based image warping algorithm. Extracted image features are used as constraints during grid mesh-based image warping. The experimental results show that the proposed algorithm performs better than the conventional DIBR algorithm in terms of visual quality.

본 논문에서는 스테레오 영상에서 얻은 다양한 특징들을 이용하여 다시점 영상을 생성하는 방법을 제안한다. 제안된 기법에서는 먼저 주어진 스테레오 영상에서 명암변화 주목도 지도(intensity gradient saliency map)를 생성한다. 다음으로 좌우 영상 간에 블럭 단위의 움직임을 나타내는 광류(optical flow)를 계산하고 scale-invariant feature transform(SIFT) 기법을 통해 사물의 크기와 회전에 변하지 않는 영상의 특징 점을 구하여 이 특징점 간의 변이를 구한 다음, 이 두 변이 정보들을 결합하여 변이 주목도 지도(disparity saliency map)를 생성 한다. 생성된 변이 주목도 지도는 가려짐 영역 검출을 통해 오류 변이가 제거된다. 세 번째로 영상 워핑시에 직선의 왜곡을 최소화하기 위해 직선 세그먼트를 얻는다. 마지막으로 다시점 영상은 이렇게 추출된 영상 특징들을 제한 조건으로 사용하여 그리드 메쉬(grid-mesh) 기반 영상 워핑(warping) 기법에 의해 생성된다. 실험 결과를 통해 제안한 기법으로 생성된 다시점 영상의 화질이 기존 DIBR 기법보다 우수한 것을 확인할 수 있었다.

Keywords

References

  1. B. Bartczak and R. Koch, "Dense depth maps from low resolution time-of-flight depth and high resolution color views," Proc. of 5th International Symposium on Visual Computing, pp.1-12, Nov. 2009.
  2. A. Mancini and J. Konrad, "Robust quad-tree based disparity estimation for the reconstruction of intermediate stereoscopic images," Proc. SPIE Stereoscopic Displays and Virtual Reality Systems, vol. 3295, pp.53-64, Jan. 1998.
  3. ISO/IEC JTC1/SC29/WG11, Draft call for proposals on 3D video coding technology, N11830, Daegu, Korea, Jan. 2011.
  4. ISO/IEC JTC1/SC29/WG11, Applications and requirements on 3D video coding, N11829, Daegu, Korea, Jan. 2011.
  5. ISO/IEC JTC1/SC29/WG11, Boundary noise removal and common hole filling method for VSRS 3.5, M19356, Daegu, Korea, Jan. 2011.
  6. ISO/IEC JTC1/SC29/WG11, Image domain warping as alternative to DIBR for advanced 3DV applications, M19995, Geneva, Switzerland, March 2011.
  7. David G. Lowe, "Distinctive image features from scale-invariant keypoints," International Journal of Computer Vision(IJCV), vol. 60, pp.91-110, Nov. 2004 https://doi.org/10.1023/B:VISI.0000029664.99615.94
  8. M. Lang. A. Hornung, O. Wang. S. Poulakos, A. Smolic, and Gross, "Non-linear disparity mapping for stereoscopic 3D," ACM Transactions on Graph(SIGGRAPH 2010), vol. 29, July 2010.
  9. R. Achanta, F. Estrada, P. Wils, and S. Susstrunk, "Salient region detection and segmentation," International Conference on Computer Vision Systems, vol. 5008, pp.66-75, 2008.
  10. B. D. Lucas, T. Kanade, "An iterative image registration technique with an application to stereo vision", Proc. of the 1981 DARPA Imaging Understanding Workshop, pp. 121-130, 1981
  11. O. Barinova, V. Lempitsky, E. Tretiak, and P. Kohli, "Geometric image parsing in man-made environments," in ECCV, 2010.
  12. R.G. von Gioi, J. Jakubowicz, J. M. Morel, G. Randall, "LSD: A Fast Line Segment Detector with a False Detection Control," IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. PAMI 32, no.4, pp. 722-732, April. 2010.
  13. Nguyen Cao Truong Hai, Do-Yeon Kim, Hyuk-Ro Park, "Obtaining Object by Using Optimal Threshold for Saliency Map Thresholding,"(in Korean), JKCA, vol. 11, no. 6, pp.18-25, June, 2011. https://doi.org/10.5392/JKCA.2011.11.6.018
  14. S. Montabone and A. Soto, "Human detection using a mobile platform and novel features derived from a visual saliency mechanism," Image and Vision Computing, vol. 28, no. 3, pp. 391-402, 2010. https://doi.org/10.1016/j.imavis.2009.06.006
  15. Che-han Chang, Chia-Kai Liang, and Yung-Yu Chuang, "Content-aware display adaptation and interactive editing for stereoscopic images." IEEE Transactions on Multimedia, vol. 13, no. 4, pp.589-601, Aug. 2011. https://doi.org/10.1109/TMM.2011.2116775
  16. J. H. Park and H. W. Park, "A mesh-based disparity representation method for view interpolation and stereo image compression," IEEE Transaction on Image Processing, vol.15, no. 7, pp.1751-1762. July 2006. https://doi.org/10.1109/TIP.2006.877070
  17. Ilkwon Park and Hyeran Byun, "Efficient data representation of stereo images using edge-based mesh optimization," Journal of Broadcast Engineering, vol. 14, no. 3, pp.322-331, May 2009. https://doi.org/10.5909/JBE.2009.14.3.322
  18. Methodology for subjective assessment of the quality of television picture, ITU-R Recommendation BT.500-11

Cited by

  1. Multi-view Image Generation using Grid-mesh based Image Domain Warping and Occlusion Region Information vol.18, pp.6, 2013, https://doi.org/10.5909/JBE.2013.18.6.859