DOI QR코드

DOI QR Code

Single-View Reconstruction of a Manhattan World from Line Segments

  • Lee, Suwon (School of Computer Science and the Research Institute of Natural Science, Gyeongsang National University) ;
  • Seo, Yong-Ho (Department of AI and Robot Convergence, Mokwon University)
  • Received : 2022.01.03
  • Accepted : 2022.01.10
  • Published : 2022.03.31

Abstract

Single-view reconstruction (SVR) is a fundamental method in computer vision. Often used for reconstructing human-made environments, the Manhattan world assumption presumes that planes in the real world exist in mutually orthogonal directions. Accordingly, this paper addresses an automatic SVR algorithm for Manhattan worlds. A method for estimating the directions of planes using graph-cut optimization is proposed. After segmenting an image from extracted line segments, the data cost function and smoothness cost function for graph-cut optimization are defined by considering the directions of the line segments and neighborhood segments. Furthermore, segments with the same depths are grouped during a depth-estimation step using a minimum spanning tree algorithm with the proposed weights. Experimental results demonstrate that, unlike previous methods, the proposed method can identify complex Manhattan structures of indoor and outdoor scenes and provide the exact boundaries and intersections of planes.

Keywords

References

  1. J.M. Coughlan and A.L. Yuille, "Manhattan world: Compass direction from a single image by Bayesian inference," in Proc. 7th IEEE International Conference on Computer Vision, pp. 941-947, Sep. 20-27, 1999. DOI: https://doi.org/10.1109/ICCV.1999.790349.
  2. P. Sturm and S. Maybank, "A method for interactive 3D reconstruction of piecewise planar objects from single images," in Proc. 10th British Machine Vision Conference, pp. 265-274, Sep. 13-16, 1999.
  3. P. Muller, G. Zeng, P. Wonka, and L. Van Gool, "Image-based procedural modeling of facades," ACM Transactions on Graphics, Vol. 26, No. 3, pp. 85-es, July 2007. DOI: https://doi.org/10.1145/1276377.1276484.
  4. D. Hoiem, A. Efros, and M. Hebert, "Automatic photo pop-up," ACM Transactions on Graphics, Vol. 24, No. 3, pp. 577-584, July 2005. DOI: https://doi.org/10.1145/1186822.1073232.
  5. A. Saxena, A, M. Sun, and A.Y. Ng, "Make3D: learning 3D scene structure from a single still image," IEEE Transaction on Pattern Analysis and Machine Intelligence, Vol. 31, No. 5, pp. 824-840, May 2009. DOI: https://doi.org/10.1109/TPAMI.2008.132.
  6. D. Lee, M. Hebert, and T. Kanade, "Geometric reasoning for single image structure recovery," in Proc. 22nd IEEE Conference on Computer Vision and Pattern Recognition, pp. 2136-2143, June 23-25, 1999. DOI: https://doi.org/10.1109/CVPR.2009.5206872.
  7. A. Flint, C. Mei, D. Murray, and I. Reid, "A dynamic programming approach to reconstructing building interiors," in Proc. 11th European Conference on Computer Vision, pp. 394-407, Sep. 5-11, 2010. DOI: https://doi.org/10.1007/978-3-642-15555-0_29.
  8. A. Zaheer, M. Rashid, and S. Khan, "Shape from angle regularity," in Proc. 12th European Conference on Computer Vision, pp. 1-14, Oct. 7-13, 2012. DOI: https://doi.org/10.1007/978-3-642-33783-3_1.
  9. C. Wu, J. Frahm, and M. Pollefeys, "Repetition-based dense single-view reconstruction," in Proc. 24th IEEE Conference on Computer Vision and Pattern Recognition, pp. 3113-3120, June 20-25, 2011. DOI: https://doi.org/10.1109/CVPR.2011.5995551.
  10. S. Ramalingam and M. Brand, "Lifting 3D Manhattan lines from a single image," In Proc. 15th IEEE International Conference on Computer Vision, pp. 497-504, Dec. 1-8, 2013. DOI: https://doi.org/10.1109/ICCV.2013.67.
  11. R.G. Von Gioi, J. Jakubowicz, J.M. Morel, and G. Randall, "LSD: a fast line segment detector with a false detection control," IEEE Transaction on Pattern Analysis and Machine Intelligence, Vol. 32, No. 4, pp. 722-732, April 2010. DOI: https://doi.org/10.1109/TPAMI.2008.300.
  12. Y. Boykov, O. Veksler, and R. Zabih, "Fast approximate energy minimization via graph cuts," IEEE Transaction on Pattern Analysis and Machine Intelligence, Vol. 23, No. 11, pp. 1222-1239, Nov. 2001. DOI: https://doi.org/10.1109/34.969114.
  13. V. Kolmogorov and R. Zabih, "What energy functions can be minimized via graph cuts?," IEEE Transaction on Pattern Analysis and Machine Intelligence, Vol. 26, No. 2, pp. 147-159, June 2004. DOI: https://10.1109/TPAMI.2004.1262177.
  14. P. Denis, J.H. Elder and F.J. Estrada, "Efficient edge-based methods for estimating Manhattan frames in urban imagery," in Proc. 10th European Conference on Computer Vision, pp. 197-210, Oct. 12-18, 2008. DOI: https://doi.org/10.1007/978-3-540-88688-4_15.