[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.7236/IJASC.2022.11.1.1

Single-View Reconstruction of a Manhattan World from Line Segments

Lee, Suwon (School of Computer Science and the Research Institute of Natural Science, Gyeongsang National University)
Seo, Yong-Ho (Department of AI and Robot Convergence, Mokwon University)

Publication Information

International journal of advanced smart convergence / v.11, no.1, 2022 , pp. 1-10 More about this Journal

Abstract

Single-view reconstruction (SVR) is a fundamental method in computer vision. Often used for reconstructing human-made environments, the Manhattan world assumption presumes that planes in the real world exist in mutually orthogonal directions. Accordingly, this paper addresses an automatic SVR algorithm for Manhattan worlds. A method for estimating the directions of planes using graph-cut optimization is proposed. After segmenting an image from extracted line segments, the data cost function and smoothness cost function for graph-cut optimization are defined by considering the directions of the line segments and neighborhood segments. Furthermore, segments with the same depths are grouped during a depth-estimation step using a minimum spanning tree algorithm with the proposed weights. Experimental results demonstrate that, unlike previous methods, the proposed method can identify complex Manhattan structures of indoor and outdoor scenes and provide the exact boundaries and intersections of planes.

Keywords

single-view reconstruction; 3D reconstruction; Manhattan world; line segment detection;

Citations & Related Records

Reference

1	D. Hoiem, A. Efros, and M. Hebert, "Automatic photo pop-up," ACM Transactions on Graphics, Vol. 24, No. 3, pp. 577-584, July 2005. DOI: https://doi.org/10.1145/1186822.1073232. DOI
2	A. Flint, C. Mei, D. Murray, and I. Reid, "A dynamic programming approach to reconstructing building interiors," in Proc. 11th European Conference on Computer Vision, pp. 394-407, Sep. 5-11, 2010. DOI: https://doi.org/10.1007/978-3-642-15555-0_29. DOI
3	V. Kolmogorov and R. Zabih, "What energy functions can be minimized via graph cuts?," IEEE Transaction on Pattern Analysis and Machine Intelligence, Vol. 26, No. 2, pp. 147-159, June 2004. DOI: https://10.1109/TPAMI.2004.1262177. DOI
4	J.M. Coughlan and A.L. Yuille, "Manhattan world: Compass direction from a single image by Bayesian inference," in Proc. 7th IEEE International Conference on Computer Vision, pp. 941-947, Sep. 20-27, 1999. DOI: https://doi.org/10.1109/ICCV.1999.790349. DOI
5	D. Lee, M. Hebert, and T. Kanade, "Geometric reasoning for single image structure recovery," in Proc. 22nd IEEE Conference on Computer Vision and Pattern Recognition, pp. 2136-2143, June 23-25, 1999. DOI: https://doi.org/10.1109/CVPR.2009.5206872. DOI
6	P. Sturm and S. Maybank, "A method for interactive 3D reconstruction of piecewise planar objects from single images," in Proc. 10th British Machine Vision Conference, pp. 265-274, Sep. 13-16, 1999.
7	P. Muller, G. Zeng, P. Wonka, and L. Van Gool, "Image-based procedural modeling of facades," ACM Transactions on Graphics, Vol. 26, No. 3, pp. 85-es, July 2007. DOI: https://doi.org/10.1145/1276377.1276484. DOI
8	A. Saxena, A, M. Sun, and A.Y. Ng, "Make3D: learning 3D scene structure from a single still image," IEEE Transaction on Pattern Analysis and Machine Intelligence, Vol. 31, No. 5, pp. 824-840, May 2009. DOI: https://doi.org/10.1109/TPAMI.2008.132. DOI
9	Y. Boykov, O. Veksler, and R. Zabih, "Fast approximate energy minimization via graph cuts," IEEE Transaction on Pattern Analysis and Machine Intelligence, Vol. 23, No. 11, pp. 1222-1239, Nov. 2001. DOI: https://doi.org/10.1109/34.969114. DOI
10	A. Zaheer, M. Rashid, and S. Khan, "Shape from angle regularity," in Proc. 12th European Conference on Computer Vision, pp. 1-14, Oct. 7-13, 2012. DOI: https://doi.org/10.1007/978-3-642-33783-3_1. DOI
11	C. Wu, J. Frahm, and M. Pollefeys, "Repetition-based dense single-view reconstruction," in Proc. 24th IEEE Conference on Computer Vision and Pattern Recognition, pp. 3113-3120, June 20-25, 2011. DOI: https://doi.org/10.1109/CVPR.2011.5995551. DOI
12	S. Ramalingam and M. Brand, "Lifting 3D Manhattan lines from a single image," In Proc. 15th IEEE International Conference on Computer Vision, pp. 497-504, Dec. 1-8, 2013. DOI: https://doi.org/10.1109/ICCV.2013.67. DOI
13	P. Denis, J.H. Elder and F.J. Estrada, "Efficient edge-based methods for estimating Manhattan frames in urban imagery," in Proc. 10th European Conference on Computer Vision, pp. 197-210, Oct. 12-18, 2008. DOI: https://doi.org/10.1007/978-3-540-88688-4_15. DOI
14	R.G. Von Gioi, J. Jakubowicz, J.M. Morel, and G. Randall, "LSD: a fast line segment detector with a false detection control," IEEE Transaction on Pattern Analysis and Machine Intelligence, Vol. 32, No. 4, pp. 722-732, April 2010. DOI: https://doi.org/10.1109/TPAMI.2008.300. DOI