DOI QR코드

DOI QR Code

Superimposing 3D Models on Real Scenes Based on The Reinforcement Learning using Visual Observations

  • Yong-Ju Lee (Department of Civil and Environmental Engineering, Myongji University) ;
  • Linh Nguyen (Department of Civil and Environmental Engineering, Myongji University) ;
  • Man-Woo Park (Department of Civil and Environmental Engineering, Myongji University)
  • Published : 2024.07.29

Abstract

This research presents a method for Augmented Reality (AR) object superimposition leveraging reinforcement learning techniques to significantly reduce manual input during the exploration of digital information on construction sites. A distinctive feature of this approach is the application of a reinforcement learning neural network, trained with pairs of real and virtual view images, for AR superimposition. This approach enables the precise adjustment of the virtual camera's position and orientation within a virtual scene, aiming to seamlessly integrate AR objects into real-world views. This research initially focuses on simpler scenarios involving 2 and 3 degrees of freedom for orientation and position adjustments. The purpose is to explore the feasibility of the application through those experiments, with the expectation that the results would be interpreted positively. These initial findings highlight the promise of the suggested method in improving AR applications, especially within the construction sector, by enabling more natural and precise merging of virtual and physical objects, without requiring user intervention.

Keywords

Acknowledgement

This work was supported by the National Research Foundation of Korea(NRF) grant funded by the Korea government(MSIT) (No. 2022R1F1A1074604).

References

  1. E. Marchand, F. Chaumette, "Virtual Visual Servoing: A Framework for Real-Time Augmented Reality" Computer Graphics Forum, vol. 21, no. 3, pp. 289-97, 2002.
  2. A. I. Comport, E. Marchand, M. Pressigout, F. Chaumette, "Real-time markerless tracking for augmented reality: the virtual visual servoing framework" IEEE Transactions on Visualization and Computer Graphics, vol. 12 no. 4, pp. 615-628, 2006.
  3. M. Kalaitzakis, B. Cain, S. Carroll, A. Ambrosi, C. Whitehead, N. Vitzilaios, "Fiducial Markers for Pose Estimation" Journal of Intelligent & Robotic Systems, vol. 101, no. 4, 2021.
  4. J. Polvi, T. Taketomi, G. Yamamoto, A. Dey, C. Sandor, H. Kato, "SlidAR: A 3D positioning method for SLAM-based handheld augmented reality" Computers & Graphics, vol. 55, pp. 33-43, 2016.
  5. M. Tamaazousti, S. Naudet-Collette, V. Gay-Bellile, S. Bourgeois, B. Besbes, M. Dhome, "The constrained SLAM framework for non-instrumented augmented reality" Multimedia Tools and Applications, vol. 75, no. 16, pp. 9511-9547, 2015.
  6. J. Liu, Y. Xie, S. Gu, X. Chen, "A SLAM-Based Mobile Augmented Reality Tracking Registration Algorithm," International Journal of Pattern Recognition and Artificial Intelligence, vol. 34, no. 01, p. 2054005, 2020.