DOI QR코드

DOI QR Code

English Conversation System Using Artificial Intelligent of based on Virtual Reality

가상현실 기반의 인공지능 영어회화 시스템

  • Cheon, EunYoung (Div. of Mechanical & Automative Engineering, Kongju National University)
  • 천은영 (공주대학교 기계자동차공학부)
  • Received : 2019.09.30
  • Accepted : 2019.11.20
  • Published : 2019.11.28

Abstract

In order to realize foreign language education, various existing educational media have been provided, but there are disadvantages in that the cost of the parish and the media program is high and the real-time responsiveness is poor. In this paper, we propose an artificial intelligence English conversation system based on VR and speech recognition. We used Google CardBoard VR and Google Speech API to build the system and developed artificial intelligence algorithms for providing virtual reality environment and talking. In the proposed speech recognition server system, the sentences spoken by the user can be divided into word units and compared with the data words stored in the database to provide the highest probability. Users can communicate with and respond to people in virtual reality. The function provided by the conversation is independent of the contextual conversations and themes, and the conversations with the AI assistant are implemented in real time so that the user system can be checked in real time. It is expected to contribute to the expansion of virtual education contents service related to the Fourth Industrial Revolution through the system combining the virtual reality and the voice recognition function proposed in this paper.

외국어 교육을 실현하기 위하여 기존의 다양한 교육 매체들이 제공되고 있지만, 교구 및 매체프로그램에 대한 비용이 많이 들고 실시간 대응력이 떨어지는 단점이 존재한다. 이 논문에서는 VR과 음성인식을 기반으로 한 인공지능 유형의 영어회화 시스템을 제안한다. 시스템 구축을 위해 Google CardBoard VR과 Google Speech API를 이용하며 가상현실 환경 제공 및 대화를 위한 인공지능 알고리즘을 개발하였다. 제안하는 음성인식 서버시스템에서는 사용자가 발화한 문장을 단어 단위로 분리해 데이터베이스에 저장된 데이터 단어들과 비교하여 확률적으로 가장 높은 것을 답으로 제공할 수 있으며 사용자들이 가상현실의 인물과 적절한 대화 및 응답이 가능하다. 대화가 제공되는 기능은 상황별 대화와 주제에 독립적이며, AI 비서와 나눈 대화 내용을 사용자 시스템에서 실시간 확인이 가능하도록 구현하였고 실험을 통하여 음성인식에 대한 응답비율을 확인하였다. 이 논문에서 제안하는 가상현실과 음성인식 기능을 접목한 시스템을 통하여 4차 산업혁명에 관련한 가상교육 콘텐츠 서비스 확장에 이바지할 것을 기대한다.

Keywords

References

  1. J. Peer, (2005). Web service composition as AI planning: a survey, Switzerland: University of St. Gallen.
  2. Z. XIAO et al. (2019). Tell Me About Yourself: Using an AI-Powered Chatbot to Conduct Conversational Surveys. arXiv preprint, arXiv:1905.10700.
  3. M. F. McTear, Z. Callejas & D. Griol. (2016). The conversational interface, 6(94), 102. Cham: Springer. DOI : 10.1007/978-3-319-32967-3
  4. K. H. Kim. (2017). Interactive AI, the evolution of the chatbot. [Online] http://blogview.hyundaicardcapital.com/4010
  5. O. Mangkang & J. Y. Yun. (2019). A Comparative Study of Self-Diagnosis User Interfaces for Depression: Focusing on Usability of Paper-Based, Text-Based and Voice-Based Conversational Interface. Korea HCI Society Conference, 262-267. DOI: 10.17210/JHSK.2019.08.14.3.5
  6. G. E. Jo & S. I. Kim. (2018). A study on User Experience of Artificial Intelligence speaker. Journal of the Korea Convergence Society, 9(8), 127-133. DOI: 10.15207/JKCS.2018.9.8.127
  7. S. I. Jung. (2019). A Study on the Visualization for Information Delivery of Voice User Interface - Centered around the Display Type of AI Speakers, M. S. thesis, Ewha Womans University, Seoul.
  8. G. Hinton et al. (2012). Deep Neural Networks for Acoustic Modeling in Speech Recognition, The IEEE Signal Processing Magazine, 29(6), 82-97. https://doi.org/10.1109/MSP.2012.2205597
  9. Y. Cho, J. Kim, A. Sun & J. Oh. (2017). Design and implementation of artificial intelligence-based speech recognition for silver generation and single household "Voice" Application, Proc. of the Korean Society of Computer Information Conference 2017, 25(2), 142-145.
  10. KISTI. (2014). .KISTI MARKET REPORT_IT Convergence System.
  11. P. Sinclair. (2007). Integrating Hypermedia Techniques in Augmented Reality Environments . Ph.D thesis, University of Southampton.
  12. L. Freina & M. Ott. (2015). A literature review on immersive virtual reality in education: state of the art and perspectives. In The International Scientific Conference eLearning and Software for Education, 133.
  13. Google. (2019). Google Cloud Platform, Google Cloud. [Online]. https://cloud.google.com/products/?hl=ko
  14. M. Abadi et al. (2016). Tensorflow: A system for large-scale machine learning. In 12th {USENIX} Symposium on Operating Systems Design and Implementation ({OSDI}16), 265-283.
  15. V. Kepuska. (2017). Comparing Speech Recognition Systems (Microsoft API, Google API And CMU Sphinx). Veton Kepuska. Int. Journal of Engineering Research and Application, 7(3), (Part-2), 20-24. DOI: 10.9790/9622-0703022024
  16. B. Iancu. (2019). Evaluating Google Speech-to-Text API's Performance for Romanian e-Learning Resources. Informatica Economica, 23(1), 17-25. DOI: 10.12948/ISSN14531305/23.1.2019.02
  17. J. Kacur. (2006). HTK vs. Sphinx for Speech Recognition. Department of telecommunication FEI STU.
  18. A. Amer & P. Peralez. (2014, October). Affordable altered perspectives: Making augmented and virtual reality technology accessible. In IEEE Global Humanitarian Technology Conference (GHTC 2014), 603-608. DOI: 10.1109/GHTC.2014.6970345
  19. W. Powell, V. Powell, P. Brown, M. Cook & J. Uddin. (2016). Getting around in google cardboard - exploring navigation preferences with low-cost mobile VR, 2016 IEEE 2nd Workshop on Everyday Virtual Reality (WEVR), Greenville, SC, 5-8. DOI: 10.1109/WEVR.2016.7859536
  20. S. R. Jeong., & S. J. Chang. (2019). Production of fusion-type realistic contents using 3D motion control technology. Journal of Convergence for Information Technology, 9(4), 146-151. DOI: 10.22156/CS4SMB.2019.9.4.146
  21. S. Kim. (2018). An Exploratory Study of VR Technology using Patents and News Articles. Journal of Digital Convergence, 16(11), 185-199. DOI: 10.14400/JDC.2018.16.11.185
  22. Google VR Team. (2015). Google Cardboard. Google [Online]. www.google.com/get/cardboard/
  23. S. Yoo & C. Parker. (2015, August). Controller-less interaction methods for Google cardboard. In Proceedings of the 3rd ACM Symposium on Spatial User Interaction, 127-127. DOI: 10.1145/2788940.2794359
  24. R. Raguman, M. Santhakumar, X. P. Thomas, & M. Revathi. (2019). 3D Adventure Game Using Unity. Bonfring International Journal of Software Engineering and Soft Computing, 9(2), 16-20. DOI:10.9756/BIJSESC.9015
  25. Unity. (2019). Unity User Manual. Unity Technologies[Online]. https://docs.unity3d.com/Manual/index.html
  26. S. Ahola. (2019). Developing a Virtual Reality Application in Unity, LAHTI UNIVERSITY OF APPLIED SCIENCES Information and Communications Technology Media technology [Online]. https://www.theseus.fi/bitstream/handle/10024/171035/opinnaytetyo.pdf