http://dx.doi.org/10.15207/JKCS.2019.10.11.055

English Conversation System Using Artificial Intelligent of based on Virtual Reality  

Cheon, EunYoung (Div. of Mechanical & Automotive Engineering, Kongju National University)
Publication Information
Journal of the Korea Convergence Society / v.10, no.11, 2019, pp. 55-61
Abstract
Various educational media already exist for foreign language education, but they have drawbacks: teaching aids and media programs are expensive, and real-time responsiveness is poor. In this paper, we propose an artificial intelligence English conversation system based on VR and speech recognition. We built the system with Google Cardboard VR and the Google Speech API, and developed artificial intelligence algorithms that provide the virtual reality environment and the conversation. In the proposed speech recognition server system, a sentence spoken by the user is divided into word units and compared with the words stored in the database, and the match with the highest probability is returned. Users can talk with and respond to people in virtual reality. The conversation function is provided independently of conversational context and theme, and conversations with the AI assistant run in real time, so that the user system can be checked in real time. The system proposed in this paper, which combines virtual reality with voice recognition, is expected to contribute to the expansion of virtual education content services related to the Fourth Industrial Revolution.
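The word-unit matching step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the scoring rule or the database layout, so everything here is an assumption: the `RESPONSES` dictionary stands in for the database, the function names are hypothetical, and Jaccard word overlap is used as a simple substitute for the paper's probability.

```python
import re

# Hypothetical stand-in for the database: stored sentence -> canned reply.
RESPONSES = {
    "how are you today": "I am fine, thank you. And you?",
    "what is your name": "My name is Anna.",
    "where is the station": "Go straight and turn left.",
}


def tokenize(sentence):
    """Lowercase a sentence and split it into word units."""
    return re.findall(r"[a-z']+", sentence.lower())


def match_score(spoken_words, stored_words):
    """Jaccard overlap of two word sets, in [0, 1] (assumed scoring rule)."""
    a, b = set(spoken_words), set(stored_words)
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def best_response(spoken, database=RESPONSES):
    """Return (reply, score) for the stored sentence that best matches."""
    words = tokenize(spoken)
    scored = [(match_score(words, tokenize(key)), key) for key in database]
    score, key = max(scored)  # highest score wins; ties fall back to key order
    return database[key], score


if __name__ == "__main__":
    reply, score = best_response("What's your name?")
    print(reply, round(score, 2))
```

In a full system the `spoken` string would come from the speech recognizer's transcript; a production matcher would probably also normalize contractions and apply a minimum score threshold before answering.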
Keywords
Speech Recognition; Artificial Intelligence; Virtual Reality; Deep Learning; Voice Recognition Interface; 4th Industrial Revolution
Citations & Related Records
Times Cited By KSCI: 3