• Title/Summary/Keyword: Voice learning

The Impact of Audiovisual Elements on Learning Outcomes - Focusing on MOOC -

  • Li Meng;Hong, Chang-kee
    • International Journal of Internet, Broadcasting and Communication / v.16 no.3 / pp.98-112 / 2024
  • As digital education progresses, MOOCs (Massive Open Online Courses) are increasingly used by learners, which makes research on MOOC learning outcomes necessary. In this study, we systematically investigated the impact of audiovisual elements on learning outcomes in MOOCs. Through a survey analyzed with descriptive statistics, reliability metrics, and regression techniques, we quantified the influence of text, graphics, color, teacher images, sound effects, background music, and the teacher's voice on learner attention, cognitive load, and satisfaction. We found that background music and text layout significantly improve engagement and reduce cognitive burden, which underscores their role in the instructional design of MOOCs. Our findings contribute new insights to the field of digital education and emphasize the importance of integrating audiovisual elements thoughtfully to foster better learning environments and outcomes. This study not only advances academic understanding of the effects of multimedia on learning but also offers practical guidance for educators and course designers seeking to enhance the efficacy of MOOCs.

Design and Implementation of Mobile Communication System for Hearing-impaired Person (청각 장애인을 위한 모바일 통화 시스템 설계 및 구현)

  • Yun, Dong-Hee;Kim, Young-Ung
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.16 no.5 / pp.111-116 / 2016
  • According to the Ministry of Science, ICT and Future Planning's survey on the information gap, the smartphone ownership rate among people with disabilities is only about one-third that of people without disabilities, leaving them with significantly less access to information. In this paper, we develop an application, CallHelper, that makes mobile voice calls more convenient for people with hearing impairments. CallHelper runs automatically when a call comes in, converts the caller's voice to text displayed on the mobile screen, and shows emoticons that visualize the emotion inferred from the caller's voice. It also saves the voice, the transcribed text, and the emotion data so that they can be reviewed later.

Voice Activity Detection based on DBN using the Likelihood Ratio (우도비를 이용한 DBN 기반의 음성 검출기)

  • Kim, S.K.;Lee, S.M.
    • Journal of rehabilitation welfare engineering & assistive technology / v.8 no.3 / pp.145-150 / 2014
  • In this paper, we propose a novel scheme to improve the performance of a voice activity detector (VAD) based on deep belief networks (DBN) with the likelihood ratio (LR). Instead of the conventional decision rule, which uses the geometric mean, the proposed algorithm applies a DBN trained to minimize the probability of detection error. Experimental results show that the proposed algorithm outperforms the conventional VAD algorithm in various noise environments.
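
For context, the conventional rule mentioned in this abstract is commonly formulated as a statistical-model VAD: each frequency bin yields a likelihood ratio, and a frame is declared speech when the geometric mean of those ratios (equivalently, the mean of their logarithms) exceeds a threshold. The sketch below assumes a complex Gaussian model for speech and noise. The function names, the SNR inputs, and the threshold value are illustrative assumptions and are not taken from the paper; the DBN-based decision stage that the authors propose is not shown.

```python
import math
from typing import Sequence


def log_likelihood_ratio(a_priori_snr: float, a_posteriori_snr: float) -> float:
    """Log likelihood ratio of one frequency bin (speech vs. noise only).

    Under a complex Gaussian model: log L = gamma * xi / (1 + xi) - log(1 + xi),
    where xi is the a priori SNR and gamma is the a posteriori SNR.
    """
    xi, gamma = a_priori_snr, a_posteriori_snr
    return gamma * xi / (1.0 + xi) - math.log1p(xi)


def geometric_mean_vad(
    a_priori_snrs: Sequence[float],
    a_posteriori_snrs: Sequence[float],
    log_threshold: float = 0.5,  # hypothetical value; tuned per application
) -> bool:
    """Return True when the frame is classified as speech.

    The log of the geometric mean of the per-bin likelihood ratios equals
    the arithmetic mean of the per-bin log likelihood ratios.
    """
    if len(a_priori_snrs) != len(a_posteriori_snrs) or not a_priori_snrs:
        raise ValueError("SNR sequences must be non-empty and of equal length")
    log_lrs = [
        log_likelihood_ratio(xi, gamma)
        for xi, gamma in zip(a_priori_snrs, a_posteriori_snrs)
    ]
    mean_log_lr = sum(log_lrs) / len(log_lrs)
    return mean_log_lr > log_threshold


if __name__ == "__main__":
    # Frame with high SNR in every bin, then a frame that looks like noise only.
    print(geometric_mean_vad([10.0] * 4, [11.0] * 4))
    print(geometric_mean_vad([0.01] * 4, [1.0] * 4))
```

A fixed threshold on this averaged statistic is the step that, according to the abstract, the authors replace with a DBN trained to minimize detection error.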

An interactive teachable agent system for EFL learners (대화형 Teachable Agent를 이용한 영어말하기학습 시스템)

  • Kyung A Lee;Sun-Bum Lim
    • The Journal of the Convergence on Culture Technology / v.9 no.3 / pp.797-802 / 2023
  • In an environment where English is a foreign language, learners can use AI voice chatbots for speaking practice to increase their motivation to speak, gain opportunities for communication practice, and improve their English speaking ability. In this study, we propose a teaching-style AI voice chatbot that lower elementary school students can use easily and that enhances their learning. To apply the Teachable Agent (TA) approach to language learning, an activity based on tense, context, and memory, we propose a new TA method in which the agent reflects the learner's English pronunciation and proficiency level and generates its answers according to the learner's errors, and we implemented a prototype TA AI chatbot. We conducted usability evaluations with practicing elementary English teachers and elementary school students to demonstrate the learning effects. The results of this study can be applied to motivate students who are not interested in learning, as well as elementary school students, to participate voluntarily in learning through role-switching.

Research on Developing a Conversational AI Callbot Solution for Medical Counselling

  • Won Ro LEE;Jeong Hyon CHOI;Min Soo KANG
    • Korean Journal of Artificial Intelligence / v.11 no.4 / pp.9-13 / 2023
  • In this study, we explored the potential of integrating interactive AI callbot technology into the medical consultation domain as part of a broader service development initiative. To enhance patient satisfaction, the AI callbot was designed to efficiently address queries from hospitals' primary users, especially the elderly and those using phone services. With an AI-driven callbot incorporated into the hospital's customer service center, routine tasks such as appointment changes and cancellations were handled by the AI Callbot Agent, while tasks requiring more detailed attention or specialization were handled by Human Agents, ensuring a balanced and collaborative approach. The deep learning model for voice recognition in this study was based on the Transformer model and was fine-tuned for the medical field from a pre-trained model. Existing recording files were converted into training data, and a self-supervised learning (SSL) model was implemented. An artificial neural network (ANN) model was used to analyze voice signals and interpret them as text, and after actual application, intent recognition was enriched through reinforcement learning to continuously improve accuracy. For TTS (Text To Speech), the Transformer model was applied to text analysis, the acoustic model, and the vocoder, and Google's Natural Language API was applied to recognize intent. As the research progresses, challenges remain to be solved, such as interconnection issues between various EMR providers, doctors' time slots, appointments at two or more hospitals, and patient use. However, in certain specialized cases reservations are easy to make, and implementation of the callbot service in hospitals appears to be immediately applicable.

Facial image visualization using voice Big Data (Big Data를 활용한 얼굴 이미지 시각화 연구)

  • Kwak, Dong-Ryul;Kim, Min-Cheol;Kim, Chang-Soo
    • Proceedings of the Korea Information Processing Society Conference / 2018.10a / pp.634-636 / 2018
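  • Recently, many technologies that make use of Big Data have been developed. This study describes a service that uses Machine Learning and Deep Learning to visualize facial images from voice Big Data, with the aim of helping to address voice phishing and other crimes. Beyond that, the service is expected to enable new security systems based on matching voices to faces, along with various other synergy effects.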

A Study on the Usability Evaluation and Improvement of Voice Tag Reader for a Visually Impaired Person (시각장애인 대상 음성태그리더기의 사용성 평가 및 개선 방안 연구)

  • Sora Kim;Yongyun Cho;Taehee Yong
    • Journal of Internet of Things and Convergence / v.9 no.2 / pp.1-9 / 2023
  • This study evaluated the usability of a voice tag reader in order to improve the product and make daily life more convenient for the visually impaired. Perceived usability was evaluated on 19 items, based on an evaluation model that considers usability principles and the characteristics of the visually impaired. A total of 50 participants were included in the analysis. The participants rated the safety of the voice tag reader, its voice and sound quality, and the accuracy of its voice information as relatively satisfactory. The reader received low ratings for efficiency of use, including its size and weight and the convenience of carrying and storing it. To improve usability, the procedure for using the product needs to be simplified, and it would be helpful to enter and supply tags for commonly used objects in advance.

WWW Based Instruction Systems for English Learning: GAIA

  • Park, Phan-Woo
    • Journal of The Korean Association of Information Education / v.3 no.2 / pp.113-119 / 2000
  • I studied a distance education model for English learning on the Internet. The basic WWW files that contain the courseware are written in HTML, and the functions required for learning are implemented in Java. Students and educators can access their preferred unit, composed of the appropriate text, voice, and image data, with a WWW browser at any time. The system supports automatic generation of English problems for reading and writing practice by drawing on the courseware data or on various English text resources located on the Internet. It also provides functions to manage and control the flow of distance learning and to support interaction between students and the system in a distributed environment. Educators can manage students' learning and can immediately see who is attending and who is leaving the lesson in the virtual space. In addition, students and educators in different places can communicate and discuss a topic through the server. I implemented these functions, which are required in a client/server environment for distance education, in Java. The system is named GAIA, and its URL is "http://park.taegu-e.ac.kr".

A study for improvement of Recognition velocity of Korean Character using Neural Oscillator (신경 진동자를 이용한 한글 문자의 인식 속도의 개선에 관한 연구)

  • Kwon, Yong-Bum;Lee, Joon-Tark
    • Proceedings of the Korean Institute of Intelligent Systems Conference / 2004.04a / pp.491-494 / 2004
  • Neural oscillators can be applied to oscillatory systems found in nature, such as image recognition, voice recognition, estimation of weather fluctuations, and analysis of geological fluctuations, and they are used mainly for pattern recognition of image information. Conventional BPL (Back-Propagation Learning) and MLNN (Multi-Layer Neural Network) approaches are not suitable for oscillatory systems because they complicate the learning structure, involve tedious procedures, and converge slowly. These problems can be solved easily by using the synchrony characteristic of a neural oscillator with a PLL (Phase-Locked Loop) function together with a simple Hebbian learning rule. In addition, the recognition velocity of Korean characters can be improved by using the neural oscillator's learning accelerator factor $\eta_{ij}$.

Development of Educational Programs for PHP using Flash Actionscripts (플래시 액션 스크립트를 이용한 PHP 교육용 프로그램 개발)

  • Kim, Dong-Sik;Lee, Dong-Yeop;Seo, Sam-Jun
    • Proceedings of the KIEE Conference / 2003.07d / pp.2543-2545 / 2003
  • This paper presents a web-based virtual classroom that can make learning the PHP language more efficient. The proposed Flash animations, which explain the important principles of several PHP topics, are designed so that learners can understand them easily by running them with simple mouse clicks. The animations enable learners to study on their own efficiently and with interest, since the learning process is designed to enhance multimedia capabilities on the basis of various educational technologies. In addition, internet-based online voice presentations are synchronized with their related text and moving images for an efficient language learning process. Through the proposed virtual classroom, learners will be able to learn the concepts related to the PHP language and its coding. The results of this paper allow an efficient virtual classroom to be implemented and are also expected to contribute to the promotion of internet-based educational systems.
