• Title/Summary/Keyword: TTS system

Search Result 148, Processing Time 0.023 seconds

On the Development of Animated Tutoring Dialogue Agent for Elementary School Science Learning (초등과학 수업을 위한 애니메이션 기반 튜터링 다이얼로그 에이전트 개발)

  • Jeong, Sang-Mok;Han, Byeong-Rae;Song, Gi-Sang
    • Journal of The Korean Association of Information Education
    • /
    • v.9 no.4
    • /
    • pp.673-684
    • /
    • 2005
  • In this research, we have developed a "computer tutor" that mimics the human tutor with animated tutoring dialog agent and the agent was integrated to teaching-learning material for elementary science subject. The developed system is a natural language based teaching-learning system using one-to-one dialogue. The developed pedagogical dialogue teaching-learning system analysis student's answer then provides appropriate answer or questions after comparing the student's answer with elementary school level achievement. When the agent gives either question or answer it uses the TTS(Text-to-Speech) function. Also the agent has an animated human tutor face for providing more human like feedback. The developed dialogue interface has been applied to 64 6th grade students. The test results show that the test group's average score is higher than the control group by 10.797. This shows that unlike conventional web courseware, our approach that "ask-answer" process and the animated character, which has human tutor's emotional expression, attracts students and helps to immerse to the courseware.

  • PDF

Image Based Human Action Recognition System to Support the Blind (시각장애인 보조를 위한 영상기반 휴먼 행동 인식 시스템)

  • Ko, ByoungChul;Hwang, Mincheol;Nam, Jae-Yeal
    • Journal of KIISE
    • /
    • v.42 no.1
    • /
    • pp.138-143
    • /
    • 2015
  • In this paper we develop a novel human action recognition system based on communication between an ear-mounted Bluetooth camera and an action recognition server to aid scene recognition for the blind. First, if the blind capture an image of a specific location using the ear-mounted camera, the captured image is transmitted to the recognition server using a smartphone that is synchronized with the camera. The recognition server sequentially performs human detection, object detection and action recognition by analyzing human poses. The recognized action information is retransmitted to the smartphone and the user can hear the action information through the text-to-speech (TTS). Experimental results using the proposed system showed a 60.7% action recognition performance on the test data captured in indoor and outdoor environments.

A Study on the Categorization System and Performance Parameters for the development of the Tube Transportation System's Requirements (튜브운송시스템 요구사항 개발을 위한 분류체계 및 성능변수 추출에 관한 연구)

  • Choi, Yo Chul;Kwon, Huck Bin
    • Journal of the Korean Society of Systems Engineering
    • /
    • v.5 no.2
    • /
    • pp.17-26
    • /
    • 2009
  • This paper is about that case study of the Tube Transportation System that the new transportation system offering passenger and logistic service in a metropolis having plenty of the floating population or between medium-sized cities, and solving large issues like terrible traffic jams and environmental problems etc. in this region. Also it presented that elicitation results of performance parameter and the categorization system of it applying a systematic analysis methodology. By the medium of this paper, It showed that definition, case study, performance parameters, and the categorization system of parameters of a general tube transportation system before developing requirements of a specific tube transportation system. From now on, it will come in pretty handy in systems engineering of activities to establish a concept of a new tube transportation systems and develop requirements.

  • PDF

Model Test Study for the Behavior of the Truss Tower System (실내 모형실험을 통한 무지보 흙막이 공법 거동 연구)

  • Kim, Nak-Kyung;Kim, Sung-Kyu;Baek, Min-Ky;Kim, Ju-Hyung;Joo, Yong-Sun
    • Proceedings of the Korean Geotechical Society Conference
    • /
    • 2010.03a
    • /
    • pp.819-824
    • /
    • 2010
  • Model test was performed for new earth retention system that is a kind of truss tower with non-supported excavation. For the model test, a dimensional analysis of the full-scaled truss tower system was performed. The horizontal displacement of the wall, bending stress acting on TTS system were measured during construction simulation. From the measurements, the performance of the truss tower system was investigated.

  • PDF

Enhancement Plan for Overall Disaster Prevention System

  • Moon, Sang-Ho
    • Journal of information and communication convergence engineering
    • /
    • v.9 no.1
    • /
    • pp.6-10
    • /
    • 2011
  • In Korea, overall disaster prevention system or 119 emergency rescue system has been established to protect life and fortune of citizens. This system supports command & control operation, emergency 119 caller location indicator, automatic formation of fire troops and dispatch, and emergency management. To do this, various new information technologies such as GIS, telematics, CTI and TTS are applied to implement the system. In the future, however, it is not impossible to prevent a large scale disasters caused by world climate environment change and complication of city culture using the current system. In this paper, we propose enhancement plan for overall disaster prevention system to solve this problem.

A New Vocoder based on AMR 7.4Kbit/s Mode for Speaker Dependent System (화자 의존 환경의 AMR 7.4Kbit/s모드에 기반한 보코더)

  • Min, Byung-Jae;Park, Dong-Chul
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.33 no.9C
    • /
    • pp.691-696
    • /
    • 2008
  • A new vocoder of Code Excited Linear Predictive (CELP) based on Adaptive Multi Rate (AMR) 7.4kbit/s mode is proposed in this paper. The proposed vocoder achieves a better compression rate in an environment of Speaker Dependent Coding System (SDSC) and is efficiently used for systems, such as OGM(Outgoing message) and TTS(Text To Speech), which needs only one person's speech. In order to enhance the compression rate of a coder, a new Line Spectral Pairs(LSP) code-book is employed by using Centroid Neural Network (CNN) algorithm. In comparison with original(traditional) AMR 7.4 Kbit/s coder, the new coder shows 27% higher compression rate while preserving synthesized speech quality in terms of Mean Opinion Score(MOS).

The utility of digital evaluation based on automatic item generation in mathematics: Focusing on the CAFA system (수학교과에서 자동문항생성 기반의 디지털 평가 활용 방안: CAFA 시스템을 중심으로)

  • Kim, Sungyeun
    • The Mathematical Education
    • /
    • v.61 no.4
    • /
    • pp.581-595
    • /
    • 2022
  • The purpose of this study is to specify the procedure for making item models based on ontology models using automatic item generation in the mathematics subject through the CAFA system, and to explore the generated item instances. As an illustration for this, an item model was designed as a part of formative assessment based on the content characteristics, including concepts and calculations, and process characteristics, including application, using the representative values and the measures of dispersion in Mathematics of the 9th grade based on the evaluation criteria achievement standards. The item types generated in one item model were a best answer type, a correct answer type, a combined-response type, an incomplete statement type, a negative type, a true-false type, and a matching type. It was found that HTML, Google Charts, TTS, figures, videos and so on can be used as media. The implications of the use of digital evaluation based on automatic item generation were suggested in the aspects of students, pre-service teachers, general teachers, and special education, and the limitations of this study and future research directions were presented.

Design of Smart Glasses Platform walking guide for the visually impaired (시각장애인을 위한 보행 안내 스마트 안경 플랫폼 설계)

  • Lee, Jaebeom;Jang, Jongwook;Jang, Sungjin
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2021.10a
    • /
    • pp.320-322
    • /
    • 2021
  • As the world's elderly population increases, the proportion of visually impaired is also increasing, and there are still many restrictions on the use of outside activities, such as safety problems and lack of guidance information. To solve this problem, research on smart devices such as smart glasses with optical character recognition (OCR) function is being actively conducted. In this paper, we propose a system that recognizes obstacles ahead and informs information by voice, and also guides the way to the destination. Using the deep learning object recognition model Yolo, it let them to recognize the risk factors as obstacles such as stairs and Larva cones. and it also deliver the information with a voice. so you can expect that the visually impaired can do a lot of different activity even more now that system takes the visually impaired to the destination by using the directions API, voice recognition, TTS library.

  • PDF

3D Graphics Visualization and Context Information Service for a Virtual Tourist System

  • Nguyen, Congdu;Le, Minh Tuan;Yoon, Dae-Il;Kim, Hae-Kwang
    • Journal of Ubiquitous Convergence Technology
    • /
    • v.1 no.1
    • /
    • pp.47-52
    • /
    • 2007
  • In this paper, we present a virtual tourist system with realtime 3D visualization and the assistance of context information service. Our system enables a visitor to take a discovering tour on a virtual environment from a remote client by following navigator or by self-navigating. During the tour, the system provides immersive 3D graphics contents while supporting relevant information to the visitors corresponding to their positions in the virtual environment. When the visitors interact with interested objects, the context information service will also support introduction information for presenting about the objects. The introduction information based on text format is represented by a comfortable way-audio conversion to visitors in different languages depended on their preferences using TTS(Text-To-Speak) tool.

  • PDF

A Study on the Voice Conversion with HMM-based Korean Speech Synthesis (HMM 기반의 한국어 음성합성에서 음색변환에 관한 연구)

  • Kim, Il-Hwan;Bae, Keun-Sung
    • MALSORI
    • /
    • v.68
    • /
    • pp.65-74
    • /
    • 2008
  • A statistical parametric speech synthesis system based on the hidden Markov models (HMMs) has grown in popularity over the last few years, because it needs less memory and low computation complexity and is suitable for the embedded system in comparison with a corpus-based unit concatenation text-to-speech (TTS) system. It also has the advantage that voice characteristics of the synthetic speech can be modified easily by transforming HMM parameters appropriately. In this paper, we present experimental results of voice characteristics conversion using the HMM-based Korean speech synthesis system. The results have shown that conversion of voice characteristics could be achieved using a few sentences uttered by a target speaker. Synthetic speech generated from adapted models with only ten sentences was very close to that from the speaker dependent models trained using 646 sentences.

  • PDF