• 제목/요약/키워드: Interactive Voice Interface

검색결과 25건 처리시간 0.025초

An Interactive Voice Web Browser Usable as a Multimodal Interface in Information Devices by Using VoiceXML

  • Jang, Min-Seok
    • 한국지능시스템학회논문지
    • /
    • 제14권6호
    • /
    • pp.771-775
    • /
    • 2004
  • The present Web surroundings is mostly composed of HTML(Hypertext Mark-up Language) and thereby users obtain web informations mainly in GUI(Graphical User Interface) environment by clicking mouse in order to keep up with hyperlinked informations. However it is very inconvenient to work in this environment comparing with easily accessed one in which human`s voice is utilized for obtaining informations. Using VoiceXML, resulted from XML, for supplying the information through telephone on the basis of the contemporary matured technology of voice recognition/synthesis to work out the inconvenience problem, this paper presents the research results about VoiceXML VUI(Voice User Interface) Browser designed and implemented for realizing its technology and also the VoiceXML Dialog designed for the purpose of the browser's efficient use.

VoiceXML 기반 음성인식시스템을 이용한 서비스 개발 (The Interactive Voice Services based on VoiceXML)

  • 김학균;김은향;김재인;구명완
    • 대한음성학회지:말소리
    • /
    • 제43호
    • /
    • pp.113-125
    • /
    • 2002
  • As there are needs to search the Web information via wire or wireless telephones, VoiceXML forum was established to develop and promote the Voice eXtensible Markup Language (VoiceXML). VoiceXML simplifies the creation of personalized interactive voice response services on the Web, and allows voice and phone access to information on Web sites, call center databases. Also, it can utilize the Web-based technologies, such as CGI(Common Gateway Interface) scripts. In this paper, we have developed the voice portal service platform based on VoiceXML called TeleGateway. It enables integration of voice services with data services using the Automatic Speech Recognition (ASR) and Text-To-Speech (TTS) engines. Also, we have showed the various services on voice portal services.

  • PDF

Implementation of interactive Stock Trading System Using VoiceXML

  • Shin Jeong-Hoon;Cho Chang-Su;Hong Kwang-Seok
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2004년도 ICEIC The International Conference on Electronics Informations and Communications
    • /
    • pp.387-390
    • /
    • 2004
  • In this paper, we design and implement practical application service using VoiceXML. And we suggest new solutions of problems can be occurred when implementing a new systems using VoiceXML, based on the fact. Up to now, speech related services were developed using API (Application Program Interface) and programming languages, which methods depend on system architectures. It thus appears that reuse of contents and resource was very difficult. To solve these problems, nowadays, companies develop their applications using VoiceXML. Advantages of using VoiceXML when developing services are as follows. First, we can use web developing technologies and technologies for transmitting web contents. And, we can save labors for low level programming like C language or Assembler language. And we can save labors for managing resources, too. As the result of these advantages, we can reduce developing hours of applications services and we can solve problem of compatibility between systems. But, there's poor grip of actual problems can be occurred when implementing their own services using VoiceXML. To overcome these problems, we implemented interactive stock trading system using VoiceXML and concentrated our effort to find out problems when using VoiceXML. And then, we proposed solutions to these problems and analyzed strong points and weak points of suggested system.

  • PDF

VoiceXML을 이용한 IVR 서버 설계 및 구현 (Design and Implementation of IVR Server Using VoiceXML)

  • 이창호;장원조;강선미
    • 음성과학
    • /
    • 제9권3호
    • /
    • pp.47-59
    • /
    • 2002
  • A new brilliant service using human-voice and DTMF (Dual Tone Multi Frequency) technique is expected nowadays in order to obtain valuable information on the internet more easily. VoiceXML (Voice eXtensible Markup Language) is the right choice that makes the new service possible. In this paper, the design and implementation of IVR (Interactive Voice Response) server using VoiceXML is described, where it connects with internet and IVR server efficiently. IVR server using VoiceXML is composed of two groups: VoiceXML document handling and VoiceXML execution. Scenario part of IVR server corresponds to VoiceXML document, the execution is performed by VoiceXML execution.

  • PDF

대화형 음성 지원을 통한 지능형 검색 시스템 (Intelligent Retrieval System with Interactive Voice Support)

  • 문규진;우요섭
    • 재활복지공학회논문지
    • /
    • 제9권1호
    • /
    • pp.29-35
    • /
    • 2015
  • 본 논문에서는 음성인식을 통해 상품검색을 도와주는 지능형 검색 시스템을 제안한다. 제안하는 시스템은 음성인식과정에서 잘못 인식된 어휘를 자동으로 수정하기 위해 어휘간의 관계를 이용한다. 본 연구에서는 제안하는 시스템의 유용성을 확인하기 위해 시스템을 시뮬레이션 할 수 있는 어플리케이션을 구현하였다. 실험 결과 간단한 유저 인터페이스를 통해 음성인식이 잘못된 어휘를 바로잡아 상품검색에 도움을 주는 것을 확인할 수 있었다.

  • PDF

멀티미디어 인터페이스 기술을 이용한 유아 대상의 체감형 게임 설계 : 신체 놀이 활동 중심 (Interactive Game Designed for Early Child using Multimedia Interface : Physical Activities)

  • 원혜민;이경미
    • 한국콘텐츠학회논문지
    • /
    • 제11권3호
    • /
    • pp.116-127
    • /
    • 2011
  • 본 논문에서는 유아를 위한 체감형 게임 개발에 필요한 요소로 콘텐츠, 디자인, 음향, 동작인식, 음성인식 기술을 제안하였다. 유아용 체감형 게임은 유아의 감성에 맞춘 교육적 요구가 반영된 콘텐츠와 밝고 친근감 있으면서 사용이 편리한 디자인 요소들이 반영되어야 하고 유아가 친숙하고도 쉽게 게임을 할 수 있게 유도할 수 있는 배경음악과 설명 대사가 사용되는 것이 좋다. 만약 동작 인식과 음성인식 시스템을 유아용 체감형 게임에 사용할 경우 게임 사용자의 연령에 맞는 동작 데이터와 음성 데이터를 사용해 인식률을 높여야 한다. 특히, 본 논문에서는 피부색과 유아 신체 모델을 사용하여 유아의 얼굴과 손을 인식한 후 그 위치를 고려하여 유아의 동작을 인식하였고 유아의 음성 데이터를 수집해 신경망을 이용한 음성인식 기술을 게임에 적용해 신체 놀이 중심 활동의 줄넘기 게임인 '신나게 폴짝'을 개발하였다.

Interactive Adaptation of Fuzzy Neural Networks in Voice-Controlled Systems

  • Pulasinghe, Koliya;Watanabe, Keigo;Izumi, Kiyotaka;Kiguchi, Kazuo
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 2002년도 ICCAS
    • /
    • pp.42.3-42
    • /
    • 2002
  • Fuzzy Neural Network (FNN) is a compulsory element in a voice-controlled machine due to its inherent capability of interpreting imprecise natural language commands. To control such a machine, user's perception of imprecise words is very important because the words' meaning is highly subjective. This paper presents a voice based controller centered on an adaptable FNN to capture the user's perception of imprecise words. Conversational interface of the machine facilitates the learning through interaction. The system consists of a dialog manager (DM), the conversational interface, a Knowledge base, which absorbs user's perception and acts as a replica of human understanding of imprecise words,...

  • PDF

Implementation of Android-based Interactive Edutainment Contents Using Authoring Tool Developed for Interactive Animation

  • Song, Mi-Young
    • 한국컴퓨터정보학회논문지
    • /
    • 제23권4호
    • /
    • pp.71-80
    • /
    • 2018
  • In this paper, we developed an interactive animation authoring tool and developed the Android based interactive edutainment contents. The authoring tool for creating interactive animations developed in this paper is based on a graphical user interface, so users can easily create interactive animations. Interactive animation contents created by this authoring tool can be created as images and xml files so that they can be used directly on mobile devices. In order to increase learning efficiency for children, Android-based interactive edutainment electronic storybooks, which is implemented using this authoring tool, provided a recording function to listen to the parents' voice as well as an interactive action in which the characters move in accordance with the story line. We also provided a STEAM game that combines creativity and imagination with creative science and technology. Therefore, by creating the edutainment contents through the proposed authoring tool for interactive animation, various interactive animation contents could be produced more easily than the code implementation method. Through this study, I hope that it will be helpful for the development of various interactive edutainment contents to provide educational contents considering the quantity and quality to infants.

Real-time Multi-device Control System Implementation for Natural User Interactive Platform

  • 김명진;황태민;채승훈;김민준;문연국;김승준
    • 인터넷정보학회논문지
    • /
    • 제23권1호
    • /
    • pp.19-29
    • /
    • 2022
  • Natural user interface (NUI) is used for the natural motion interface without using a specific device or tool like a mouse, keyboards, and pens. Recently, as non-contact sensor-based interaction technologies for recognizing human motion, gestures, voice, and gaze have been actively studied, an environment has been prepared that can provide more diverse contents based on various interaction methods compared to existing methods. However, as the number of sensors device is rapidly increasing, the system using a lot of sensors can suffer from a lack of computational resources. To address this problem, we proposed a real-time multi-device control system for natural interactive platform. In the proposed system, we classified two types of devices as the HC devices such as high-end commercial sensor and the LC devices such astraditional monitoring sensor with low-cost. we adopt each device manager to control efficiently. we demonstrate a proposed system works properly with user behavior such as gestures, motions, gazes, and voices.

음성기반 멀티모달 사용자 인터페이스의 사용성 평가 방법론 (Usability Test Guidelines for Speech-Oriented Multimodal User Interface)

  • 홍기형
    • 대한음성학회지:말소리
    • /
    • 제67호
    • /
    • pp.103-120
    • /
    • 2008
  • Basic components for multimodal interface, such as speech recognition, speech synthesis, gesture recognition, and multimodal fusion, have their own technological limitations. For example, the accuracy of speech recognition decreases for large vocabulary and in noisy environments. In spite of those technological limitations, there are lots of applications in which speech-oriented multimodal user interfaces are very helpful to users. However, in order to expand application areas for speech-oriented multimodal interfaces, we have to develop the interfaces focused on usability. In this paper, we introduce usability and user-centered design methodology in general. There has been much work for evaluating spoken dialogue systems. We give a summary for PARADISE (PARAdigm for Dialogue System Evaluation) and PROMISE (PROcedure for Multimodal Interactive System Evaluation) that are the generalized evaluation frameworks for voice and multimodal user interfaces. Then, we present usability components for speech-oriented multimodal user interfaces and usability testing guidelines that can be used in a user-centered multimodal interface design process.

  • PDF