• Title/Summary/Keyword: Voice interface

Search Result 298, Processing Time 0.027 seconds

Korean Speaker Verification Using Speaker Adaptation Methods (화자 적응 기술을 이용한 한국어 화자 확인)

  • Choi Dong-Jin;Oh Yung-Hwan
    • Proceedings of the KSPS conference
    • /
    • 2006.05a
    • /
    • pp.139-142
    • /
    • 2006
  • Speaker verification systems can be implemented using speaker adaptation methods if the amount of speech available for each target speaker is too small to train the speaker model. This paper shows experimental results using well-known adaptation methods, namely Maximum A Posteriori (MAP) and Maximum Likelihood Linear Regression (MLLR). Experimental results using Korean speech show that MLLR is more effective than MAP for short enrollment utterances.

  • PDF

A Study on Automatic Voice Response Service Using TDX-ACD (TDX-ACD를 이용한 자동음성 안내 기능에 관한 연구)

  • 김영곤;신동헌;신석현
    • Proceedings of the Korean Institute of Communication Sciences Conference
    • /
    • 1988.10a
    • /
    • pp.12-16
    • /
    • 1988
  • 본 논문에서는 안내원의 작업처리시간을 줄이기 위한 방법으로 DDX-1A를 이용한 자동호 분배 장치에 자도음성안내 기능을 구현하기 위한 T-level Prncessor 인 PCP (Protocol Convert Processor) VCP(Voice Contro Processor)와 B-level Processor Avru(voice Response)와 B-level Processor AVRU(Automatic Voice Response Unit)의 H/W 기능 및 상호 interface 에 관하여 고찰한다.

  • PDF

Japanese Speech Based Fuzzy Man-Machine Interface of Manipulators

  • Izumi, Kiyotaka;Watanabe, Keigo;Tamano, Yuya;Kiguchi, Kazuo
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2003.10a
    • /
    • pp.603-608
    • /
    • 2003
  • Recently, personal robots and home robots are developing by many companies and research groups. It is considered that a general effective interface for user of those robots is speech or voice. In this paper, Japanese speech based man-machine interface system is discussed for reflecting the fuzziness of natural language on robots, by using fuzzy reasoning. The present system consists of the derivation part of action command and the modification part of the derived command. In particular, a unique problem of Japanese is solved by applying the morphological analyzer ChaSen. The proposed system is applied for the motion control of a robot manipulator. It is proved from the experimental results that the proposed system can easily modify the same voice command to the actual different levels of the command, according to the current state of the robot.

  • PDF

Voice Activity Detection Algorithm using Wavelet Band Entropy Ensemble Analysis in Car Noisy Environments (문서 편집 접근성 향상을 위한 음성 명령 기반 모바일 어플리케이션 개발)

  • Park, Joo Hyun;Park, Seah;Lee, Muneui;Lim, Soon-Bum
    • Journal of Korea Multimedia Society
    • /
    • v.21 no.11
    • /
    • pp.1342-1352
    • /
    • 2018
  • Voice Command systems are important means of ensuring accessibility to digital devices for use in situations where both hands are not free or for people with disabilities. Interests in services using speech recognition technology have been increasing. In this study, we developed a mobile writing application using voice recognition and voice command technology which helps people create and edit documents easily. This application is characterized by the minimization of the touch on the screen and the writing of memo by voice. We have systematically designed a mode to distinguish voice writing and voice command so that the writing and execution system can be used simultaneously in one voice interface. It provides a shortcut function that can control the cursor by voice, which makes document editing as convenient as possible. This allows people to conveniently access writing applications by voice under both physical and environmental constraints.

Program Development of Emotional Human and Computer Interface

  • Jung, Seul;Cho, Kiho
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2002.10a
    • /
    • pp.102.3-102
    • /
    • 2002
  • $\textbullet$ Human and computer interface(HCI) $\textbullet$ Voice recognition $\textbullet$ Image recognition $\textbullet$ Neural network $\textbullet$ Hopfield net

  • PDF

Study on User Experience design in Gesture Interaction as a Product Trigger - Focusing on Product Design - (제품 트리거로서 행동인식의 사용자 경험 디자인 연구 - 제품디자인을 중심으로 -)

  • Min, Sae-yan;Lee, Cathy Yeonchoo
    • Journal of Digital Convergence
    • /
    • v.17 no.5
    • /
    • pp.379-384
    • /
    • 2019
  • The purpose of this study is to investigate the problems of the rapidly increasing voice interface and to find out what results will be obtained when the new gesture interaction is applied to the product, and to suggest the improvement method for a better user experience. Through the literature review, I have conducted a theoretical review on the changes in the product interface used in the product and the difference between them, and then conducted in-depth interviews on the 20-30 users who used voice recognition as a product trigger. As a result, it was concluded that the decline in the reliability of accuracy leads to a decrease in the preference of voice recognition interactions and an needs of appropriate interface for the functional aspect of non-relavancy in physical distance as a product trigger. This study is meaningful in that it has found a problem with the study of the product trigger interface and suggested improvement measures, and hope to be helpful in follow-up study.

Study of Event Recorder with Recording Voice Communication (음성 통화 저장 기능을 제공하는 고속전철용 Event Recorder 연구)

  • Song, Gyu-Youn;Lee, Sang-Nam;Ryu, Hee-Moon;Paik, Jin-Sung
    • Proceedings of the KSR Conference
    • /
    • 2008.06a
    • /
    • pp.1962-1967
    • /
    • 2008
  • A event recorder system stores a train speed and the related information for train operation in real time. Using those information, we can analysis the train operation and the reason of train accident. Currently the event recorder only manipulate the data related the train operation mechanically and electrically. In this paper we propose the event recorder to record the voice communication between the manager in the control center and train operator. By recording the voice communication in the high speed train, the correctness of analysis of train accident can be increased. The system architecture of the event recorder with voice recording is studied and interface between other equipment is proposed. And the software architecture of new event recorder is developed. We study the method of converting analog voice signal into digital data and compressing method. Also the architecture of memory to store the compressed voice data and regeneration of original analog voice are studied.

  • PDF

Study on Development of VUI Based on VoiceXML in Mobile Environment (모바일 환경에서 VoiceXML기반의 VUI 개발에 관한 연구)

  • Lim, Chae-Uk;Jang, Min-Seok
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2003.05a
    • /
    • pp.539-542
    • /
    • 2003
  • 기존의 모바일 디바이스(휴대전화, PDA 등)의 인터페이스는 GUI 방식이 주류를 이루고 있으며 약간의 음성인식 기술이 접목되고 있는 실정이다. 그 음성인식 기술의 활용은 음성인식 다이얼링에 제한되어 있는 실정이다. 이러한 한계점을 극복하기 위해 본 논문에서는 VoiceXML 포럼에서 제안한 VoiceXML 버전 2.0 스펙을 따르는 VoiceXML을 모바일 환경에 적용시켜 음성인식 다이얼링 기능뿐만 아니라, 음성인식 및 합성 기술을 이용한 메뉴선택, 정보 청취 등의 기능을 가능하게 하는 목적으로 VoiceXML 기반의 VUI(Voice User Interface) 개발을 위한 요구사항을 제시하고자 한다. 기존의 GUI 방식뿐만 아니라 VUI 방식을 수용하게 함으로써 사용자들에게 인간친화적인 정보획득 환경을 제공할 것이다.

  • PDF

An Experimental Study on Hindrance Factors of Usability of Menu Structure in ARS (ARS 메뉴체계 사용성 저해요소에 대한 실험연구)

  • Kim, Ho-Won;Kim, Hee-Cheol
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.15 no.2
    • /
    • pp.462-470
    • /
    • 2011
  • ARS (Automatic Response Systems) based on VUI (Voice User Interface) and TTI (Touch Tone Interface) are one of the most widely used communication systems. Despite common usages, however, inconvenience of ARS is continually pointed out. This may stem from lack of human-centered studies aside from technological development. In this paper, we provide guidelines for designing ARS by analyzing hindrance factors of usability of ARS menu structure. We had selected two call-centers using ARS, and carried out an experimental study where subjects performed the task of "returning books." After that, they completed questionnaires and interviews. We identified four problems: the complex menu structure, lack of representativeness on the menu name, users' awareness of location, and a difficulty to move among menus. And we partially discussed the ways of avoiding the problems.