• Title/Summary/Keyword: voice interface

Search Result 296, Processing Time 0.026 seconds

An Interactive Voice Web Browser Usable as a Multimodal Interface in Information Devices by Using VoiceXML

  • Jang, Min-Seok
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.14 no.6
    • /
    • pp.771-775
    • /
    • 2004
  • The present Web surroundings is mostly composed of HTML(Hypertext Mark-up Language) and thereby users obtain web informations mainly in GUI(Graphical User Interface) environment by clicking mouse in order to keep up with hyperlinked informations. However it is very inconvenient to work in this environment comparing with easily accessed one in which human`s voice is utilized for obtaining informations. Using VoiceXML, resulted from XML, for supplying the information through telephone on the basis of the contemporary matured technology of voice recognition/synthesis to work out the inconvenience problem, this paper presents the research results about VoiceXML VUI(Voice User Interface) Browser designed and implemented for realizing its technology and also the VoiceXML Dialog designed for the purpose of the browser's efficient use.

A Study on the Voice Interface for Mobile Environment (모바일기반 음성인터페이스에 관한 연구)

  • Kim, Soo-Hoon;Ahn, Jong-Young
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.13 no.1
    • /
    • pp.199-204
    • /
    • 2013
  • Google's android-based voice interface is limited to the web application and the users are rare. In this paper, We suggest the method that can be done using existing android-based voice engine and develope voice application. We also study the environments of android-based voice interface and present the appropriate voice interface in mobile environment.

Implementation of Packet Voice Protocol (패킷음성 프로토콜의 구현)

  • 이상길;신병철;김윤관
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.18 no.12
    • /
    • pp.1841-1854
    • /
    • 1993
  • In this paper, the packet voice protocol for the transmission of voice signal onto ethernet is implemented in a personal computer (PC). The packet voice protocol used is a modified one from CCITT G.764 packetized voice protocol. The hardware system to facilitate the voice communication onto ethernet is divided into telephone interface, speech processing, PC interface and controllers. The software structure of the protocol is designed according to the OSI seven layer architecture and is divided into three routines : ethernet device driver, telephone interface, and processing routine of the packet voice protocol. Experiments through ethernet with telephone interface show that this packet voice communication achieves satisfactory quality when the network traffic is light.

  • PDF

Development of a Voice User Interface for Web Browser using VoiceXML (VoiceXML을 이용한 VUI 지원 웹브라우저 개발)

  • Yea SangHoo;Jang MinSeok
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.11 no.2
    • /
    • pp.101-111
    • /
    • 2005
  • The present web informations are mainly described in terms of HTML, which users obtain through input devices such as mouse, keyboard, etc. Thus the existing GUI environment have not supported human's most natural information acquisition means, that is, voice. To solve the problem, several vendors are developing voice user interface. However these products are deficient in man -machine interactivity and their accommodation of existing web environment. This paper presents a VUI(Voice User Interface) supporting web browser by utilizing more and more maturing speech recognition technology and VoiceXML, a markup language derived from XML. It provides users with both interfaces, VUI as well as GUI. In addition, XML Island technology is applied to the bowser in a way that VoiceXML fragments are nested in HTML documents to accommodate the existing web environment. Also for better interactivity, dialogue scenarios for menu, bulletin, and search engine are suggested.

Design and Implementation of a Usability Testing Tool for User-oriented Design of Command-and-Control Voice User Interfaces (명령 제어 음성 인터페이스 사용자 중심 설계를 위한 사용성 평가도구의 설계 및 구현)

  • Lee, Myeong-Ji;Hong, Ki-Hyung
    • Phonetics and Speech Sciences
    • /
    • v.3 no.2
    • /
    • pp.79-87
    • /
    • 2011
  • Recently, usability has become very important in voice user interface systems. In this paper, we have designed and implemented a wizard-of-oz (WOZ) usability testing tool for command-and-control voice user interfaces. We have proposed the VUIDML (Voice User Interface Design Markup Language) to design the usability test scenario of command-and-control voice interfaces in the early design stages. For highly satisfactory voice user interfaces, we have to select highly preferred voice commands and prompts. In VUIDML, we can specify possible prompt candidates. The WOZ usability testing tool can also be used to collect user-preferred voice commands and feedback from real users.

  • PDF

A Study on Design of Dialog for VoiceXML VUI (VoiceXML VU를 위한 Dialog 설계에 관한 연구)

  • 장민석;예상후
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2002.11a
    • /
    • pp.792-795
    • /
    • 2002
  • Nowadays the corporations related to Information & Communication field are researching more and more on VoiceXML development. VoiceXML can provide users with more efficient interface, VUI(VoiceXML User Interface) in web environment than the existing one. But more research and development for designing the Dialog have to be done for VUI to be used in efficient way. That was a main topic in "2002 VoiceXML Conference & Expo". According to the importance this paper presents VoiceXML Dialog designed for the purpose of its efficient use and the experimental result.

  • PDF

The Development of Heuristics for Voice Shopping Service through Voice Interface with Display (디스플레이 탑재형 음성 인터페이스를 통한 음성쇼핑 서비스 휴리스틱 개발)

  • Gwon, Hyeon Jeong;Lee, Jee Yeon
    • Journal of the Korean Society for information Management
    • /
    • v.39 no.2
    • /
    • pp.1-33
    • /
    • 2022
  • Voice shopping is gaining attention following the trend of non-contact E-commerce by enabling people to shop via voice command. Therefore, in this study, voice shopping service heuristics using a display-mounted voice interface were developed in preparation for the future where voice shopping becomes a part of daily life in the world. First, as a theoretical approach, a literature survey of 50 papers on the design principles of 'visual interface,' 'voice interface,' and 'shopping service' was conducted to produce a total of 29 draft design principles. Second, as an empirical approach, a focus group interview was conducted on consumer decision-making processes in shopping experiences and information-seeking behavior within the context of shopping to draft the heuristics. This was to supplement the user experience, a weak part of the literature research. Finally, a Delphi survey asked 20 experts in UX, service planning, artificial intelligence development, and shopping to evaluate the heuristics draft developed through the above two stages. After three rounds of Delphi surveys, the final heuristics were proposed.

A Study on the Reliability of Voice Payment Interface (음성결제 인터페이스의 신뢰도에 관한 연구)

  • Gwon, Hyeon Jeong;Lee, Jee Yeon
    • Journal of the Korean Society for information Management
    • /
    • v.38 no.3
    • /
    • pp.101-140
    • /
    • 2021
  • As the payment service sector actively embraces artificial intelligence technology, "Voice Payments" is becoming a trend in contactless payment services. Voice payment services can execute payments faster and more intuitively through "voice," the most natural means of communication for humans. In this study, we selected richness, intimacy, and autonomy as factors for building trust with artificial intelligence agents. We wanted to determine whether the trust will be formed if the factors were applied to the voice payment services. The experiment results showed that the higher the richness and autonomy of the voice payment interface and the lower the intimacy, the higher the trust. In addition, the two-way interaction effects of richness and autonomy were significant. We analyzed and synthesized the collected short-answer system to identify users' anxiety when using voice payment services and proposed speech interface design ideas to increase their trust in the voice payment.

GMM based Nonlinear Transformation Methods for Voice Conversion

  • Vu, Hoang-Gia;Bae, Jae-Hyun;Oh, Yung-Hwan
    • Proceedings of the KSPS conference
    • /
    • 2005.11a
    • /
    • pp.67-70
    • /
    • 2005
  • Voice conversion (VC) is a technique for modifying the speech signal of a source speaker so that it sounds as if it is spoken by a target speaker. Most previous VC approaches used a linear transformation function based on GMM to convert the source spectral envelope to the target spectral envelope. In this paper, we propose several nonlinear GMM-based transformation functions in an attempt to deal with the over-smoothing effect of linear transformation. In order to obtain high-quality modifications of speech signals our VC system is implemented using the Harmonic plus Noise Model (HNM)analysis/synthesis framework. Experimental results are reported on the English corpus, MOCHA-TlMlT.

  • PDF

A Study on Development of VUI(Voice User Interface) using VoiceXML (VoiceXML을 이용한 VUI 개발에 관한 연구)

  • 장민석;양운모
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2002.04a
    • /
    • pp.349-351
    • /
    • 2002
  • 한국현재의 컴퓨팅환경은 Text위주의 Command Line상에서의 입출력에서 GUI(Graphic User Interface)환경으로 전환되었다. 이는 사용자에게 좀더 친근한 방법으로의 컴퓨팅환경을 제공하고 있는 것이다. 하지만 아직까지 그러한 환경에 익숙해지기 위해서는 많은 습득시간이 필요하며 또한, 응용프로그램 간의 인터페이싱 기능 등을 익히기 위해서는 추가적인 학습을 통해야 원활한 작업을 수행할 수 있다. 이를 해결하고자 본 연구는 음성인식/ 합성과, 현재 음성마크업 언어인 VoiceXML 등을 통해서 모색해보고자 한다.

  • PDF