• Title/Summary/Keyword: Voice interface

Search Result 297, Processing Time 0.036 seconds

A Method of Arrangement of Voice and Sound : For User Interface of Domestic Appliance (음성과 소리의 할당 방법 : 가전제품 UI 를 중심으로)

  • Hong, Ji-Young;Chae, Haeng-Suk;Lee, Seung-Yong;Park, Young-Hyun;Kim, Jun-Hee;Ryu, Hyung-Su;Kim, Jong-Wan;Han, Kwang-Hee
    • 한국HCI학회:학술대회논문집
    • /
    • 2007.02b
    • /
    • pp.478-483
    • /
    • 2007
  • 본 연구는 가전제품 사용자 인터페이스에서 음성 신호와 청각 신호의 최적 할당 방법을 기술하였다. 가정에서 수시로 접하는 가전제품에서 음성 유저 인터페이스(Voice User Interface, 이하 VUI) 는 음성을 매개로 일어나는 인간과 기계 간 인터페이스를 뜻한다. 음성 유저 인터페이스의 단독적 적용보다는 소리 신호와 함께 사용하여 사용자들의 인터페이스를 향상시킬 수 있다. 본 연구에서는 주부 사용자들을 대상으로 F.G.I, 실험, Depth Interview 를 수행하여 가전제품의 음성 생성 및 표현 인터페이스에서 음성과 소리 신호의 배치에 대한 사용자들의 니즈 조사 및 실험 결과를 기반으로 최적의 할당 방법을 제시하였다.

  • PDF

Expected Matching Score Based Document Expansion for Fast Spoken Document Retrieval (고속 음성 문서 검색을 위한 Expected Matching Score 기반의 문서 확장 기법)

  • Seo, Min-Koo;Jung, Gue-Jun;Oh, Yung-Hwan
    • Proceedings of the KSPS conference
    • /
    • 2006.11a
    • /
    • pp.71-74
    • /
    • 2006
  • Many works have been done in the field of retrieving audio segments that contain human speeches without captions. To retrieve newly coined words and proper nouns, subwords were commonly used as indexing units in conjunction with query or document expansion. Among them, document expansion with subwords has serious drawback of large computation overhead. Therefore, in this paper, we propose Expected Matching Score based document expansion that effectively reduces computational overhead without much loss in retrieval precisions. Experiments have shown 13.9 times of speed up at the loss of 0.2% in the retrieval precision.

  • PDF

A Study on 3D View Design of Images and Voices Integration for Effective Information Transfer (효과적 정보전달을 위한 영상정보의 3D 뷰 및 음성정보와의 융합 연구)

  • Shin, C.H.;Lee, J.S.
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.35 no.1B
    • /
    • pp.35-41
    • /
    • 2010
  • In this paper, we propose a 3D view design scheme which arranges 2D information in a 3D virtual space with a flexible interface and voice information. The scheme allows the user interface of the 2D image in 3D virtual space anytime from any view point. Voice information can be easily attached. It is this simple and efficient image and voice information arrangement in 3D virtual space that improves information transfer.

Design and Implementation of IVR Server Using VoiceXML (VoiceXML을 이용한 IVR 서버 설계 및 구현)

  • Lee, Chang-Ho;Jang, Won-Jo;Kang, Sun-Mee
    • Speech Sciences
    • /
    • v.9 no.3
    • /
    • pp.47-59
    • /
    • 2002
  • A new brilliant service using human-voice and DTMF (Dual Tone Multi Frequency) technique is expected nowadays in order to obtain valuable information on the internet more easily. VoiceXML (Voice eXtensible Markup Language) is the right choice that makes the new service possible. In this paper, the design and implementation of IVR (Interactive Voice Response) server using VoiceXML is described, where it connects with internet and IVR server efficiently. IVR server using VoiceXML is composed of two groups: VoiceXML document handling and VoiceXML execution. Scenario part of IVR server corresponds to VoiceXML document, the execution is performed by VoiceXML execution.

  • PDF

A Method For Utilizing Voice Interface in Web Environment Using VoiceXML (웹 환경에서 VoiceXML을 이용한 음성 인터페이스 활용방안)

  • 장민석;방초균
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2002.04a
    • /
    • pp.451-453
    • /
    • 2002
  • 현재의 웹 환경은 HTML로 구성이 되어있고 이로인해 하이퍼링크를 따라가기 위해 마우스 클릭을 통해 작업하는 GUI환경이 주를 이룬다. 하지만 이러한 방법은 인간이 가장 손쉽게 사용하는 음성과 비교해 볼때 상당히 불편한 축에 속한다. 이를 해결하기 위해 현재 무르익은 음성인식 기술과 전화기를 통해 정보를 제공하고자 하는 XML의 파생인 VoiceXML을 이용하여 현재 HTML이 주류를 이루는 웹 환경을 VoiceXML을 이용한 음성인터페이스 환경을 마련하고자 한다.

  • PDF

A Method For Utilizing Voice Interface in Web Environment Using VoiceXML (웹 환경에서 VoiceXML을 이용한 음성인터페이스 활용방안)

  • Jang, Min-Seok;Bang, Cho-Kyun
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2002.04b
    • /
    • pp.1447-1450
    • /
    • 2002
  • 현재의 웹 환경은 HTML로 구성이 되어있고 이로인해 하이퍼링크를 따라가기 위해 마우스 클릭을 통해 작업하는 GUI환경이 주를 이룬다. 하지만 이러한 방법은 인간이 가장 손쉽게 사용하는 음성과 비교해 볼 때 상당히 불편한 축에 속한다. 이를 해결하기 위해 현재 무르익은 음성인식 기술과 전화기를 통해 정보를 제공하고자 하는 XML의 파생인 VoiceXML을 이용하여 현재 HTML이 주류를 이루는 웹 환경을 VoiceXML을 이용한 음성인터페이스 환경을 마련하고자 한다.

  • PDF

Design and Control of Haptic Device using Voice Coil Type Motor (보이스 코일형 모터를 이용한 햅틱 장치의 설계 및 제어)

  • Sung, Ha-Gyeong;Borm, Jin-Hwan
    • The Transactions of the Korean Institute of Electrical Engineers D
    • /
    • v.51 no.10
    • /
    • pp.439-445
    • /
    • 2002
  • In this paper force feedback control system is investigated for improving the quality of the haptic feedback in virtual reality applications. We suggested the method of controlling the haptic device and modelling the virtual environment. Haptic device is composed of five bar link structure, voice coil motor, control board, and virtual environment modeling program. We applied voice coil motor in the actuating system for simple structure and easy control. Virtual environment modelling is constructed in PC, and the control signals of the actuators and the encoder data are transferred to the control system through USB. Experiment is performed to evaluate the characteristics of the haptic device.

The Study on the Quality Assessment Model of Aircraft Voice Recognition Software (항공기 음성인식 소프트웨어 품질 평가 모델 연구)

  • Lee, Seung-Mok
    • Journal of Software Assessment and Valuation
    • /
    • v.15 no.2
    • /
    • pp.73-83
    • /
    • 2019
  • Voice Recognition has recently been improved with AI(Artificial Intelligence) and has greatly improved the false recognition rate and provides an effective and efficient Human Machine Interface (HMI). This trend has also been applied in the defense industry, particularly in the aviation, F-35. However, for the quality evaluation of Voice Recognition, the defense industry, especially the aircraft, requires measurable quantitative models. In this paper, the quantitative evaluation model is proposed for applying Voice Recognition to aircraft. For the proposal, the evaluation items are identified from the Voice Recognition technology and ISO/IEC 25000(SQuaRE) quality attributes. Using these two perspectives, the quantitative evaluation model is proposed under aircraft operation condition and confirms the evaluation results.

AI Voice Agent and Users' Response (AI 음성 에이전트의 음성 특성에 대한 사용자 반응 연구)

  • Beak, Seung Ju;Jung, Yoon Hyuk
    • The Journal of Information Systems
    • /
    • v.31 no.2
    • /
    • pp.137-158
    • /
    • 2022
  • Purpose As artificial intelligence voice agents (AIVA) have been widely adopted in services, diverse forms of their voices, which are the main interface with users, have been experimented. The purpose of this study is to examine how users evaluate vocal characteristics (gender, voice pitch, and voice pace) of AIVA, depending on prior research on human voice attractiveness. Design/methodology/approach This study employed an experimental survey which 516 participated in. Each participant was randomly assigned into one of eight situations (e.g., male - higher pitch - faster pace) and listened a AIVA voice sample, which introduce weather information. Next, a participant answered three consequence factors (attractiveness, trust, and anthropomorphism). Findings The results reveal that female voices of AIVA were perceived as more attractive and trustworthy than male voices. As far as voice pitch goes, while lower-pitch voices were preferred in female voices, higher-pitch voices were preferred in male voices. Finally, faster voices of AIVA were more attractive than slower voices.

Implementation of an AAL2 processor for voice gateway application (음성 게이트웨이 응용을 위한 AAL2 프로세서 구현)

  • 이상길;최명렬
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.27 no.11C
    • /
    • pp.1152-1157
    • /
    • 2002
  • In this paper, a detailed procedure of development for an AAL2 processor widely used in voice gateway application is introduced. The processor supports CPS and SSCS with voice service and framed mode data service. It provides 4 ATM virtual connections, which include 1020 AAL2 channels. The processor has one UTOPIA Level 1 interface for an ATM cell interface and 4 TDM ports for a voice channel interface. The TDM ports carry PCM/ADPCM voice streams. Most AAL2 processors are implemented as software, or hardware and software, so its latency is large. But this processor has very low latency as to CPS and SSCS because all of them are implemented in hardware. Also, it allows not only loopback and switching of CPS packets, but loopback and switching of TDM channels. The key feature is that the internal structure of the CPS and SSCS in this processor seems like as each software function, so they are called whenever they are required. In addition, they are reusable for another design and are scalable for more channels.