• 제목/요약/키워드: voice recognition

검색결과 650건 처리시간 0.036초

무선랜 환경에서의 분산 음성 인식을 이용한 음성 다이얼링 시스템 (A Voice-Activated Dialing System with Distributed Speech Recognition in WiFi Environments)

  • 박성준;구명완
    • 대한음성학회지:말소리
    • /
    • 제56호
    • /
    • pp.135-145
    • /
    • 2005
  • In this paper, a WiFi phone system with distributed speech recognition is implemented. The WiFi phone with voice-activated dialing and its functions are explained. Features of the input speech are extracted and are sent to the interactive voice response (IVR) server according to the real-time transport protocol (RTP). Feature extraction is based on the European Telecommunication Standards Institute (ETSI) standard front-end, but is modified to reduce the processing time. The time for front-end processing on a WiFi phone is compared with that in a PC.

  • PDF

Adaptive Post Processing of Nonlinear Amplified Sound Signal

  • Lee, Jae-Kyu;Choi, Jong-Suk;Seok, Cheong-Gyu;Kim, Mun-Sang
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 2005년도 ICCAS
    • /
    • pp.872-876
    • /
    • 2005
  • We propose a real-time post processing of nonlinear amplified signal to improve voice recognition in remote talk. In the previous research, we have found the nonlinear amplification has unique advantage for both the voice activity detection and the sound localization in remote talk. However, the original signal becomes distorted due to its nonlinear amplification and, as a result, the rest of sequence such as speech recognition show less satisfactorily results. To remedy this problem, we implement a linearization algorithm to recover the voice signal's linear characteristics after the localization has been done.

  • PDF

장애인을 위한 멀티모달 인터페이스 기반의 홈 네트워크 제어 (Home Automation Control with Multi-modal Interfaces for Disabled Persons)

  • 박희동
    • 디지털융복합연구
    • /
    • 제12권2호
    • /
    • pp.321-326
    • /
    • 2014
  • 최근 장애인을 위한 IT 접근성 향상 기술에 대한 요구가 증대되고 있다. 따라서 장애인 IT 사용자를 위하여 음성 인식, 영상 인식, TTS 등과 같은 멀티모달 인터페이스를 지원하는 것이 매우 중요하다. 본 논문에서는 홈 네트워크 제어에 있어서 장애인 IT 접근성 향상 기술의 적용 방안에 대하여 서술한 후, 장애인이 쉽게 홈 네트워크를 제어할 수 있도록 음성 인식 및 애니메이션 UI (User interfaces)등과 같은 멀티모달 인터페이스 기반의 홈 네트워크 제어 시스템 모델을 구현하였다.

한국어 음성인식 시스템 향상을 위한 동음이철 단위의 중의성 유형 분류 (Ambiguity Types of the Homonymic & Heterographic Units for Improving Korean Voice Recognition System - a Preliminary Research)

  • 윤애선;강미영
    • 음성과학
    • /
    • 제15권4호
    • /
    • pp.67-81
    • /
    • 2008
  • The accuracy rate of P2G (Phoneme-to-Grapheme) is one of the important factors determining the quality of unlimited voice recognition (VR) systems. Few studies were, however, conducted to reduce ambiguities of a phoneme string which can be segmented into a variety of different linguistic units (i.e. morphemes, words, eo-jeols), thus be transformed into more than one grapheme string. This paper is a preliminary research for building a large knowledge base of those homonymic & heterographic units(HHUs), which will provide unlimited Korean VR systems with more accurate P2G information. This paper analyzes 2 main factors generating HHUs: (1) boundary determination of the prosodic unit; (2) its segmentation into linguistic units. In this paper, linguistic characteristics determining variable boundaries of a prosodic unit are investigated, and the ambiguity types of HHUs are classified in accordance with their morphological and syntactic structures as well as with the phonological rules governing them.

  • PDF

Voice Recognition Softwares: Their implications to second language teaching, learning, and research

  • Park, Chong-won
    • 음성과학
    • /
    • 제7권3호
    • /
    • pp.69-85
    • /
    • 2000
  • Recently, Computer Assisted Language Learning (CALL) received widely held attention from diverse audiences. However, to the author's knowledge, relatively little attention was paid to the educational implications of voice recognition (VR) softwares in language teaching in general, and teaching and learning pronunciation in particular. This study explores, and extends the applicability of VR softwares toward second language research areas addressing how VR softwares might facilitate interview data entering processes. To aid the readers' understanding in this field, the background of classroom interaction research, and the rationale of why interview data, therefore the role of VR softwares, becomes critical in this realm of inquiry will be discussed. VR softwares' development and a brief report on the features of up-to-date VR softwares will be sketched. Finally, suggestions for future studies investigating the impact of VR softwares on second language learning, teaching, and research will be offered.

  • PDF

VOICE CONTROL SYSTEM FOR TELEVISION SET USING MASKING MODEL AS A FRONT-END OF SPEECH RECOGNIZER

  • Usagawa, Tsuyoshi;Iwata, Makoto;Ebata, Masanao
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1994년도 FIFTH WESTERN PACIFIC REGIONAL ACOUSTICS CONFERENCE SEOUL KOREA
    • /
    • pp.991-996
    • /
    • 1994
  • Surrounding noise often affects the performance of speech recognition system when it is used in office or home. Especially situation is more serious when colored and nonstational noise such as an sound from television or other audio equipment is introduced. The authors proposed a voice control system for television set using an adaptive noise canceler, and it works well even is sound of television set has comparable level of speech. In this paper, a new front-end of speech recognition is introduced for the voice control system. This font-end utilizes a simplified masking model to reduce the effect of residual noise. According to experimental results, 90% correct recognition is achieved even if the level of television sound is almost 15dB higher than one of speech.

  • PDF

VoiceXML을 사용한 상가 검색 음성인식 시스템의 설계 및 구현 (Design and Implementation of Store Locator Voice Recognition System Using VoiceXML)

  • 김우일;송성균;고경만;윤재석;김국보
    • 한국멀티미디어학회:학술대회논문집
    • /
    • 한국멀티미디어학회 2002년도 춘계학술발표논문집(상)
    • /
    • pp.138-143
    • /
    • 2002
  • 음성은 컴퓨터와 인간 사이의 인터페이스로서 지속적인 연구가 되어 왔다. VoiceXML로 구현된 음성 포털 서비스는 사용자의 음성 질의에 따라 정보를 검색하고 청취할 수 있는 기술로서 현재 다양한 컨텐츠로 서비스가 진행되고 있다. 본 연구에서는 전화나 인터넷 전화 프로그램으로 상가의 위치, 전화 번호, 상가 소개 등의 정보를 음성으로 검색할 수 있는 시스템을 VoiceXML을 이용하여 구현하여 보았다. 웹과 연동할 수 있도록 시스템을 구성하고 다양한 다이얼로그를 표현하기 위해 특히, JSP를 이용하고 각 로직을 자바빈즈 컴포넌트로 구현하였다.

  • PDF

응답형 음성제어 전동 휠체어(INMEL-1)의 설계 (Design of the Motorized Wheel Chair(INMEL-1) Controlled by Response Type Voices)

  • 정동명;홍승홍
    • 대한의용생체공학회:의공학회지
    • /
    • 제8권2호
    • /
    • pp.231-240
    • /
    • 1987
  • This Paper introduces a new design of motorized wheel chair for the disabled, which is intended to improve the quality of the disabled's indoor life. This vehicle was based on high manoeuvrability of the omnidirectional drive and saftey. Usually, the vehicle controlled by a joystick but also the voice control system to be prepared for the severely disabled. This voice control system responds to the result of voice recognition, state of system or warning of dangers with voices, which has real time response and 95.3% recognition ratio and satisfactory synthesis voice Quality Therefore this system is able to provide independency in driving and the disabled's daily life.

  • PDF

실버세대 및 1인 가구를 위한 인공지능 기반 음성인식 'Voice' Application 설계 및 구현 (Design and implementation of artificial intelligence-based speech recognition for silver generation and single household "Voice" Application)

  • 조영주;김진혁;선아영;오지훈
    • 한국컴퓨터정보학회:학술대회논문집
    • /
    • 한국컴퓨터정보학회 2017년도 제56차 하계학술대회논문집 25권2호
    • /
    • pp.141-144
    • /
    • 2017
  • 4차 산업혁명 시대에 살고 있는 현대인들은 혼자 사는 1인 가구의 증가와 고령화 사회의 진입으로 인한 실버세대가 증가하는 추세이다. 외로움, 소외감, 우울증을 겪는 1인 가구 및 실버세대의 문제점을 해소시켜 주고 더 나아가 실버세대의 스마트폰 활성화를 위해 본 논문에서는 인공지능 기반 음성인식 기능을 탑재한 'Voice' 어플리케이션을 제안하고자 한다.

  • PDF

병적 음성과 정상 음성의 음향학적 파라미터 분포에 대한 통계적 분석 (An analysis of a statistical difference of acoustic Parameters' distribution between normal voice and pathological voice)

  • 김용주;권순복;김기련;신민철;조철우;왕수건
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2001년도 하계종합학술대회 논문집(4)
    • /
    • pp.249-252
    • /
    • 2001
  • The most basic means of communication among humans is a voice. Without speaking of voice technologies, we found it is important and convenient to use a voice in everyday life. But. in consideration to speech recognition systems, we can't always desire a normal voice input as input signal to the system. Generally speaking. a pathological voice as against a normal which is a voice with a problem in the larynx. could be also special case of input voice. Of course, but the distortion of a speech signal by environmental effects i.e., noise or transmission channel was a raised problem. we will take up a pathological voices with laryngeal disease which is essential distortion factor in voice. Also, we are to find out the difference of acoustic parameters distribution between normal and pathological voice by a statistical method in our research.

  • PDF