• Title/Summary/Keyword: automatic voice system

Voice Activity Detection Based on Signal Energy and Entropy-difference in Noisy Environments (엔트로피 차와 신호의 에너지에 기반한 잡음환경에서의 음성검출)

  • Ha, Dong-Gyung;Cho, Seok-Je;Jin, Gang-Gyoo;Shin, Ok-Keun
    • Journal of Advanced Marine Engineering and Technology
    • /
    • v.32 no.5
    • /
    • pp.768-774
    • /
    • 2008
  • In many areas of speech signal processing, such as automatic speech recognition and packet-based voice communication, VAD (voice activity detection) plays an important role in the performance of the overall system. In this paper, we present a new feature parameter for VAD, which is the product of the signal energy and the difference between two types of entropy. To this end, we first define a Mel filter-bank based entropy and calculate its difference from the conventional entropy in the frequency domain. The difference is then multiplied by the spectral energy of the signal to yield the final feature parameter, which we call PEED (product of energy and entropy difference). Through experiments, we verified that the proposed VAD parameter is more efficient than the conventional spectral entropy based parameter across various SNRs and noisy environments.
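The PEED idea described in this abstract can be illustrated with a minimal sketch. The abstract does not give the exact definitions, so everything specific here is an assumption: the Shannon entropy of the normalized power spectrum stands in for the "conventional" entropy, the same entropy over triangular Mel band energies stands in for the Mel filter-bank entropy, and the sign of the difference, the number of bands, and all function names are hypothetical.

```python
import cmath
import math


def power_spectrum(frame):
    """Naive DFT power spectrum (bins 0..N/2) of one frame."""
    n = len(frame)
    spec = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame))
        spec.append(abs(acc) ** 2)
    return spec


def entropy(values):
    """Shannon entropy of non-negative values after normalizing them to sum to 1."""
    total = sum(values)
    if total <= 0.0:
        return 0.0
    return -sum((v / total) * math.log(v / total) for v in values if v > 0.0)


def mel_band_energies(spec, sample_rate, n_bands=8):
    """Energies of triangular filters spaced evenly on the mel scale (assumed layout)."""
    to_mel = lambda f: 2595.0 * math.log10(1.0 + f / 700.0)
    to_hz = lambda m: 700.0 * (10.0 ** (m / 2595.0) - 1.0)
    top = to_mel(sample_rate / 2.0)
    edges = [to_hz(top * i / (n_bands + 1)) for i in range(n_bands + 2)]
    bin_hz = (sample_rate / 2.0) / (len(spec) - 1)
    bands = []
    for b in range(n_bands):
        lo, mid, hi = edges[b], edges[b + 1], edges[b + 2]
        energy = 0.0
        for k, p in enumerate(spec):
            f = k * bin_hz
            if lo < f <= mid:          # rising slope of the triangle
                energy += p * (f - lo) / (mid - lo)
            elif mid < f < hi:         # falling slope of the triangle
                energy += p * (hi - f) / (hi - mid)
        bands.append(energy)
    return bands


def peed(frame, sample_rate):
    """Spectral energy times (conventional entropy - Mel band entropy); sign is assumed."""
    spec = power_spectrum(frame)
    energy = sum(spec)
    diff = entropy(spec) - entropy(mel_band_energies(spec, sample_rate))
    return energy * diff
```

In a detector, a value like this would be computed per frame and compared against a threshold. How that threshold is chosen is not stated in the abstract.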

A study on a design of developed-ERES/WCS using the ASR and fuzzy set theory as a part of human interface technique (Human interface 기술의 일환으로서 ASR과 fuzzy set theory를 이용한 developed-ERES/WCS 설계에 관한 연구)

  • 이순요;이창민;박세권
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1988.10a
    • /
    • pp.76-81
    • /
    • 1988
  • As a means of human interface, this study designs Developed-ERES/WCS with voice recognition capability and fuzzy set theory. In the advanced teleoperator system, when an error occurs in automatic mode, the system switches to manual mode, in which a human intervenes to recover from the error. The purpose of this study is to reduce human workload and to shorten error recovery time.

A Study on the Development of Korea Telecom Automatic Voice Recognition System (음성인식에 의한 연구센타 부서안내 시스팀 개발에 관한 연구)

  • Koo, Myoung-Wan;Sohn, Il-Hyun;Doh, Sam-Joo;Lee, Jong-Rak
    • Annual Conference on Human and Language Technology
    • /
    • 1992.10a
    • /
    • pp.185-192
    • /
    • 1992
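  • This paper describes a research center department guidance system based on speech recognition technology (KARS: Korea Telecom Automatic voice Recognition System). The system is basically similar to a voice response system, but differs in that it uses voice instead of push buttons for command input. When a user enters a voice command through a microphone, the system recognizes the command and provides a brief introduction, the telephone number, and the location of each department in the research center. It is a speaker-independent isolated-word recognition system based on HMMs (Hidden Markov Models) and can recognize 123 words, consisting of 116 department names and 7 control words. The system uses phoneme-like Korean subword units as the basic HMM unit, and recognition experiments yielded a recognition rate of 98.6%.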

Training of Fuzzy-Neural Network for Voice-Controlled Robot Systems by a Particle Swarm Optimization

  • Watanabe, Keigo;Chatterjee, Amitava;Pulasinghe, Koliya;Jin, Sang-Ho;Izumi, Kiyotaka;Kiguchi, Kazuo
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2003.10a
    • /
    • pp.1115-1120
    • /
    • 2003
  • The present paper shows the possible development of particle swarm optimization (PSO) based fuzzy-neural networks (FNN), which can be employed as an important building block in real-life robot systems controlled by voice-based commands. The PSO is employed to train the FNNs, which can accurately output crisp control signals for the robot systems based on fuzzy linguistic spoken language commands issued by a user. The FNN is also trained to capture the user's spoken directive in the context of the present performance of the robot system. Hidden Markov Model (HMM) based automatic speech recognizers are developed as part of the entire system, so that the system can identify important user directives from the running utterances. The system is successfully employed in a real-life situation for motion control of a redundant manipulator.

A Threshold Adaptation based Voice Query Transcription Scheme for Music Retrieval (음악검색을 위한 가변임계치 기반의 음성 질의 변환 기법)

  • Han, Byeong-Jun;Rho, Seung-Min;Hwang, Een-Jun
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.59 no.2
    • /
    • pp.445-451
    • /
    • 2010
  • This paper presents a threshold adaptation based voice query transcription scheme for music information retrieval. The proposed scheme analyzes a monophonic voice signal and generates its transcription for diverse music retrieval applications. For accurate transcription, we propose several advanced features, including (i) Energetic Feature eXtractor (EFX) for onset, peak, and transient area detection; (ii) Modified Windowed Average Energy (MWAE) for defining multiple small but coherent windows with local threshold values as an offset detector; and (iii) Circular Average Magnitude Difference Function (CAMDF) for accurate acquisition of the fundamental frequency (F0) of each frame. To evaluate the performance of the proposed scheme, we implemented a prototype music transcription system called AMT2 (Automatic Music Transcriber version 2) and carried out various experiments. In the experiments, we used the QBSH corpus [1], which was adopted for the MIREX 2006 contest data set. Experimental results show that the proposed scheme can improve transcription performance.
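The CAMDF step mentioned in this abstract can be sketched as follows. This assumes the commonly cited form of the circular average magnitude difference function, D(lag) = sum over n of |x[(n + lag) mod N] - x[n]|, and picks the smallest lag whose value lies close to the global minimum. The search range, the `tolerance` heuristic, and the function names are illustrative assumptions and are not taken from the paper.

```python
import math


def camdf(frame, lag):
    """Circular AMDF: the frame is compared with a circularly shifted copy of itself."""
    n = len(frame)
    return sum(abs(frame[(i + lag) % n] - frame[i]) for i in range(n))


def estimate_f0(frame, sample_rate, f_min=80.0, f_max=800.0, tolerance=0.05):
    """Estimate F0 of one frame from the deepest CAMDF valley in a plausible lag range."""
    lag_lo = max(1, int(sample_rate / f_max))
    lag_hi = min(len(frame) // 2, int(sample_rate / f_min))
    scores = {lag: camdf(frame, lag) for lag in range(lag_lo, lag_hi + 1)}
    low, high = min(scores.values()), max(scores.values())
    # Assumed heuristic: prefer the shortest lag near the minimum, which avoids
    # picking a multiple of the true period (an octave error).
    threshold = low + tolerance * (high - low)
    best = min(lag for lag, score in scores.items() if score <= threshold)
    return sample_rate / best
```

For example, `estimate_f0(frame, 8000)` on a frame holding a whole number of periods of a 200 Hz tone should return a value at or very near 200 Hz. Real voice frames would need the energy-based onset and offset detection described above before this step.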

A Study On the Automatic Generation System of Mobile Voice Web Page (모바일 음성 웹 페이지의 자동 생성 시스템에 관한 연구)

  • You-Jung Ko;Yoon-Joong Kim
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2008.11a
    • /
    • pp.153-156
    • /
    • 2008
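  • Because mobile devices have small screens, using web content with a stylus or pen is inconvenient. Accordingly, VoiceXML (Voice Extensible Markup Language) and SALT (Speech Application Language Tags), standard languages for developing voice-driven web content, are spreading rapidly. To use them, existing mobile web pages must be converted to conform to voice web standards. This paper therefore aims to implement a system that automatically generates voice-command-enabled mobile voice web pages (WML + SALT) from mobile web pages written in WML (Wireless Markup Language), using SALT speech technology. As a result, users gain convenience by controlling content through voice commands, and developers can reduce the time and cost of converting existing mobile web pages into voice web pages by using the automatic generation system.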

Telecommunication System Construction to minimize the Casualty of Fisher in the coastal Fishing Boat (연안 어선에서 어선원 인명피해 최소화를 위한 통신 체계 구축)

  • Kim, Seok-Jae;Kim, Wook-Sung;Lee, Yoo-Won
    • Journal of Fisheries and Marine Sciences Education
    • /
    • v.25 no.3
    • /
    • pp.580-586
    • /
    • 2013
  • To construct a telecommunication system that minimizes casualties among fishers, we investigated the usability of the TRS communication system and the performance of a GPS automatic position transmitter (APT), which can be used for the survival, search, and rescue of victims. Trial experiments were conducted at sea with TRS and CDMA in the East, West, and South Seas of Korea from October to December. As a result, the usability of TRS as an emergency communication device was verified, since it provided stable position and voice information up to 50 km from the coast. The system is therefore expected to help minimize the number of victims.

Unmanned Automatic System for Function Test of Analog Subscriber Line Card (아날로그 가입자 정합 회로 기능시험을 위한 무인 자동화 시스템)

  • 이성원;김영범
    • Proceedings of the Korean Society of Precision Engineering Conference
    • /
    • 2002.05a
    • /
    • pp.432-437
    • /
    • 2002
  • The DSPA311 (Analog Subscriber Line Board Assembly) provides the interface between analog subscribers and the TDX-100 exchange system. The DSPA311 belongs to the ASI block, accommodates 32 channels of dial and MFC telephone subscribers, interfaces voice signals with the TSW, and has a 2-wire and 4-wire loop impedance of 600 ohms. It consists of 4-channel QSLM-10 (Quad Subscriber Line Module-10) daughter boards, performs the BORSCHT functions, and supports A-law/µ-law selection and gain control through data control by the DSPA171 (Device Controller I). In this paper, we describe a function test program for the DSPA311 board using the HP3070CT combinational test system, and an unmanned automatic test system.

Study of the Wheelchair controlled by Joystick and Voices (조이스틱제어 및 음성으로 제어되는 휠체어의 연구)

  • Min, Hea-Jung;Yoon, Hung-Ri
    • Proceedings of the KIEE Conference
    • /
    • 1988.07a
    • /
    • pp.723-726
    • /
    • 1988
  • This paper studies the automatic control of wheelchairs. Control is implemented with a joystick and simulated with voice signal recognition. The joystick control system is designed as follows: the joystick paddle is connected to a timer whose output is high only while the joystick is moved. A computer reads the duration of this high state and outputs a motor control word determined from this value using a look-up table. The voice control system is designed as follows: partial autocorrelation coefficients are computed from A/D-converted signals and compared with reference patterns. The motor control word is then decided by the nearest neighbor rule.
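The voice control path described in this abstract might look roughly like the sketch below. It assumes the partial autocorrelation coefficients are obtained as Levinson-Durbin reflection coefficients (the two agree up to a sign convention) and that matching uses squared Euclidean distance. The analysis order, the command names, and the function names are hypothetical and are not given in the abstract.

```python
def autocorrelation(frame, max_lag):
    """Autocorrelation of one frame for lags 0..max_lag (requires max_lag < len(frame))."""
    n = len(frame)
    return [sum(frame[i] * frame[i + k] for i in range(n - k)) for k in range(max_lag + 1)]


def reflection_coefficients(frame, order):
    """Levinson-Durbin recursion; returns `order` reflection coefficients."""
    r = autocorrelation(frame, order)
    coeffs = [0.0] * order
    if r[0] <= 0.0:                 # silent frame: nothing to analyze
        return coeffs
    a = [1.0] + [0.0] * order       # prediction polynomial, A(z) = 1 + sum a[j] z^-j
    err = r[0]                      # prediction error energy
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err
        coeffs[i - 1] = k
        prev = a[:]
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
        if err <= 0.0:              # perfectly predictable signal: stop early
            break
    return coeffs


def nearest_command(features, references):
    """Nearest neighbor rule: return the command whose reference pattern is closest."""
    def sq_dist(u, v):
        return sum((x - y) ** 2 for x, y in zip(u, v))
    return min(references, key=lambda name: sq_dist(features, references[name]))
```

A usage sketch with made-up reference patterns: `nearest_command(reflection_coefficients(samples, 2), {"FORWARD": [-0.9, 0.5], "STOP": [0.2, -0.1]})` returns the name of the closest pattern, which would then be mapped to a motor control word.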

A Study on the Automatic Monitoring System for the Contact Center Using Emotion Recognition and Keyword Spotting Method (감성인식과 핵심어인식 기술을 이용한 고객센터 자동 모니터링 시스템에 대한 연구)

  • Yoon, Won-Jung;Kim, Tae-Hong;Park, Kyu-Sik
    • Journal of Internet Computing and Services
    • /
    • v.13 no.3
    • /
    • pp.107-114
    • /
    • 2012
  • In this paper, we propose an automatic monitoring system for contact centers to manage customer complaints and agent quality. The proposed system allows more accurate monitoring by applying emotion recognition and keyword spotting to neutral and angry voice emotion. The system can provide professional consultation and management for customers who engage in verbal violence, such as abuse and sexual harassment. We developed a method for building a robust algorithm on a heterogeneous speech DB of many unspecified customers. Experimental results on real contact center speech data confirm stable and improved performance.