Search | Korea Science

Implementation of a Robust Speech Recognizer in Noisy Car Environment Using a DSP (DSP를 이용한 자동차 소음에 강인한 음성인식기 구현)

Chung, Ik-Joo
- Speech Sciences
- v.15 no.2
- pp.67-77
- 2008
In this paper, we implement a robust speech recognizer on the TMS320VC33 DSP. For this implementation, we built speech and noise databases suited to a recognizer that uses spectral subtraction for noise removal. The recognizer has an explicit structure in that the speech signal is enhanced by spectral subtraction before endpoint detection and feature extraction. This makes the operation of the recognizer clear and helps build HMM models with minimal model mismatch. Because the recognizer was developed for controlling car facilities and for voice dialing, it has two recognition engines: a speaker-independent engine for controlling car facilities and a speaker-dependent engine for voice dialing. We adopted a continuous HMM for the former and a conventional DTW algorithm for the latter. Through various off-line recognition tests, we selected optimal values of several recognition parameters for a resource-limited embedded recognizer, which led to HMM models with three mixtures per state. The speech database with added car noise is enhanced using spectral subtraction before HMM parameter estimation, to reduce the model mismatch caused by the nonlinear distortion that spectral subtraction introduces. The hardware module includes a microcontroller for the host interface, which handles the protocol between the DSP and a host.
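The enhancement step described in this abstract can be illustrated with a minimal magnitude-domain spectral subtraction sketch. The abstract does not state which variant the author used, so the function names, the over-subtraction factor `alpha`, and the spectral `floor` below are all assumptions chosen for illustration, and the input is assumed to be magnitude spectra that have already been computed per frame.

```python
from typing import List

def estimate_noise(noise_frames: List[List[float]]) -> List[float]:
    """Average the magnitude spectrum over frames assumed to contain only noise."""
    count = len(noise_frames)
    bins = len(noise_frames[0])
    return [sum(frame[k] for frame in noise_frames) / count for k in range(bins)]

def spectral_subtract(frame: List[float], noise: List[float],
                      alpha: float = 1.0, floor: float = 0.02) -> List[float]:
    """Subtract the noise estimate in each bin.

    alpha (over-subtraction factor) and floor (fraction of the original
    magnitude kept as a lower bound) are hypothetical tuning parameters.
    The floor prevents negative magnitudes after subtraction.
    """
    return [max(m - alpha * n, floor * m) for m, n in zip(frame, noise)]

# Two noise-only frames, then one noisy speech frame (three frequency bins).
noise = estimate_noise([[1.0, 2.0, 1.0], [3.0, 2.0, 1.0]])
enhanced = spectral_subtract([10.0, 2.5, 1.0], noise)
print(noise)     # [2.0, 2.0, 1.0]
print(enhanced)  # [8.0, 0.5, 0.02]
```

The flooring in the last bin is one reason the operation is nonlinear, which is consistent with the abstract's point that training data should pass through the same enhancement as test data.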

Voice Dialing System using Speaker Dependent Recognition for Korean Digit Speech (화자 종속 한국어 숫자음 음성 인식 다이얼링 시스템)

Park, Kee-Young;Shin, You-Shik;Kim, Chong-Kyo
- Journal of the Korean Institute of Telematics and Electronics T
- v.36T no.2
- pp.56-62
- 1999
This paper describes a voice dialing system (VDS) and its hardware implementation for speaker-dependent recognition of Korean digit speech using duty cycle. The proposed VDS consists of an integrator, a leveling divider circuit, and a recognition program. The analog speech signal is applied to the VDS through a low-pass filter with a cutoff frequency of 4.5 kHz. The hardware system was confirmed to perform speaker-dependent recognition of Korean digit speech. Experimental results show that the recognition rate is 64% on average for Korean digit speech. Moreover, a recognition rate of 100% is obtained for the digits /4/, /5/, /6/, /7/, /9/, and /0/.

Speaker Adaptation for Voice Dialing (음성 다이얼링을 위한 화자적응)

;Chin-Hui Lee
- The Journal of the Acoustical Society of Korea
- v.21 no.5
- pp.455-461
- 2002
This paper presents a method that improves the performance of a personal voice dialing system that uses speaker-independent phoneme HMMs. Because such a system stores only the phone transcription of each input sentence, the storage space can be reduced greatly. However, its performance is worse than that of a system using speaker-dependent models, owing to the phone recognition errors that arise when speaker-independent models are used. To solve this problem, we present a new method that jointly estimates the transformation vectors for speaker adaptation and the transcriptions from training utterances. The biases and transcriptions are estimated iteratively from the training data of each user, using a maximum likelihood approach to stochastic matching with speaker-independent phone models. Experimental results show that the proposed method is superior to the conventional method, which uses transcriptions only.
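The alternating estimation described in this abstract can be illustrated with a heavily simplified sketch. It is not the authors' method: nearest-mean labeling stands in for HMM decoding of the transcription, each "model" is a single mean vector with unit variance (so the maximum likelihood bias is the mean residual), and all names and data are hypothetical.

```python
from typing import List, Sequence, Tuple

def nearest_mean(x: Sequence[float], means: Sequence[Sequence[float]]) -> int:
    """Index of the model mean closest to x (squared Euclidean distance)."""
    return min(range(len(means)),
               key=lambda i: sum((a - b) ** 2 for a, b in zip(x, means[i])))

def estimate_bias(frames: Sequence[Sequence[float]],
                  means: Sequence[Sequence[float]],
                  iterations: int = 5) -> Tuple[List[float], List[int]]:
    """Alternate between labeling frames and re-estimating one bias vector."""
    dim = len(frames[0])
    bias = [0.0] * dim
    labels: List[int] = []
    for _ in range(iterations):
        # Step 1: label each bias-compensated frame (stand-in for decoding).
        compensated = [[xi - bi for xi, bi in zip(x, bias)] for x in frames]
        labels = [nearest_mean(c, means) for c in compensated]
        # Step 2: under the unit-variance assumption, the ML bias is the
        # average difference between each frame and its assigned mean.
        bias = [sum(x[d] - means[l][d] for x, l in zip(frames, labels)) / len(frames)
                for d in range(dim)]
    return bias, labels

# Hypothetical one-dimensional data: two models, frames shifted by about +1.0.
means = [[0.0], [4.0]]
frames = [[1.0], [1.2], [5.0], [4.8]]
bias, labels = estimate_bias(frames, means)
print(labels)  # [0, 0, 1, 1]
```

In this toy case the estimated bias settles close to 1.0 after the first iteration, and the labels do not change afterwards.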

A Study on Isolated Word Recognition for Implementation of Real-Time Voice Dialing System (실시간 음성 다이얼링 시스템 구현을 위한 단독어 인식에 관한 연구)

이항섭;홍진우;이강성;김순협
- The Journal of the Acoustical Society of Korea
- v.11 no.1E
- pp.5-14
- 1992
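This paper describes speaker-dependent isolated word recognition for implementing a real-time voice dialing system. The recognition models are DMS models, which require little memory and little computation time. The vocabulary consists of 50 department names within a university, and recognition results were obtained within 3 seconds after an utterance. The system achieved 98% performance when DMS models with 22 sections and a weight of 0.6 were used as reference patterns.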

Introduction to 마이포스, a Next-Generation Enterpriseware (차세대 엔터프라이즈웨어 마이포스 소개)

정창현
- Proceedings of the Korea Database Society Conference
- 1995.12a
- pp.3-19
- 1995
System Technology ★ Server Technology - operating environment setup ★ Network configuration design - ATM, FDDI, NMS ★ Benchmarking by client/server system configuration ★ Windows menu and GUI design ★ Multi-function PC operating environment configuration System Technology ★ Data Base Technology - DB Administration - DB Performance Tuning ★ System Integration Technology - Application Integration - System Flow Control - Task Control - Applicational Interface - S/W Down Load System Technology ★ Memory Optimization ★ IBM/Facom Host API ★ Videophone Customizing - Intel Proshare ★ Auto Dialing - CTI Link ★ IC-Card Interface System Technology ★ Sound processing - Voice Mail - syllable processing ★ Image processing ★ Help processing - Hyper Text System Technology ★ Socket Programming - urgent mail - Peer to peer message switching ★ Set Up Programming - Install Shield ★ DB Access Programming - DB-Library ★ TCP/IP Programming (abridged)

A Study on the Isolated word Recognition Using One-Stage DMS/DP for the Implementation of Voice Dialing System

Seong-Kwon Lee
- Proceedings of the Acoustical Society of Korea Conference
- 1994.06a
- pp.1039-1045
- 1994
Speech recognition systems using VQ usually suffer from a reduced recognition rate, and MSVQ from assigning dissimilar vectors to the same segment. In this paper, we apply the One-stage DMS/DP algorithm in recognition experiments to examine how far these problems can be solved. The recognition experiments are performed on Korean DDD area names, using DMS models of 20 sections and word-unit templates. In speaker-dependent and speaker-independent experiments, we obtained recognition rates of 97.7% and 81.7%, respectively.

A Study on the Voice Dialing using HMM and Post Processing of the Connected Digits (HMM과 연결 숫자음의 후처리를 이용한 음성 다이얼링에 관한 연구)

Yang, Jin-Woo;Kim, Soon-Hyob
- The Journal of the Acoustical Society of Korea
- v.14 no.5
- pp.74-82
- 1995
This paper is a study of voice dialing using HMMs and post-processing of connected digits. The HMM (Hidden Markov Model) algorithm is widely used in speech recognition with good results. However, maximum likelihood estimation in HMM training does not yield parameter values that maximize the recognition rate. To address this problem, we applied post-processing together with the segmental K-means procedure in the recognition experiments. Korean connected digits are more strongly affected by prolongation than English connected digits. To decrease segmentation errors in the level building algorithm, we added word models for forms that prolongation can produce. Rules for the added models are then applied to the recognition result, which is updated accordingly. The recognition system was implemented with a DSP board containing a TMS320C30 processor and an IBM PC. The reference patterns were made by 3 male speakers in a noisy laboratory. The recognition experiment was performed on 21 kinds of telephone numbers, 252 data samples in total. The recognition rate was 6% in the speaker-dependent test and 80.5% in the speaker-independent test.

A Study on Speech Recognition in a running automobile (주행중인 자동차 환경에서의 음성인식 연구)

유봉근
- Proceedings of the Acoustical Society of Korea Conference
- 1998.06c
- pp.47-50
- 1998
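To secure both convenience and safety in automobiles, this paper enables always-on speech input and output without operating an auxiliary switch, and uses a band-pass filter to detect speech segments (End Point Detection) automatically and accurately in noisy environments. The reference patterns use the Dynamic Multi-Section (DMS) [1] model, and a noise-robust model is selected automatically according to the vehicle speed. The speech feature parameters are 13th-order Perceptual Linear Predictive (PLP) coefficients, and the recognition algorithm is One Stage Dynamic Programming (OSDP). For 33 frequently used vehicle control commands in a running automobile (30 to 70 km/h), recognition rates of 92.98% (speaker-independent) and 94.44% (speaker-dependent) were obtained. A Voice Dialing function, which places calls by voice, was also implemented to reduce accidents caused by using car phones and mobile phones while driving.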
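As a rough illustration of endpoint detection, the sketch below finds the first sustained run of high-energy frames. It assumes the signal has already been band-pass filtered and reduced to one energy value per frame. The abstract does not give the detection rule, so the threshold, the minimum run length `min_frames`, and the function name are hypothetical.

```python
from typing import Optional, Sequence, Tuple

def detect_endpoints(energies: Sequence[float], threshold: float,
                     min_frames: int = 3) -> Optional[Tuple[int, int]]:
    """Return (start, end) frame indices of the first run of at least
    min_frames consecutive frames whose energy exceeds threshold,
    or None if no such run exists."""
    start = None
    for i, energy in enumerate(energies):
        if energy > threshold:
            if start is None:
                start = i  # a candidate speech segment begins here
        else:
            if start is not None and i - start >= min_frames:
                return (start, i - 1)
            start = None  # run was too short; treat it as a noise burst
    if start is not None and len(energies) - start >= min_frames:
        return (start, len(energies) - 1)
    return None

# Hypothetical per-frame energies: a one-frame noise burst, then speech.
energies = [0.1, 0.9, 0.1, 0.8, 0.9, 1.2, 0.7, 0.1, 0.1]
print(detect_endpoints(energies, threshold=0.5))  # (3, 6)
```

Requiring a minimum run length is a simple way to ignore short bursts such as door slams or road bumps; a fixed threshold is a simplification, since noise levels in a car change with speed.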

Design and Implementation of PSTN Auto Dialing System for VoIP Services (VoIP 서비스를 위한 PSTN 자동 발신 시스템의 설계 및 구현)

송영호;이호근;권택근
- Proceedings of the Korean Information Science Society Conference
- 2003.10c
- pp.67-69
- 2003
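The Internet currently satisfies the demand for information by providing real-time information, including voice. Building on this real-time capability, users have generated demand for new services, and research is actively under way on using the inexpensive Internet to replace existing networks such as the Public Switched Telephone Network (PSTN). VoIP (Voice over Internet Protocol) is emerging as a representative Internet service that meets this demand, and multifaceted approaches and research on VoIP services based on protocols such as MGCP, SIP, and H.323 are in progress. In this work, we implemented MGCP according to the MGCP (Media Gateway Control Protocol) specification overseen by the IETF, one of several protocols for VoIP services. To ensure the reliability of Internet VoIP for in-home service, we studied an approach that supports the existing PSTN as a backup, and we designed and implemented a system in which specific numbers are dialed out automatically over the existing network, without communicating with the Call Agent (CA) via MGCP and without any manual changes.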
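The routing behavior described in this abstract can be sketched as a small decision function. The abstract does not give the actual rules, so the function name, the set of direct-dial numbers, and the fallback when the Call Agent is unreachable are assumptions made for illustration.

```python
from typing import Set

PSTN = "PSTN"
VOIP = "VOIP"

def route_call(number: str, pstn_direct: Set[str], ca_reachable: bool) -> str:
    """Choose the outgoing path for a dialed number.

    pstn_direct: numbers that always bypass the Call Agent (hypothetical list).
    ca_reachable: whether the MGCP Call Agent currently responds.
    """
    if number in pstn_direct:
        return PSTN  # designated numbers go straight to the existing network
    if not ca_reachable:
        return PSTN  # assumed backup behavior when VoIP signaling is down
    return VOIP      # normal case: signal the call through the Call Agent

# Hypothetical configuration: emergency numbers always use the PSTN.
direct = {"112", "119"}
print(route_call("119", direct, ca_reachable=True))           # PSTN
print(route_call("0212345678", direct, ca_reachable=True))    # VOIP
print(route_call("0212345678", direct, ca_reachable=False))   # PSTN
```

Checking the direct-dial set first means designated numbers never depend on the state of the VoIP signaling path.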

A Study on Connected Word Recognition for the Implementation of a Real-Time Voice Dialing System (실시간 음성 다이얼링 시스템 구현을 위한 연결어 인식에 관한 연구)

김천영;양진우;유형근;이형준;홍진우;이강성;안태옥
- The Journal of the Acoustical Society of Korea
- v.12 no.3
- pp.13-25
- 1993
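This paper is a study of connected word recognition for implementing a voice dialing system. The recognition algorithm is the One-stage DMS/DP algorithm, which uses DMS models to generate the reference patterns, and the vocabulary consists of 150 department names at Kwangwoon University. To process connected word recognition in real time, we built and tested DMS templates in syllable units and in word units. The results show that the best trade-off between real-time operation and recognition rate is obtained with word-unit templates using a 20-section DMS model, for which the multi-speaker-dependent and speaker-independent recognition rates are 97.2% and 86.8%, respectively. Based on these results, a voice dialing model system was implemented using a DSP board containing the TMS320C30 DSP chip, a 486 PC, and a DIAL modem. The total dialing time was about 7 to 14 seconds.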
