통합 검색 | Korea Science

무선 통신망에서 음성인식률 개선을 위한 보상기법 연구 (Compensation Method for Improvement of Speech Recognition in Wireless Communication Network)

서진호;박호종
- 한국음향학회:학술대회논문집
- /
- 한국음향학회 2004년도 추계학술발표대회논문집 제23권 2호
- /
- pp.65-68
- /
- 2004
이동통신 기술의 발전으로 이동통신 사용이 폭발적으로 증가하였고 그에 따라 이동통신망을 이용한 많은 서비스가 제공되고 있다. 이동통신망에서의 음성 인식 서비스에서 음성 인식기에 입력되는 음성신호는 통신망을 통해 음성 압축기를 거치게 되고 이에 음성신호가 왜곡되어 인식기의 인식성능이 저하된다. 본 논문에서는 무선통신 환경에서 음성인식기의 성능을 개선하기 위한 보상 방법을 제안한다. 기존의 제안된 방법은 음성 데이터에 의존하는 방법을 사용하나 본 논문에서는 음성 데이터와는 독립적 방법인 음성 압축기에 의해 손상된 입력 신호의 스펙트럼 보상방법과 Cepstrum 보정방법을 통해 인식률을 향상시키는 방법을 제안한다. 즉, 음성 압축기에 의하여 왜곡된 스펙트럼을 단계적 방법으로 보상하고 그를 토대로 왜곡된 신호에서 만들어진 Cepstrum을 보정하여 음성 인식기의 성능을 향상시키는 방법을 연구하였으며, 그 견과 손상된 음성신호의 인식률 $64.88\%$에 대하여, 본 논문에서 제안하는 보상 방법을 적용한 음성신호의 인식률은 $79.73\%$로서 $14.85\%$가 향상된 결과를 얻을 수 있었다.
PDF

입술정보를 이용한 음성 특징 파라미터 추정 및 음성인식 성능향상 (Estimation of speech feature vectors and enhancement of speech recognition performance using lip information)

민소희;김진영;최승호
- 대한음성학회지:말소리
- /
- 제44호
- /
- pp.83-92
- /
- 2002
Speech recognition performance is severly degraded under noisy envrionments. One approach to cope with this problem is audio-visual speech recognition. In this paper, we discuss the experiment results of bimodal speech recongition based on enhanced speech feature vectors using lip information. We try various kinds of speech features as like linear predicion coefficient, cepstrum, log area ratio and etc for transforming lip information into speech parameters. The experimental results show that the cepstrum parameter is the best feature in the point of reconition rate. Also, we present the desirable weighting values of audio and visual informations depending on signal-to-noiso ratio.
PDF

실험에 의한 음성·음악 분류 특징의 비교 분석 (Comparison & Analysis of Speech/Music Discrimination Features through Experiments)

이경록;류시우;곽재영
- 한국콘텐츠학회:학술대회논문집
- /
- 한국콘텐츠학회 2004년도 추계 종합학술대회 논문집
- /
- pp.308-313
- /
- 2004
본 논문에서는 각 특징 파라미터 조합의 음성/음악 분류 성능을 비교 분석하였다. 음향신호는 3가지(음성, 음악, 음성+음악)로 분류하였다. 본 실험에서는 분류 특징으로 멜캡스트럼, 에너지, 영교차 3가지 형태가 사용되었다. 음성/음악 분류 성능이 가장 좋은 특징간의 상호 조합을 비교 분석하였다. 실험결과 멜캡스트럼, 영교차 조합이 가장 좋은 결과(음성: 95.1%, 음악: 61.9%, 음성+음악: 55.5%)를 보인다는 것을 확인할 수 있었다.
PDF

화자 확인 시스템의 설계 제작 및 성능 분석 (Implementation and Performance Analysis of a Speaker Verification System)

권석규;이병기
- 전자공학회논문지B
- /
- 제30B권3호
- /
- pp.1-9
- /
- 1993
This paper discusses issues on the disign and implementation of real-time automatic speaker verification system, as well as the performance analysis of the implemented system. The system employs TI's TMS320C25 digital signal processor TMS320C25 and high speed SRAMs. The system is designed to be used stand-alone as well as via hand-shaking with IBM-PC. The speech parameters used for speaker verification are PARCOR and LPC-cepstrum coefficients, and the employed decision logics are those based on the generalized weighted distance comcept. The implemented system showed the performance of 5.3% error rate for the PARCOR coefficient, and 4.7% error rate for the LPG-cepstrum coefficient.
PDF

최소 분산 켑스트럼을 이용한 자동차 허브 베어링 결함 검출 (Faults Detection in Hub Bearing with Minimum Variance Cepstrum)

박춘수;최영철;김양한;고을석
- 한국소음진동공학회:학술대회논문집
- /
- 한국소음진동공학회 2004년도 춘계학술대회논문집
- /
- pp.593-596
- /
- 2004
Hub bearings not only sustain the body of a car, but permit wheels to rotate freely. Excessive radial or axial load and many other reasons can cause defects to be created and grown in each component. Therefore, vibration and noise from unwanted defects in outer-race, inner-race or ball elements of a Hub bearing are what we want to detect as early as possible. How early we can detect the faults has to do with how the detection algorithm finds the fault information from measured signal. Fortunately, the bearing signal has periodic impulse train. This information allows us to find the faults regardless how much noise contaminates the signal. This paper shows the basic signal processing idea and experimental results that demonstrate how good the method is.
PDF

말초 청각 계통 모델을 이용한 한국어 모음 인식 (Korean Vowel Recognition using Peripheral Auditory Model)

윤태성;백승화;박상희
- 대한의용생체공학회:의공학회지
- /
- 제9권1호
- /
- pp.1-10
- /
- 1988
In this study, the recognition experiments for Korean vowel are performed using peripheral auditory model. In addition, for the purpose of objective comparison, the recognition experiments are performed by extracting LPC cepstrum coefficients for the same speech data. The results are as follows. 1) The time and the frequency responses of the auditory model show that important features of input signal are involved in the responses of inner ear and auditory nerve. 2) The recognition results for Korean vowel show that the recognition rate by auditory model output is higher than the recognition rate by LPC cepstrum coefficients. 3) The adaptation phenomenon of auditory nerve provides useful characteristics for the discrimination of vowel signal.
PDF

에어컨 실내기에서 발생하는 충격 소음원의 위치 추정 (Source localization of impact noise on an indoor unit of air-conditioner)

최영철;김양한;이종구;김구영
- 한국소음진동공학회:학술대회논문집
- /
- 한국소음진동공학회 2003년도 추계학술대회논문집
- /
- pp.324-329
- /
- 2003
An air-conditioner has various noise sources such as cooling fan noise, pumping noise, flow noise and impact noise. Among these, impact noise is the most unpleasant source. This is because the noise is produced in indoor unit of air-conditioner. To control the noise source effectively, first we must identify the noise sources. When we identify impact noise source, the measurement have to be carried out simultaneously. So we use beamforming method that requires less measurement points than intensity method and acoustic holography. The objective of this paper is to estimate the location of impact source. This objective can be achieved by using minimum variance cepstrum that is able to detect impulse embedded in noise. In this study, modified beamforming method based on cepstrum domain is proposed. Then this method applied to air-conditioner noise sources which produce impact noise.
PDF

Classification of Pathological Voice Signal with Severe Noise Component

Li, Ta-O;Jo, Cheol-Woo
- 음성과학
- /
- 제10권4호
- /
- pp.107-115
- /
- 2003
In this paper we tried to classify the pathological voice signal with severe noise component based on two different parameters, the spectral slope and the ratio of energies in the harmonic and noise components (HNR), The spectral slope is obtained by using a curve fitting method and the HNR is computed in cepstrum quefrency domain. Speech data from normal peoples and patients are collected, diagnosed and divided into three different classes (normal, relatively less noisy and severely noisy data), The mean values and the standard deviations of the spectral slope and the HNR are computed and compared with in the three kinds of data to characterize and classify the severely noisy pathological voice signals from others.
PDF

낮은 차원의 벡터 변환을 통한 음성 변환 (Voice conversion using low dimensional vector mapping)

이기승;도원;윤대희
- 전자공학회논문지S
- /
- 제35S권4호
- /
- pp.118-127
- /
- 1998
In this paper, we propose a voice personality transformation method which makes one person's voice sound like another person's voice. In order to transform the voice personality, vocal tract transfer function is used as a transformation parameter. Comparing with previous methods, the proposed method can obtain high-quality transformed speech with low computational complexity. Conversion between the vocal tract transfer functions is implemented by a linear mapping based on soft clustering. In this process, mean LPC cepstrum coefficients and mean removed LPC cepstrum modeled by the low dimensional vector are used as transformation parameters. To evaluate the performance of the proposed method, mapping rules are generated from 61 Korean words uttered by two male and one female speakers. These rules are then applied to 9 sentences uttered by the same persons, and objective evaluation and subjective listening tests for the transformed speech are performed.
PDF

EMG Pattern Recognition based on Evidence Accumulation for Prosthesis Control

Lee, Seok-Pil;Park, Sand-Hui
- Journal of Electrical Engineering and information Science
- /
- 제2권6호
- /
- pp.20-27
- /
- 1997
We present a method of electromyographic(EMG) pattern recognition to identify motion commands for the control of a prosthetic arm by evidence accumulation with multiple parameters. Integral absolute value, variance, autoregressive(AR) model coefficients, linear cepstrum coefficients, and adaptive cepstrum vector are extracted as feature parameters from several time segments of the EMG signals. Pattern recognition is carried out through the evidence accumulation procedure using the distances measured with reference parameters. A fuzzy mapping function is designed to transform the distances for the application of the evidence accumulation method. Results are presented to support the feasibility of the suggested approach for EMG pattern recognition.
PDF

검색결과 274건 처리시간 0.025초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)