• 제목/요약/키워드: Speech Enhancement

검색결과 340건 처리시간 0.031초

인간의 청각모델에 기초한 잡음환경에 적응된 잡음억압 시스템 (Adaptive Noise Suppression system based on Human Auditory Model)

  • 최재승
    • 한국정보통신학회:학술대회논문집
    • /
    • 한국해양정보통신학회 2008년도 춘계종합학술대회 A
    • /
    • pp.421-424
    • /
    • 2008
  • 본 논문에서는 다양한 배경잡음에 의해 열화된 음성을 강조하기 위하여 청각모델에 기초로 한 잡음환경에 적응된 잡음억압 시스템을 제안한다. 제안한 시스템은 먼저 유성음과 무성음의 구간을 검출한 후, 각 입력 프레임에서 적응적인 청각기강의 처리를 한다. 마지막으로 진폭성분과 위상성분이 포함된 신경회로망을 사용하여 잡음신호를 제거한 후에 음성을 강조하는 처리를 한다. 본 시스템은 신호대잡음비의 평가방법을 통하여 다양한 잡음에 의해서 열화된 음성신호에 대해서 유효하다는 것을 실험으로 확인한다.

  • PDF

TMS320C30을 이용한 단일채널 적응잡음제거기 구현 (Implementation of the single channel adaptive noise canceller using TMS320C30)

  • 정성윤;우세정;손창희;배건성
    • 음성과학
    • /
    • 제8권2호
    • /
    • pp.73-81
    • /
    • 2001
  • In this paper, we focus on the real time implementation of the single channel adaptive noise canceller(ANC) by using TMS320C30 EVM board. The implemented single channel adaptive noise canceller is based on a reference paper [1] in which it is simulated by using the recursive average magnitude difference function(AMDF) to get a properly delayed input speech on a sample basis as a reference signal and normalized least mean square(NLMS) algorithm. To certify results of the real time implementation, we measured the processing time of the ANC and enhancement ratio according to various signalto-noise ratios(SNRs). Experimental results demonstrate that the processing time of the speech signal of 32ms length with delay estimation of every 10 samples is about 26.3 ms, and almost the same performance as given in [1] is obtained with the implemented system.

  • PDF

Split Model Speech Analysis Techniques for Speech Signal Enhancement

  • Park, Young-Ho;You, Kwang-Bock;Bae, Myung-Jin
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 1999년도 추계종합학술대회 논문집
    • /
    • pp.1135-1138
    • /
    • 1999
  • In this paper, The Split Model Analysis Algorithm, which can generate the wideband speech signal from the spectral information of narrowband signal, is developed. The Split Model Analysis Algorithm deals with the separation of the 10$\^$th/ order LPC model into five cascade-connected 2$\^$nd/ order model. The use of the less complex 2$\^$nd/ order models allows for the exclusion of the complicated nonlinear relationships between model parameters and all the poles of the LPC model. The relationships between the model parameters and its corresponding analog poles is proved and applied to each 2$\^$nd/ order model. The wideband speech signal is obtained by changing only the sampling rate.

  • PDF

잡음 환경하에서의 PSO-NCM을 이용한 거절기능 성능 향상 (Enhancement of Rejection Performance using the PSO-NCM in Noisy Environment)

  • 김병돈;송민규;최승호;김진영
    • 음성과학
    • /
    • 제15권4호
    • /
    • pp.85-96
    • /
    • 2008
  • Automatic speech recognition has severe performance degradation under noisy environments. To cope with the noise problem, many methods have been proposed. Most of them focused on noise-robust features or model adaptation. However, researchers have overlooked utterance verification (UV) under noisy environments. In this paper we discuss UV problems based on the normalized confidence measure. First, we show that UV performance is also degraded in noisy environments with the experiments of an isolated word recognition. Then we observe how the degradation of UV performances is suffered. Based on the UV experiments we propose a modeling method of the statistics of phone confidences using sigmoid functions. For obtaining the parameters of the sigmoidal models, the particle swarm optimization (PSO) is adopted. The proposed method improves 20% rejection performance. Our experimental results show that the PSO-NCM can apply noise speech recognition successfully.

  • PDF

피치예측과 점진적 복원 기법을 이용한 EVRC 음질개선 (EVRC Speech Quality Enhancement Using Pitch Prediction and Gradual Increase of the Decoded Speech)

  • 민병준;김재원
    • 한국음향학회지
    • /
    • 제18권6호
    • /
    • pp.38-43
    • /
    • 1999
  • SK Telecom에서 현재 서비스중인 EVRC 보코더는 유선전화 수준의 음질을 제공하는 우수한 음성 부호화기이나, 약전계에서 급격한 음질 저하를 보인다. 본 논문에서는 실제 서비스 상황에서 발생하는 EVRC 보코더의 음질 저하 현상 및 그 원인을 분석하였고, 해결책으로 피치 예측과 점진적 복원 기법을 제안하였다. 다양한 전파환경에 대한 음질 평가방법으로 선호도 실험을 수행하였고, 제안한 방법이 효과적임을 확인하였다.

  • PDF

Particle Swarm 기반 최적화 멤버쉽 함수에 의한 잡음 환경에서의 화자인식 성능향상 (Performance Enhancement of Speaker Identification in Noisy Environments by Optimization Membership Function Based on Particle Swarm)

  • 민소희;송민규;나승유;김진영
    • 음성과학
    • /
    • 제14권2호
    • /
    • pp.105-114
    • /
    • 2007
  • The performance of speaker identifier is severely degraded in noisy environments. A study suggested the concept of observation membership for enhancing performances of speaker identifier with noisy speech [1]. The method scaled observation probabilities of input speech by observation identification values decided by SNR. In the paper [1], the authors suggested heuristic parameter values for membership function. In this paper we attempt to apply particle swarm optimization (PSO) for obtaining the optimal parameters for speaker identification in noisy environments. With the speaker identification experiments using the ETRI database we prove that the optimization approach can yield better performance than using only the original membership function.

  • PDF

잡음제거 특성을 갖는 웨이브릿변환 기반 서브밴드 적응 음향반향제거기 (The Wavelet Transform Based Subband Adaptive Acoustic Echo Canceller with Noise Cancellation Property)

  • 박재우;안주원;권기룡;문광석;김강언
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2000년도 제13회 신호처리 합동 학술대회 논문집
    • /
    • pp.7-10
    • /
    • 2000
  • This paper focuses on the development of speech enhancement techniques for hands-free audio terminals, including two major problems : noise cancellation and acoustic echo cancellation. The objective is to find a joint structure to get a near-end speech signal with minimum distortion and low levels of echo and noise. To solve the two problems, a new promising technique is studied and tested in computer simulation conditions.

  • PDF

A New Formulation of Multichannel Blind Deconvolution: Its Properties and Modifications for Speech Separation

  • Nam, Seung-Hyon;Jee, In-Nho
    • The Journal of the Acoustical Society of Korea
    • /
    • 제25권4E호
    • /
    • pp.148-153
    • /
    • 2006
  • A new normalized MBD algorithm is presented for nonstationary convolutive mixtures and its properties/modifications are discussed in details. The proposed algorithm normalizes the signal spectrum in the frequency domain to provide faster stable convergence and improved separation without whitening effect. Modifications such as nonholonomic constraints and off-diagonal learning to the proposed algorithm are also discussed. Simulation results using a real-world recording confirm superior performanceof the proposed algorithm and its usefulness in real world applications.

확산망을 이용한 음성인식 (The Speech Recognition Using the Diffusion Network)

  • 허만택
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1996년도 영남지부 학술발표회 논문집 Acoustic Society of Korean Youngnam Chapter Symposium Proceedings
    • /
    • pp.70-75
    • /
    • 1996
  • In this paper, the pre-precessing method for the recognition of single vowels by use of spectrum envelope is presented , we use new method of an extrating spectrum envelope using the diffusion filter bank. We reduced the total processing time, and got higher enhancement of discrimination . By getting 88.3% of average recognition rate for single vowels of real voice through computer simulation, we confirmed it to be useful for speech recongition which use spectrum analysis for voice signal to have many frequency components.

  • PDF

스펙트럴 서브트렉션과 비동기 KLT 잡음 감소 기법의 조합에 의한 음성 인식 성능 개선 (Improvement of the ASR Robustness using Combinations of Spectral Subtraction and KLT-based Adaptive Comb-filtering)

  • 박성준
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2003년도 5월 학술대회지
    • /
    • pp.207-210
    • /
    • 2003
  • In this paper, the combinations of speech enhancement techniques are experimented. Specifically, the spectral subtraction, KLT based comb-filtering, and their combinations are applied to the Aurora2 database. The results show that recognition accuracy is improved when KLT based comb-filtering is applied after spectral subtraction.

  • PDF