통합 검색 | Korea Science

최소 통계법과 Short-Term 예측계수 코드북을 이용한 Non-Stationary/Mixed 배경잡음 추정 기법 (Non-Stationary/Mixed Noise Estimation Algorithm Based on Minimum Statistics and Codebook Driven Short-Term Predictor Parameter Estimation)

이명석;노명훈;박성주;이석필;김무영
- 한국음향학회지
- /
- 제29권3호
- /
- pp.200-208
- /
- 2010
본 논문에서는 배경잡음에 강인한 잡음제거 알고리즘 설계를 위해서 minimum statistics (MS) 기법을 codebook driven short-term predictor parameter estimation (CDSTP) 기법에 접목하는 방법을 제안한다. MS는 stationary 배경잡음에는 강인하지만, non-stationary 배경잡음에는 상대적으로 취약하다. CDSTP는 non-stationary 배경잡음에 강인한 특성을 보이지만, 코드북에 없는 배경잡음 환경에는 취약하다. 따라서 non-stationary 배경잡음에 강인한 CDSTP 방법과 별도의 코드북 학습 과정이 필요 없는 MS를 결합해서 다양한 배경잡음에 강인한 알고리즘을 제안한다. 제안방법은 MS나 CDSTP 방법에 비해서 전체적으로 향상된 perceptual evaluation of speech quality (PESQ) 성능을 나타냈으며, 특히 stationary 배경잡음과 non-stationary 배경잡음이 섞여 있는 mixed 배경잡음 환경에서 강인한 특성을 보였다.
https://doi.org/10.7776/ASK.2010.29.3.200 인용 PDF KSCI

조건 사후 최대 확률 기반 최소값 제어 재귀평균기법을 이용한 음성향상 (Speech Enhancement Based on Minima Controlled Recursive Averaging Technique Incorporating Conditional MAP)

금종모;박윤식;장준혁
- 한국음향학회지
- /
- 제27권5호
- /
- pp.256-261
- /
- 2008
본 논문에서는 기존의 최소값 제어 재귀 평균기법(minima controlled recursive averaging, MCRA) 알고리즘에 조건 사후 최대 확률 (maximun a posteriori, MAP)을 적용한 음성향상을 제안한다. 기존의 MCRA는 파워스펙트럼에 평균을 취하고 각 서브밴드에서 음성 신호 존재 확률로 조절하는 스무딩 매개변수를 사용한다. 본 논문에서 제안된 알고리즘은 현재 프레임에 들어온 신호가 이전 프레임에서의 음성의 존재와 부재에 대한 조건을 부여해 주어 음성 신호 존재확률을 수정하여 음성향상에 적용한다. 제안된 음성 향상은 ITU-T P.862 perceptual evaluation of speech quality (PESQ)와 주관적 음질평가를 이용하여 평가하였고 기존의 MCRA 방법보다 향상된 결과를 나타내었다.
https://doi.org/10.7776/ASK.2008.27.5.256 인용 PDF KSCI

이중 분기 디코더를 사용하는 복소 중첩 U-Net 기반 음성 향상 모델 (Complex nested U-Net-based speech enhancement model using a dual-branch decoder)

황서림;박성욱;박영철
- 한국음향학회지
- /
- 제43권2호
- /
- pp.253-259
- /
- 2024
본 논문에서는 이중 분기 디코더를 갖는 복소 중첩 U-Net 기반의 새로운 음성 향상 모델을 제안하였다. 제안된 모델은 음성 신호의 크기와 위상 성분을 동시에 추정할 수 있도록 복소 중첩 U-Net으로 구성되며, 디코더는 스펙트럼 사상과 시간 주파수 마스킹을 각각의 분기에서 수행하는 이중 분기 디코더 구조를 갖는다. 이때, 이중 분기 디코더 구조는 단일 디코더 구조에 비하여, 음성 정보의 손실을 최소화하면서 잡음을 효과적으로 제거할 수 있도록 한다. 실험은 음성 향상 모델 학습을 위해 보편적으로 사용되는 VoiceBank + DEMAND 데이터베이스 상에서 이루어졌으며, 다양한 객관적 평가 지표를 통해 평가되었다. 실험 결과, 이중 분기 디코더를 사용하는 복소 중첩 U-Net 기반 음성 향상 모델은 기존의 베이스라인과 비교하여 Perceptual Evaluation of Speech Quality(PESQ) 점수가 0.13가량 증가하였으며, 최근 제안된 음성 향상 모델들보다도 높은 객관적 평가 점수를 보였다.
https://doi.org/10.7776/ASK.2024.43.2.253 인용 PDF

VoIP 코더들의 프레임손실은닉 알고리즘 성능평가 (Performance Evaluation of Frame Erasure Concealment Algorithms in VoIP Coders)

한승호;문광;한민수
- 대한음성학회:학술대회논문집
- /
- 대한음성학회 2004년도 춘계 학술대회 발표논문집
- /
- pp.235-238
- /
- 2004
Frame erasures cause speech quality degradation in wireless communication networks or packet networks. The degradation becomes worse when consecutive frame erasures occur. Speech coders have a frame erasure concealment(FEC) mechanism to compensate for frame erasures. It is meaningful to evaluate the performance of FEC mechanisms for frame erasures that occur in communications networks. In this paper, various frame erasures are designed. And the FEC algorithms of speech coders are evaluated and analyzed with the Perceptual Evaluation of Speech Quality(PESQ). It is found that the performances vary in accordance with frame erasure types, frame erasure rates, and utterance lengths.
PDF

파형보간 코더에서 파라미터간 거리차를 이용한 가변비트율 기법 (A New Variable Bit Rate Scheme for Waveform Interpolative Coders)

양희식;정상배;한민수
- 대한음성학회지:말소리
- /
- 제65호
- /
- pp.81-91
- /
- 2008
In this paper, we propose a new variable bit-rate speech coder based on the waveform interpolation concept. After the coder extracted all parameters, the amounts of the distortions between the current and the predicted parameters which are estimated by extrapolation using past two parameters are measured for all parameters. A parameter would not be transmitted unless the distortion exceeds the preset threshold. At the decoder side, the non-transmitted parameter is reconstructed by extrapolation with past two parameters used to synthesize signals. In this way, we can reduce 26% of the total bit rate while retaining the speech quality degradation below 0.1 PESQ score.
PDF

A Single Channel Speech Enhancement for Automatic Speech Recognition

이진규;서현손;강홍구
- 한국방송∙미디어공학회:학술대회논문집
- /
- 한국방송공학회 2011년도 하계학술대회
- /
- pp.85-88
- /
- 2011
This paper describes a single channel speech enhancement as the pre-processor of automatic speech recognition system. The improvements are based on using optimally modified log-spectra (OM-LSA) gain function with a non-causal a priori signal-to-noise ratio (SNR) estimation. Experimental results show that the proposed method gives better perceptual evaluation of speech quality score (PESQ) and lower log-spectral distance, and also better word accuracy. In the enhancement system, parameters was turned for automatic speech recognition.
PDF

Improved Single Channel Speech Enhancement Algorithm Using Adaptive Postfiltering

송은우;강홍구
- 한국방송∙미디어공학회:학술대회논문집
- /
- 한국방송공학회 2011년도 하계학술대회
- /
- pp.122-125
- /
- 2011
In real environment, background noise exists everywhere and degrades the performance of system. To reduce this distortion, a speech enhancement algorithm can be very useful and variety methods have been proposed. In this paper, we propose a postfilter to improve the performance of optimally modified log-spectral amplitude (OM-LSA) estimator. Proposed algorithm uses the formant postfilter to minimize perceptual distortion caused by background noise. We adjust an emphasizing parameter which is varied by spectral flatness and first reflection coefficient. The performance of the proposed algorithm is evaluated by measuring the log-spectral distance (LSD) and the perceptual evaluation of speech quality (PESQ) score. The test results show the improvement of proposed algorithm compared to conventional OM-LSA.
PDF

스마트TV향 VoIP 컨퍼런스 기능을 위한 잡음제거 알고리즘의 성능비교 (Comparison of Noise Reduction Algorithm for Smart TV in VoIP Conference Facility)

서광덕;최홍재;김형국
- 한국방송∙미디어공학회:학술대회논문집
- /
- 한국방송공학회 2011년도 하계학술대회
- /
- pp.482-483
- /
- 2011
본 논문에서는 스마트TV향 VoIP(Voice over Internet Protocol) 컨퍼런스 기능을 위한 잡음제거 알고리즘의 성능비교 하였다. 기존에 연구 되어져 있는 Improved Minima Controlled Recursive Averaging(IMCRA)방식과 Gaussian분포 기반의 잡음제거 알고리즘, IMCRA방식과 Gamma분포 기반의 잡음제거 알고리즘, IMCRA방식과 Mel-filter를 적용한 잡음제거 알고리즘, R&L 알고리즘들의 방식을 비교하였으며, 성능 비교를 위해 각 알고리즘을 통해 나온 다양한 잡음 환경에서의 잡음이 제거된 신호의 PESQ와 연산속도를 비교한다.
PDF

광대역 VoIP 기반 고품질 음성통화를 위한 음성패킷 재생 스케줄링 방식 (Voice Packet Playout Scheduling for High Quality Voice Communication Based on Wide Band VoIP)

최홍재;김형국
- 한국멀티미디어학회:학술대회논문집
- /
- 한국멀티미디어학회 2012년도 춘계학술발표대회논문집
- /
- pp.353-354
- /
- 2012
광대역 VoIP 네트워크 환경에서는 불안정한 네트워크 환경으로 인해 음성패킷이 불규칙적으로 수신되어 음성데이터의 재생이 원활하지 못하다. 이러한 문제점을 해결하기 위해 본 논문에서는 네트워크 상태에 따라 원활하게 음성패킷을 재생시키는 스케줄링 방식을 제안한다. 제안하는 방식은 수신단에 도착한 패킷 헤더정보를 이용해 네트워크 지터를 추정하고, 추정된 지터와 지터버퍼와 음성프레임버퍼에 존재하는 패킷수 및 음성프레임 개수, 음성클래스정보에 따라 음성프레임의 길이를 변화시켜 재생시킴으로써 수신단의 버퍼링 지연을 줄이고 출력신호의 음성왜곡을 최소화한다. 제안하는 스케줄링 방식의 성능측정을 위해 버퍼링 지연과 PESQ를 기존 음성패킷 재생 스케줄링 방식과 비교한다.
PDF

저 전송률 음성 부호화기를 위한 여기 신호 개선 알고리즘에 관한 연구 (Enhancement of Excitation in Low-bit-rate Speech Coders)

이미숙;김홍국;최승호;김도영
- 대한전자공학회:학술대회논문집
- /
- 대한전자공학회 2003년도 신호처리소사이어티 추계학술대회 논문집
- /
- pp.57-60
- /
- 2003
In this paper, we propose a new excitation enhancement technique to improve the speech quality of low bit rate speech coders. The proposed technique is based on a harmonic model and it is employed only in the decoding process of speech coders without any additional bits. We develop the procedure of harmonic model parameters estimation and harmonic generation. and apply the technique to a current state of the art low bit rate speech coder, ITU-T G.729 Annex D. Also its performance is measured by using the ITU-T P.862 PESQ score and compared to those of the phase dispersion filter and the long-term postfilter applied to the decoded excitation. It is shown that the proposed excitation enhancement technique can improve the quality of decoded speech and provide better quality for male speech than other techniques.
PDF

검색결과 84건 처리시간 0.027초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)