• 제목/요약/키워드: Spectral enhancement

검색결과 208건 처리시간 0.028초

음성부호화기에서의 잡음제거 방식 비교 (Comparison of Noise Suppression Methods in Voice CODEC)

  • 이진걸;기훈재
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 1998년도 추계종합학술대회 논문집
    • /
    • pp.1203-1206
    • /
    • 1998
  • Considerable research in the last three decades has examined the problem of enhancement of speech degraded by additive background noise. We compare traditional methods such as spectral subtraction and Wiener filter, recently proposed psychoacoustic model based methods such as perceptual filter and noise suppression in EVRC in terms of performance and complexity.

  • PDF

Noise Reduction Using the Standard Deviation of the Time-Frequency Bin and Modified Gain Function for Speech Enhancement in Stationary and Nonstationary Noisy Environments

  • Lee, Soo-Jeong;Kim, Soon-Hyob
    • The Journal of the Acoustical Society of Korea
    • /
    • 제26권3E호
    • /
    • pp.87-96
    • /
    • 2007
  • In this paper we propose a new noise reduction algorithm for stationary and nonstationary noisy environments. Our algorithm classifies the speech and noise signal contributions in time-frequency bins, and is not based on a spectral algorithm or a minimum statistics approach. It relies on calculating the ratio of the standard deviation of the noisy power spectrum in time-frequency bins to its normalized time-frequency average. We show that good quality can be achieved for enhancement speech signal by choosing appropriate values for ${\delta}_t\;and\;{\delta}_f$. The proposed method greatly reduces the noise while providing enhanced speech with lower residual noise and somewhat higher mean opinion score (MOS), background intrusiveness (BAK) and signal distortion (SIG) scores than conventional methods.

MMSE-STSA 기반의 음성개선 기법에서 잡음 및 신호 전력 추정에 사용되는 파라미터 값의 변화에 따른 잡음음성의 인식성능 분석 (Performance Analysis of Noisy Speech Recognition Depending on Parameters for Noise and Signal Power Estimation in MMSE-STSA Based Speech Enhancement)

  • 박철호;배건성
    • 대한음성학회지:말소리
    • /
    • 제57호
    • /
    • pp.153-164
    • /
    • 2006
  • The MMSE-STSA based speech enhancement algorithm is widely used as a preprocessing for noise robust speech recognition. It weighs the gain of each spectral bin of the noisy speech using the estimate of noise and signal power spectrum. In this paper, we investigate the influence of parameters used to estimate the speech signal and noise power in MMSE-STSA upon the recognition performance of noisy speech. For experiments, we use the Aurora2 DB which contains noisy speech with subway, babble, car, and exhibition noises. The HTK-based continuous HMM system is constructed for recognition experiments. Experimental results are presented and discussed with our findings.

  • PDF

A Single Channel Speech Enhancement for Automatic Speech Recognition

  • 이진규;서현손;강홍구
    • 한국방송∙미디어공학회:학술대회논문집
    • /
    • 한국방송공학회 2011년도 하계학술대회
    • /
    • pp.85-88
    • /
    • 2011
  • This paper describes a single channel speech enhancement as the pre-processor of automatic speech recognition system. The improvements are based on using optimally modified log-spectra (OM-LSA) gain function with a non-causal a priori signal-to-noise ratio (SNR) estimation. Experimental results show that the proposed method gives better perceptual evaluation of speech quality score (PESQ) and lower log-spectral distance, and also better word accuracy. In the enhancement system, parameters was turned for automatic speech recognition.

  • PDF

High Frequency Enhancement of Sound Using Wavelet Transform

  • Yoon Won-Jung;Lee Kang-Kyu;Park Kyu-Sik
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2004년도 ICEIC The International Conference on Electronics Informations and Communications
    • /
    • pp.233-236
    • /
    • 2004
  • This paper proposes new method for the enhancement of nonexistent high frequency spectral contents from low sample rate audio signal. For example, Due to the protocol constraint, the audio bandwidth of MP3 is restricted to 16Khz. Although band-restricted MP3 audio provide savings of storage space and network bandwidth, it suffers a major problem of a loss in high frequency fidelity such as localization, ambient information, and bright nature of audio. This paper provides a new mathematical analysis for the adaptive estimation of the high frequency contents based on the nature of the input low sample rate audio. Proposed method can be worked globally to any kind of audio such as speech and music that are restricted by sampling rate and bandwidth.

  • PDF

CASA 기반의 마이크간 전달함수 비 추정 알고리즘 (CASA Based Approach to Estimate Acoustic Transfer Function Ratios)

  • 신민규;고한석
    • 한국음향학회지
    • /
    • 제33권1호
    • /
    • pp.54-59
    • /
    • 2014
  • 본 논문은 비정상 (nonstationary)특성을 가지는 잡음환경에서 마이크간 전달함수 비 (RTF, Relative Transfer Function) 추정 알고리즘을 제안한다. 음성을 이용한 다양한 기기에 다중 마이크를 이용한 잡음제거 기술은 널리 사용되며, 이때 각 마이크간의 입력 신호 사이의 관계는 필수적으로 추정되어야 한다. 본 논문에서는 기존의 OM-LSA(Optimally-Modified Log-Spectral Amplitude)기반의 추정 방식에 CASA (Computational Auditory Scene Analysis)를 접목시킨 방식을 제안한다. 제안한 방법의 성능 검증을 위하여 비정상 백색 잡음 (nonstationary white Gaussian noise) 환경에서 10명 화자 발음을 이용한 마이크간 전달함수 비 추정 성능 평가 실험을 수행하였다. 잡음 신호가 초당 8dB 증감하는 환경에서 SBF (Signal Blocking Factor)가 평균 2.65dB 개선됨을 확인하였다.

신호부각에 의한 신호 부공간 회전을 이용한 광대역 인코히어런트 신호의 공간 스펙트럼 추정 (Spatial Spectrum Estimation of Broadband Incoherent Signals using Rotation of Signal Subspace Via Signal Enhancement)

  • 김영수;이계산;김정근
    • 한국전자파학회논문지
    • /
    • 제15권7호
    • /
    • pp.669-676
    • /
    • 2004
  • 등 간격 선형 어레이로 입사하는 광대역 인코히어런트 신호의 도래각을 효율적으로 추정하는 새로운 알고리즘을 제안한다. 변환행렬을 구성하기 위하여 CSM 방법이 초기 추정각을 요구하는 반면에 제안된 방법은 전혀 초기 추정각을 필요로 하지 않는다. 이 방법의 연산과정은 먼저 신호부각 방법에 의하여 중심주파수에서의 신호 부공간을 추정 한 다음 신호 부공간 회전 방법을 통한 직교변환행렬을 구성하는 것이다. 시뮬레이션 결과 제안된 방법이 CSM 방법보다 표본바이어스 면에서 우수한 성능을 제공함을 알 수 있었다.

잡음 환경 분류 알고리즘을 이용한 IMCRA 기반의 음성 향상 기법 (Speech Enhancement Based on IMCRA Incorporating noise classification algorithm)

  • 송지현;박규석;안홍섭;이상민
    • 전기학회논문지
    • /
    • 제61권12호
    • /
    • pp.1920-1925
    • /
    • 2012
  • In this paper, we propose a novel method to improve the performance of the improved minima controlled recursive averaging (IMCRA) in non-stationary noisy environment. The conventional IMCRA algorithm efficiently estimate the noise power by averaging past spectral power values based on a smoothing parameter that is adjusted by the signal presence probability in frequency subbands. Since the minimum of smoothing parameter is defined as 0.85, it is difficult to obtain the robust estimates of the noise power in non-stationary noisy environments that is rapidly changed the spectral characteristics such as babble noise. For this reason, we proposed the modified IMCRA, which adaptively estimate and updata the noise power according to the noise type classified by the Gaussian mixture model (GMM). The performances of the proposed method are evaluated by perceptual evaluation of speech quality (PESQ) and composite measure under various environments and better results compared with the conventional method are obtained.

Charge-Transfer Complexing Properties of 1-Methyl Nicotinamide and Adenine in Relation to the Intramolecular Interaction in Nicotinamide Adenine Dinucleotide (NAD$^+$)

  • Park, Joon-woo;Paik, Young-Hee
    • Bulletin of the Korean Chemical Society
    • /
    • 제6권1호
    • /
    • pp.23-29
    • /
    • 1985
  • The charge-transfer complexing properties of 1-methyl nicotinamide (MNA), an acceptor, and adenine, a donor, were investigated in water and SDS micellar solutions in relation to the intramolecular interaction in nicotinamide adenine dinucleotide ($NAD^+$). The spectral and thermodynamic parameters of MNA-indole and methyl viologen-adenine complex formations were determined, and the data were utilized to evaluate the charge-transfer abilities of MNA and adenine. The electron affinity of nicotinamide was estimated to be 0.28 eV from charge-transfer energy $of{\sim}300$ nm for MNA-indole. The large enhancement of MNA-indole complexation in SDS solutions by entropy effect was attributed to hydrophobic nature of indole. The complex between adenine and methyl viologen showed an absorption band peaked near 360 nm. The ionization potential of adenine was evaluated to be 8.28 eV from this. The much smaller enhancement of charge-transfer interaction involving adenine than that of indole in SDS solutions was attributed to weaker hydrophobic nature of the donor. The charge-transfer energy of 4.41 eV (280 nm) was estimated for nicotinamide-adenine complex. The spectral behaviors of $NAD^+$ were accounted to the presence of intramolecular interaction in $NAD^+$, which is only slightly enhanced in SDS solutions. The replacement of nicotinamide-adenine interaction in $NAD^+$ by intermolecular nicotinamide-indole interaction in enzyme bound $NAD^+$, and guiding role of adenine moiety in $NAD^+$ were discussed.