Search | Korea Science

An Adaptive Wind Noise Reduction Method Based on a priori SNR Estimation for Speech Eenhancement (음성 강화를 위한 a priori SNR 추정기반 적응 바람소리 저감 방법)

Seo, Ji-Hun;Lee, Seok-Pil
- The Transactions of The Korean Institute of Electrical Engineers
- /
- v.64 no.12
- /
- pp.1756-1760
- /
- 2015
This paper focuses on a priori signal to noise ratio (SNR) estimation method for the speech enhancement. There are many researches for speech enhancement with several ambient noise cancellation methods. The method based on spectral subtraction (SS) which is widely used in noise reduction has a trade-off between the performance and the distortion of the signals. So the need of adaptive method like an estimated a priori SNR being able to making a high performance and low distortion is increasing. The decision directed (DD) approach is used to determine a priori SNR in noisy speech signals. A priori SNR is estimated by using only the magnitude components and consequently follows a posteriori SNR with one frame delay. We propose a modified a priori SNR estimator and the weighted rational transfer function for speech enhancement with wind noises. The experimental result shows the performance of our proposed estimator is better Perceptual Evaluation of Speech Quality scores (PESQ, ITU-T P.862) compare to the conventional DD approach-based systems and different noise reduction methods.
https://doi.org/10.5370/KIEE.2015.64.12.1756 인용 PDF KSCI

SNR Enhancement Algorithm Using Multiple Chirp Symbols with Clock Drift for Accurate Ranging

Jang, Seong-Hyun;Kim, Yeong-Sam;Yoon, Sang-Hun;Chong, Jong-Wha
- ETRI Journal
- /
- v.33 no.6
- /
- pp.841-848
- /
- 2011
A signal-to-noise ratio (SNR) enhancement algorithm using multiple chirp symbols with clock drift is proposed for accurate ranging. Improvement of the ranging performance can be achieved by using the multiple chirp symbols according to Cramer-Rao lower bound; however, distortion caused by clock drift is inevitable practically. The distortion induced by the clock drift is approximated as a linear phase term, caused by carrier frequency offset, sampling time offset, and symbol time offset. SNR of the averaged chirp symbol obtained from the proposed algorithm based on the phase derotation and the symbol averaging is enhanced. Hence, the ranging performance is improved. The mathematical analysis of the SNR enhancement agrees with the simulations.
https://doi.org/10.4218/etrij.11.0111.0013 인용 PDF KSCI

Speech Enhancement Using Phase-Dependent A Priori SNR Estimator in Log-Mel Spectral Domain

Lee, Yun-Kyung;Park, Jeon Gue;Lee, Yun Keun;Kwon, Oh-Wook
- ETRI Journal
- /
- v.36 no.5
- /
- pp.721-729
- /
- 2014
We propose a novel phase-based method for single-channel speech enhancement to extract and enhance the desired signals in noisy environments by utilizing the phase information. In the method, a phase-dependent a priori signal-to-noise ratio (SNR) is estimated in the log-mel spectral domain to utilize both the magnitude and phase information of input speech signals. The phase-dependent estimator is incorporated into the conventional magnitude-based decision-directed approach that recursively computes the a priori SNR from noisy speech. Additionally, we reduce the performance degradation owing to the one-frame delay of the estimated phase-dependent a priori SNR by using a minimum mean square error (MMSE)-based and maximum a posteriori (MAP)-based estimator. In our speech enhancement experiments, the proposed phase-dependent a priori SNR estimator is shown to improve the output SNR by 2.6 dB for both the MMSE-based and MAP-based estimator cases as compared to a conventional magnitude-based estimator.
https://doi.org/10.4218/etrij.14.2214.0039 인용 PDF KSCI KPUBS

Two-step a priori SNR Estimation in the Log-mel Domain Considering Phase Information (위상 정보를 고려한 로그멜 영역에서의 2단계 선험 SNR 추정)

Lee, Yun-Kyung;Kwon, Oh-Wook
- Phonetics and Speech Sciences
- /
- v.3 no.1
- /
- pp.87-94
- /
- 2011
The decision directed (DD) approach is widely used to determine a priori SNR from noisy speech signals. In conventional speech enhancement systems with a DD approach, a priori SNR is estimated by using only the magnitude components and consequently follows a posteriori SNR with one frame delay. We propose a phase-dependent two-step a priori SNR estimator based on the minimum mean square error (MMSE) in the log-mel spectral domain so that we can consider both magnitude and phase information, and it can overcome the performance degradation caused by one frame delay. From the experimental results, the proposed estimator is shown to improve the output SNR of enhanced speech signals by 2.3 dB compared to the conventional DD approach-based system.
PDF

A Novel Approach to a Robust A Priori SNR Estimator in Speech Enhancement (음성 향상에서 강인한 새로운 선행 SNR 추정 기법에 관한 연구)

Park, Yun-Sik;Chang, Joon-Hyuk
- The Journal of the Acoustical Society of Korea
- /
- v.25 no.8
- /
- pp.383-388
- /
- 2006
This Paper presents a novel approach to single channel microphone speech enhancement in noisy environments. Widely used noise reduction techniques based on the spectral subtraction are generally expressed as a spectral gam depending on the signal-to-noise ratio (SNR). The well-known decision-directed(DD) estimator of Ephraim and Malah efficiently reduces musical noise under the background noise conditions, but generates the delay of the a prioiri SNR because the DD weights the speech spectrum component of the Previous frame in the speech signal. Therefore, the noise suppression gain which is affected by the delay of the a priori SNR, which is estimated by the DD matches the previous frame rather than the current one, so after noise suppression. this degrades the noise reduction performance during speech transient periods. We propose a computationally simple but effective speech enhancement technique based on the sigmoid type function for the weight Parameter of the DD. The proposed approach solves the delay problem about the main parameter, the a priori SNR of the DD while maintaining the benefits of the DD. Performances of the proposed enhancement algorithm are evaluated by ITU-T p.862 Perceptual Evaluation of Speech duality (PESQ). the Mean Opinion Score (MOS) and the speech spectrogram under various noise environments and yields better results compared with the fixed weight parameter of the DD.
https://doi.org/10.7776/ASK.2006.25.8.383 인용 PDF KSCI

Speech enhancement using psychoacoustics model (사이코어쿠스틱스 모델을 이용한 음성 향상)

Kwon, Chul-Hyun;Shin, Dae-Kyu;Park, Sang-Hui
- Proceedings of the KIEE Conference
- /
- 1999.11c
- /
- pp.748-750
- /
- 1999
In this study, a speech enhancement is presented based on the utilization of well-known auditory mechanism, noise masking. The speech enhancement approach adopted here is to derive an modifier that achieves audible noise suppression. This modification selectively affects the perceptually significant spectral values, and is therefore less prone to introduction of unwanted distortions than methods that affect the complete STSA and produces more enhanced results at low SNR as well as at high SNR. The speech enhancement method adopted here needs exact estimation of the minimum specteal value per critical band because it uses only the minimum spectral value per critical band. For this, the method adopted here uses the modified spectral subtraction that is more flexible than power spectral subtraction. So, the result in experiment represented better SNR than before.
PDF

Evaluation of Usefulness of IDEAL(Iterative decomposition of water and fat with echo asymmetry and least squares estimation) Technique in 3.0T Breast MRI (3.0T 자기공명영상을 이용한 유방 검사시 IDEAL기법의 유용성 평가)

Cho, Jae-Hwan
- Journal of Digital Contents Society
- /
- v.11 no.2
- /
- pp.217-224
- /
- 2010
The purpose of this study was to examine the usefulness of IDEAL technique in breast MRI by performing a quantitative comparative analysis in patients diagnosed with DCIS. On a 3.0T MR scanner, fat-suppressed T2-weighted images and T1-weighted images before and after contrast enhancement were obtained from 20 patients histologically diagnosed with ductal carcinoma in situ (DCIS). The findings from the quantitative image analysis are the following: 1) On T2-weighted images, SNR were not significantly different in the lesion area itself between the CHESS and IDEAL groups, while the IDEAL group showed higher SNR at the ductal area and fat area than the CHESS group. In addition, the CNR were higher for the IDEAL group in those regions. 2) On T1-weighted images before enhancement, SNR were not significantly different in the lesion area itself between the CHESS and IDEAL groups, while the IDEAL group showed higher SNR at the ductal area and fat area than the CHESS group. In addition, the CNR were higher for the IDEAL group in those regions. 3) On T1-weighted images after enhancement, SNR were not significantly different in the lesion area itself between the CHESS and IDEAL groups, while the IDEAL group showed higher SNR at the ductal area and fat area than the CHESS group.
PDF KSCI

Performance Enhancement of Decision Directed SNR Estimation by Correction Scheme of SNR Estimation Error (결정지향 SNR 추정방식에서의 추정오차 보정기법을 통한 SNR 추정성능개선)

Kwak, Jae-Min
- Journal of Advanced Navigation Technology
- /
- v.16 no.6
- /
- pp.982-987
- /
- 2012
In this paper, the SNR estimation error of Decision Directed SNR estimation method in AWGN is investigated, which uses samples received in reference decision region. In communication system receiver, when SNR estimation scheme using error vectors between ideal sample points and received sample points of reference region is adopted, the samples contain incorrectly received samples due to AWGN. Consequently, the mean of estimated reference constellation point is shifted and Decision Directed SNR estimation is inaccurately performed. These effects are explained by modified probability density function and difference between actual SNR and estimated SNR is theoretically derived and quantatively analyzed. It is proved that SNR estimation error obtained through computer simulation is matched up with derived one, and SNR estimation performance is enhanced significantly by adopting suggested correction scheme.
https://doi.org/10.12673/jkoni.2012.16.6.982 인용 PDF KSCI

A study on loss combination in time and frequency for effective speech enhancement based on complex-valued spectrum (효과적인 복소 스펙트럼 기반 음성 향상을 위한 시간과 주파수 영역 손실함수 조합에 관한 연구)

Jung, Jaehee;Kim, Wooil
- The Journal of the Acoustical Society of Korea
- /
- v.41 no.1
- /
- pp.38-44
- /
- 2022
Speech enhancement is performed to improve intelligibility and quality of the noise-corrupted speech. In this paper, speech enhancement performance was compared using different loss functions in time and frequency domains. This study proposes a combination of loss functions to utilize advantage of each domain by considering both the details of spectrum and the speech waveform. In our study, Scale Invariant-Source to Noise Ratio (SI-SNR) is used for the time domain loss function, and Mean Squared Error (MSE) is used for the frequency domain, which is calculated over the complex-valued spectrum and magnitude spectrum. The phase loss is obtained using the sin function. Speech enhancement result is evaluated using Source-to-Distortion Ratio (SDR), Perceptual Evaluation of Speech Quality (PESQ), and Short-Time Objective Intelligibility (STOI). In order to confirm the result of speech enhancement, resulting spectrograms are also compared. The experimental results over the TIMIT database show the highest performance when using combination of SI-SNR and magnitude loss functions.
https://doi.org/10.7776/ASK.2022.41.1.038 인용 PDF KSCI

Adaptive Threshold for Speech Enhancement in Nonstationary Noisy Environments (비정상 잡음환경에서 음질향상을 위한 적응 임계 치 알고리즘)

Lee, Soo-Jeong;Kim, Sun-Hyob
- The Journal of the Acoustical Society of Korea
- /
- v.27 no.7
- /
- pp.386-393
- /
- 2008
This paper proposes a new approach for speech enhancement in highly nonstationary noisy environments. The spectral subtraction (SS) is a well known technique for speech enhancement in stationary noisy environments. However, in real world, noise is mostly nonstationary. The proposed method uses an auto control parameter for an adaptive threshold to work well in highly nonstationary noisy environments. Especially, the auto control parameter is affected by a linear function associated with an a posteriori signal to noise ratio (SNR) according to the increase or the decrease of the noise level. The proposed algorithm is combined with spectral subtraction (SS) using a hangover scheme (HO) for speech enhancement. The performances of the proposed method are evaluated ITU-T P.835 signal distortion (SIG) and the segment signal to-noise ratio (SNR) in various and highly nonstationary noisy environments and is superior to that of conventional spectral subtraction (SS) using a hangover (HO) and SS using a minimum statistics (MS) methods.
https://doi.org/10.7776/ASK.2008.27.7.386 인용 PDF KSCI

Search Result 190, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)