Search | Korea Science

An Adaptive Wind Noise Reduction Method Based on a priori SNR Estimation for Speech Eenhancement (음성 강화를 위한 a priori SNR 추정기반 적응 바람소리 저감 방법)

Seo, Ji-Hun;Lee, Seok-Pil
- The Transactions of The Korean Institute of Electrical Engineers
- /
- v.64 no.12
- /
- pp.1756-1760
- /
- 2015
This paper focuses on a priori signal to noise ratio (SNR) estimation method for the speech enhancement. There are many researches for speech enhancement with several ambient noise cancellation methods. The method based on spectral subtraction (SS) which is widely used in noise reduction has a trade-off between the performance and the distortion of the signals. So the need of adaptive method like an estimated a priori SNR being able to making a high performance and low distortion is increasing. The decision directed (DD) approach is used to determine a priori SNR in noisy speech signals. A priori SNR is estimated by using only the magnitude components and consequently follows a posteriori SNR with one frame delay. We propose a modified a priori SNR estimator and the weighted rational transfer function for speech enhancement with wind noises. The experimental result shows the performance of our proposed estimator is better Perceptual Evaluation of Speech Quality scores (PESQ, ITU-T P.862) compare to the conventional DD approach-based systems and different noise reduction methods.
https://doi.org/10.5370/KIEE.2015.64.12.1756 인용 PDF KSCI

ML Frame Synchronization for Gaussian Channel with Co-channel Interference (가우스 잡음과 CO-CHANNEL 간섭이 존재하는 채널에서의 최대추정 프레임 동기)

문병현;우홍체;김신환;이채욱
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.18 no.5
- /
- pp.643-649
- /
- 1993
The problem of locating a periodically inserted frame synchronization pattern in random data for a binary pulse amplitude modulated (PAM) digital communication system over a additive white Gaussian noise(AWGN) channel with co-channel interference is considered. The performance degradation of frame synchronization for the correlation rule due to the presence of co-channel interference is shown. The maximum likelihood(ML) decision rule for the frame synchronization over an AWGN channel with co-channel interference is derived. For the entire range of SNR considered, the ML frame synchronization rule obtains about 1dB signal energy gain over the correlation rule. Specially, the ML rule obtains as much as 2dB gain over the correlation rule when the SNR is greater than 0dB.
PDF

Audio Forensic Marking using Psychoacoustic Model II and MDCT (심리음향 모델 II와 MDCT를 이용한 오디오 포렌식 마킹)

Rhee, Kang-Hyeon
- Journal of the Institute of Electronics Engineers of Korea CI
- /
- v.49 no.4
- /
- pp.16-22
- /
- 2012
In this paper, the forensic marking algorithm is proposed using psychoacoustic model II and MDCT for high-quality audio. The proposed forensic marking method, that inserts the user fingerprinting code of the audio content into the selected sub-band, in which audio signal energy is lower than the spectrum masking level. In the range of the one frame which has 2,048 samples for FFT of original audio signal, the audio forensic marking is processed in 3 sub-bands. According to the average attack of the fingerprinting codes, one frame's SNR is measured on 100% trace ratio of the collusion codes. When the lower strength 0.1 of the inserted fingerprinting code, SNR is 38.44dB. And in case, the added strength 0.5 of white gaussian noise, SNR is 19.09dB. As a result, it confirms that the proposed audio forensic marking algorithm is maintained the marking robustness of the fingerprinting code and the audio high-quality.
PDF KSCI

An Enhanced MELP Vocoder in Noise Environments (MELP 보코더의 잡음성능 개선)

전용억;전병민
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.28 no.1C
- /
- pp.81-89
- /
- 2003
For improving the performance of noise suppression in tactical communication environments, an enhanced MELP vocoder is suggested, in which an acoustic noise suppressor is integrated into the front end of the MELP algorithm, and an FEC code into the channel side of the MELP algorithm. The acoustic noise suppressor is the modified IS-127 EVRC noise suppressor which is adapted for the MELP vocoder. As for FEC, the turbo code, which consists of rate-113 encoding and BCJR-MAP decoding algorithm, is utilized. In acoustic noise environments, the lower the SNR becomes, the more the effects of noise suppression is increased. Moreover, The suggested system has greater noise suppression effects in stationary noise than in non-stationary noise, and shows its superiority by 0.24 in MOS test to the original MELP vocoder. When the interleave size is one MELP frame, BER 10-6 is accomplished at channel bit SNR 4.2 ㏈. The iteration of decoding at 3 times is suboptimal in its complexity vs. performance. Synthetic quality is realized as more than MOS 2.5 at channel bit SNR 2 ㏈ in subjective voice quality test, when the interleave size is one MELP frame and the iteration of decoding is more than 3 times.
PDF KSCI

A Novel Approach to a Robust A Priori SNR Estimator in Speech Enhancement (음성 향상에서 강인한 새로운 선행 SNR 추정 기법에 관한 연구)

Park, Yun-Sik;Chang, Joon-Hyuk
- The Journal of the Acoustical Society of Korea
- /
- v.25 no.8
- /
- pp.383-388
- /
- 2006
This Paper presents a novel approach to single channel microphone speech enhancement in noisy environments. Widely used noise reduction techniques based on the spectral subtraction are generally expressed as a spectral gam depending on the signal-to-noise ratio (SNR). The well-known decision-directed(DD) estimator of Ephraim and Malah efficiently reduces musical noise under the background noise conditions, but generates the delay of the a prioiri SNR because the DD weights the speech spectrum component of the Previous frame in the speech signal. Therefore, the noise suppression gain which is affected by the delay of the a priori SNR, which is estimated by the DD matches the previous frame rather than the current one, so after noise suppression. this degrades the noise reduction performance during speech transient periods. We propose a computationally simple but effective speech enhancement technique based on the sigmoid type function for the weight Parameter of the DD. The proposed approach solves the delay problem about the main parameter, the a priori SNR of the DD while maintaining the benefits of the DD. Performances of the proposed enhancement algorithm are evaluated by ITU-T p.862 Perceptual Evaluation of Speech duality (PESQ). the Mean Opinion Score (MOS) and the speech spectrogram under various noise environments and yields better results compared with the fixed weight parameter of the DD.
https://doi.org/10.7776/ASK.2006.25.8.383 인용 PDF KSCI

A Study on SNR Estimation of Continuous Speech Signal (연속음성신호의 SNR 추정기법에 관한 연구)

Song, Young-Hwan;Park, Hyung-Woo;Bae, Myung-Jin
- The Journal of the Acoustical Society of Korea
- /
- v.28 no.4
- /
- pp.383-391
- /
- 2009
In speech signal processing, speech signal corrupted by noise should be enhanced to improve quality. Usually noise estimation methods need flexibility for variable environment. Noise profile is renewed on silence region to avoid effects of speech properties. So we have to preprocess finding voice region before noise estimation. However, if received signal does not have silence region, we cannot apply that method. In this paper, we proposed SNR estimation method for continuous speech signal. The waveform which is stationary region of voiced speech is very correlated by pitch period. So we can estimate the SNR by correlation of near waveform after dividing a frame for each pitch. For unvoiced speech signal, vocal track characteristic is reflected by noise, so we can estimate SNR by using spectral distance between spectrum of received signal and estimated vocal track. Lastly, energy of speech signal is mostly distributed on voiced region, so we can estimate SNR by the ratio of voiced region energy to unvoiced.
https://doi.org/10.7776/ASK.2009.28.4.383 인용 PDF KSCI

Frequency Division Concurrent Sensing Method for High-Speed Detection of Large Touch Screens (대형 터치스크린의 고속감지를 위한 주파수분할 동시센싱 기법)

Jang, Un-Yong;Kim, HyungWon
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.19 no.4
- /
- pp.895-902
- /
- 2015
This paper presents a high-speed sensing and noise cancellation technique for large touch screens, which is called FDCS (Frequency Division Concurrent Sensing). Most conventional touch screen detection methods apply excitation pulses sequentially and analyze the sensing signals sequentially, and so are often unacceptably slow for large touch screens. The proposed technique applies sinusoidal signals of orthogonal frequencies simultaneously to all drive lines, and analyzes the signals from each sense line in frequency domain. Its parallel driving allows high speed detection even for a very large touch screens. It enhances the sensing SNR (Signal to Noise Ratio) by introducing a frequency domain noise filtering scheme. We also propose a pre-distortion equalizer, which compensates the drive signals using the inverse transfer function of touch screen panel to further enhance the sensing SNR. Experimental results with a 23" large touch screen show that the proposed technique enhances the frame scan rate by 273% and an SNR by 43dB compared with a conventional scheme.
https://doi.org/10.6109/jkiice.2015.19.4.895 인용 PDF KSCI KPUBS HTML

Speech Enhancement Using Phase-Dependent A Priori SNR Estimator in Log-Mel Spectral Domain

Lee, Yun-Kyung;Park, Jeon Gue;Lee, Yun Keun;Kwon, Oh-Wook
- ETRI Journal
- /
- v.36 no.5
- /
- pp.721-729
- /
- 2014
We propose a novel phase-based method for single-channel speech enhancement to extract and enhance the desired signals in noisy environments by utilizing the phase information. In the method, a phase-dependent a priori signal-to-noise ratio (SNR) is estimated in the log-mel spectral domain to utilize both the magnitude and phase information of input speech signals. The phase-dependent estimator is incorporated into the conventional magnitude-based decision-directed approach that recursively computes the a priori SNR from noisy speech. Additionally, we reduce the performance degradation owing to the one-frame delay of the estimated phase-dependent a priori SNR by using a minimum mean square error (MMSE)-based and maximum a posteriori (MAP)-based estimator. In our speech enhancement experiments, the proposed phase-dependent a priori SNR estimator is shown to improve the output SNR by 2.6 dB for both the MMSE-based and MAP-based estimator cases as compared to a conventional magnitude-based estimator.
https://doi.org/10.4218/etrij.14.2214.0039 인용 PDF KSCI KPUBS

Multiple Transmit Focusing Method With Modified Orthogonal Golay Codes for Ultrasound Imaging (초음파 영상에서 변형된 직교 골레이 코드를 이용한 동시 다중 송신 집속 기법)

김배형;송태경
- Journal of Biomedical Engineering Research
- /
- v.24 no.3
- /
- pp.217-231
- /
- 2003
Coded excitation with complementary Golay sequences is an effective means to increase the SNR and penetration of ultrasound imaging. in which the two complementary binary codes are transmitted successively along each scan-line, reducing the imaging frame rate by half. This method suffers from low frame rate particularly when multiple transmit focusing is employed, since the frame rate will be further reduced in proportion to the number of focal zones. In this paper. a new ultrasound imaging technique based on simultaneous multiple transmit focusing using modified orthogonal Golay codes is proposed to improve lateral resolution with no accompanying decrease in the imaging frame rate, in which a pair of orthogonal Golay codes focused at two different focal depths are transmitted simultaneously. On receive, these modified orthogonal Golay codes are separately compressed into two short pulses and individually focused. These two focused beams are combined to form a frame of image with improved lateral resolution. The Golay codes were modified to improve the transmit power efficiency (TPE) for practical imaging. Computer simulations and experimental results show that the proposed method improves significantly the lateral resolution and penetration of ultrasound imaging compared with the conventional method.
PDF KSCI

Efficient Frame Synchronization Detector and Low Complexity Automatic Gain Controller for DVB-S2 (효율적인 디지털 위성 방송 프레임 동기 검출 회로 및 낮은 복잡도의 자동 이득 제어 회로)

Choi, Jin-Kyu;Sunwoo, Myung-Hoon;Kim, Pan-Soo;Chang, Dae-Ig
- Journal of the Institute of Electronics Engineers of Korea SD
- /
- v.46 no.2
- /
- pp.31-37
- /
- 2009
This paper presents an efficient frame synchronization strategy with the identification of modulation type for Digital Video Broadcasting-Satellite second generation (DVB-S2). To detect the Start Of Frame (SOF) and identify a modulation mode at low SNR, we propose a new correlator structure and a low complexity Automatic Gain Controller (AGC). The proposed frame synchronization architecture can reduce about 93% multipliers and 89% adders compared with the direct implementation of the Differential - Generalized Post Detection Integration (D-GPDI) algorithm which is very complex and the proposed a low complexity AGC consists of only 5 multipliers and 3 adders. The proposed architecture has been thoroughly verified on the Xilinx Virtex II FPGA board.
PDF KSCI

Search Result 112, Processing Time 0.045 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)