Search | Korea Science

Reference Channel Input-Based Speech Enhancement for Noise-Robust Recognition in Intelligent TV Applications (지능형 TV의 음성인식을 위한 참조 잡음 기반 음성개선)

Jeong, Sangbae
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.17 no.2
- /
- pp.280-286
- /
- 2013
In this paper, a noise reduction system is proposed for the speech interface in intelligent TV applications. To reduce TV speaker sound which are very serious noises degrading recognition performance, a noise reduction algorithm utilizing the direct TV sound as the reference noise input is implemented. In the proposed algorithm, transfer functions are estimated to compensate for the difference between the direct TV sound and that recorded with the microphone installed on the TV frame. Then, the noise power spectrum in the received signal is calculated to perform Wiener filter-based noise cancellation. Additionally, a postprocessing step is applied to reduce remaining noises. Experimental results show that the proposed algorithm shows 88% recognition rate for isolated Korean words at 5 dB input SNR.
https://doi.org/10.6109/jkiice.2013.17.2.280 인용 PDF KSCI

음성 인식률 향상을 위한 음성의 특징 파라미터 추출 알고리즘

Choi, Jae-Seung
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2017.05a
- /
- pp.686-687
- /
- 2017
본 논문에서는 잡음에 강인하고 음성인식 성능이 효과적인 멜 주파수 켑스트럼 계수의 파라미터의 추출 알고리즘을 제안한다. 본 논문에서 제안한 알고리즘은 배경잡음이 혼합된 깨끗한 연속음성 중에서 위너필터를 이용하여 음성에 포함된 배경잡음을 감소시키며, 이후에 멜 주파수 켑스트럼 계수의 특징추출 방법을 사용하여 음성의 특징 파라미터를 추출한다.
PDF

Encounter of Lattice-type coding with Wiener's MMSE and Shannon's Information-Theoretic Capacity Limits in Quantity and Quality of Signal Transmission (신호 전송의 양과 질에서 위너의 MMSE와 샤논의 정보 이론적 정보량 극한 과 격자 코드 와의 만남)

Park, Daechul;Lee, Moon Ho
- Journal of the Institute of Electronics and Information Engineers
- /
- v.50 no.8
- /
- pp.83-93
- /
- 2013
By comparing Wiener's MMSE on stochastic signal transmission with Shannon's mutual information first proved by C.E. Shannon in terms of information theory, connections between two approaches were investigated. What Wiener wanted to see in signal transmission in noisy channel is to try to capture fundamental limits for signal quality in signal estimation. On the other hands, Shannon was interested in finding fundamental limits of signal quantity that maximize the uncertainty in mutual information using the entropy concept in noisy channel. First concern of this paper is to show that in deriving limits of Shannon's point to point fundamental channel capacity, Shannon's mutual information obtained by exploiting MMSE combiner and Wiener filter's MMSE are interelated by integro-differential equantion. Then, At the meeting point of Wiener's MMSE and Shannon's mutual information the upper bound of spectral efficiency and the lower bound of energy efficiency were computed. Choosing a proper lattice-type code of a mod-${\Lambda}$AWGN channel model and MMSE estimation of ${\alpha}$ confirmed to lead to the fundamental Shannon capacity limits.
https://doi.org/10.5573/ieek.2013.50.8.083 인용 PDF KSCI

Medical Image Restoration by Digital Image Processing (디지털영상처리를 이용한 의료영상복원)

Lee, Won-Seok;Chung, Kil-Soo;Lee, Yong-Gu
- 전자공학회논문지 IE
- /
- v.49 no.2
- /
- pp.75-81
- /
- 2012
In this paper, restoration methods were applied to restore analog medicine images with an aged image added and then blurred by noises. To restore the aged image blurred by the blurring function and added by noises, it was applied to the restoration methods which are inverse filtering and wiener filtering which are linear restoration techniques and Lucy-Richardson's algorithm which is nonlinear restoration technique. Moreover, ROC curve, a subjective evaluation method, was applied to evaluate the image quality of the restoration image. The wiener filtering using the ratio of constants acquired better image than the inverse filtering, but both of them couldn't improve ability to make a diagnosis. The restoration image applied to Lucy-Richardson algorithm was the best performance of the applied techniques and its sensitivity and specitivity were improved by 15[%] as much performance as the original aged image.
PDF KSCI

Wiener filtering-based ambient noise reduction technique for improved acoustic target detection of directional frequency analysis and recording sonobuoy (Directional frequency analysis and recording 소노부이의 표적 탐지 성능 향상을 위한 위너필터링 기반 주변 소음 제거 기법)

Hong, Jungpyo;Bae, Inyeong;Seok, Jongwon
- The Journal of the Acoustical Society of Korea
- /
- v.41 no.2
- /
- pp.192-198
- /
- 2022
As an effective weapon system for anti-submarine warfare, DIrectional Frequency Analysis and Recording (DIFAR) sonobuoy detects underwater targets via beamforming with three channels composed of an omni-direcitonal and two directional channels. However, ambient noise degrades the detection performance of DIFAR sonobouy in specific direction (0°, 90°, 180°, 270°). Thus, an ambient noise redcution technique is proposed for performance improvement of acoustic target detection of DIFAR sonobuoy. The proposed method is based on OTA (Order Truncate Average), which is widely used in sonar signal processing area, for ambient noise estimation and Wiener filtering, which is widely used in speech signal processing area, for noise reduction. For evaluation, we compare mean square errors of target bearing estmation results of conventional and proposed methods and we confirmed that the proposed method is effective under 0 dB signal-to-noise ratio.
https://doi.org/10.7776/ASK.2022.41.2.192 인용 PDF KSCI

Robust Speech Enhancement Using HMM and $H_\infty$ Filter (HMM과 $H_\infty$필터를 이용한 강인한 음성 향상)

이기용;김준일
- The Journal of the Acoustical Society of Korea
- /
- v.23 no.7
- /
- pp.540-547
- /
- 2004
Since speech enhancement algorithms based on Kalman/Wiener filter require a priori knowledge of the noise and have focused on the minimization of the variance of the estimation error between clean and estimated speech signal, small estimation error on the noise statistics may lead to large estimation error. However, H/sub ∞/ filter does not require any assumptions and a priori knowledge of the noise statistics, but searches the best estimated signal among the entire estimated signal by applying least upper bound, consequently it is more robust to the variation of noise statistics than Kalman/Wiener filter. In this paper, we Propose a speech enhancement method using HMM and multi H/sub ∞/ filters. First, HMM parameters are estimated with the training data. Secondly, speech is filtered with multiple number of H/sub ∞/ filters. Finally, the estimation of clean speech is obtained from the sum of the weighted filtered outputs. Experimental results shows about 1dB∼2dB SNR improvement with a slight increment of computation compared with the Kalman filter method.
PDF KSCI

Three-Dimensional Processing of Ultrasonic Pulse-Echo Signal (초음파 펄스에코 신호의 3차원 처리)

Song, Moon-Ho;Song, Sang-Rock;Cho, Jung-Ho;Sung, Je-Joong;Ahn, Hyung-Keun;Jang, Soon-Jae
- Journal of the Korean Society for Nondestructive Testing
- /
- v.23 no.5
- /
- pp.464-474
- /
- 2003
Ultrasonic imaging of 3-D structures for nondestructive evaluation must provide readily recognizable images with enough details to clearly show various flaws that may or may not be present. Typical flaws that need to be detected are miniature cracks, for instance, in metal pipes having aged over years of operation in nuclear power plants; and these sub-millimeter cracks or flaws must be depicted in the final 3-D image for a meaningful evaluation. As a step towards improving conspicuity and thus detection of flaws, we propose a pulse-echo ultrasonic imaging technique to generate various 3-D views of the 3-D object under evaluation through strategic scanning and processing of the pulse-echo data. We employ a 2-D Wiener filter that filters the pulse-echo data along the plane orthogonal to the beam propagation so that ultrasonic beams can be sharpened. This three-dimensional processing and display coupled with 3-D manipulation capabilities by which users are able to pan and rotate the 3-D structure improve conspicuity of flaws. Providing such manipulation operations allow a clear depiction of the size and the location of various flaws in 3-D.
PDF KSCI

SNR-based Weight Control for the Spatially Preprocessed Speech Distortion Weighted Multi-channel Wiener Filtering (공간 필터와 결합된 음성 왜곡 가중 다채널 위너 필터에서의 신호 대 잡음 비에 의한 가중치 결정 방법)

Kim, Gibak
- Journal of Broadcast Engineering
- /
- v.18 no.3
- /
- pp.455-462
- /
- 2013
This paper introduces the Spatially Preprocessed Speech Distortion Weighted Multi-channel Wiener Filter (SP-SDW-MWF) for multi-microphone noise reduction and proposes a method to determine the speech distortion weights. The SP-SDW-MWF is known as a robust noise reduction algorithm against the error caused by the mismatch in microphones. The SP-SDW-MWF adopts weights which determine the amount of noise reduction at the expense of introducing speech distortion in the noise-suppressed speech. In this paper, we use the error of power spectral density between the estimated signal and the desired signal as the evaluation measure. Thus the a priori SNR is used to control the speech distortion weights in the frequency domain. In the experimental results, the proposed method yields better result in terms of MFCC distortion compared to the conventional method.
https://doi.org/10.5909/JBE.2013.18.3.455 인용 PDF KSCI

Hand-held Multimedia Device Identification Based on Audio Source (음원을 이용한 멀티미디어 휴대용 단말장치 판별)

Lee, Myung Hwan;Jang, Tae Ung;Moon, Chang Bae;Kim, Byeong Man;Oh, Duk-Hwan
- Journal of Korea Society of Industrial Information Systems
- /
- v.19 no.2
- /
- pp.73-83
- /
- 2014
Thanks to the development of diverse audio editing Technology, audio file can be easily revised. As a result, diverse social problems like forgery may be caused. Digital forensic technology is actively studied to solve these problems. In this paper, a hand-held device identification method, an area of digital forensic technology is proposed. It uses the noise features of devices caused by the design and the integrated circuit of each device but cannot be identified by the audience. Wiener filter is used to get the noise sounds of devices and their acoustic features are extracted via MIRtoolbox and then they are trained by multi-layer neural network. To evaluate the proposed method, we use 5-fold cross-validation for the recorded data collected from 6 mobile devices. The experiments show the performance 99.9%. We also perform some experiments to observe the noise features of mobile devices are still useful after the data are uploaded to UCC. The experiments show the performance of 99.8% for UCC data.
https://doi.org/10.9723/jksiis.2014.19.2.073 인용 PDF KSCI

Efficiency of Median Modified Wiener Filter Algorithm for Noise Reduction in PET/MR Images: A Phantom Study (PET/MR 영상에서의 팬텀을 활용한 노이즈 감소를 위한 변형된 중간값 위너필터의 적용 효율성 연구)

Cho, Young Hyun;Lee, Se Jeong;Lee, Youngjin;Park, Chan Rok
- Journal of radiological science and technology
- /
- v.44 no.3
- /
- pp.225-229
- /
- 2021
The digital image such as medical X-ray and nuclear medicine field mainly contains noise distribution. The noise degree in image degrades image quality. That is why, the noise reduction algorithm is efficient for medical image field. In this study, we confirmed effectiveness of application for median modified Wiener filter (MMWF) algorithm for noise reduction in PET/MR image compared with median filter image, which is used as conventional noise redcution algorithm. The Jaszczak PET phantom was used by using 18F solution and filled with NaCl+NiSO4 fluids. In addition, the radioactivity ratio between background and six spheres in the phantom is maintained to 1:8. In order to mimic noise distribution in the image, we applied Gaussian noise using MATLAB software. To evlauate image quality, the contrast to noise ratio (CNR) and coefficient of variation (COV) were used. According to the results, compared with noise image and images with MMWF algorithm, the image with MMWF algorithm is increased approximately 33.2% for CNR result, decreased approximately 79.3% for COV result. In conclusion, we proved usefulness of MMWF algorithm in the PET/MR images.
https://doi.org/10.17946/JRST.2021.44.3.225 인용 PDF KSCI

Search Result 76, Processing Time 0.027 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)