Search | Korea Science

Efficient Speaker Verification in Noise Environment with Noise-added Speaker Model Composition (잡음 첨가된 화자 모델 구성에 의한 잡음 환경의 효과적인 화자확인)

안성주;강선미;고한석
- Proceedings of the Korean Information Science Society Conference
- /
- 1999.10b
- /
- pp.542-544
- /
- 1999
본 논문에서는 다수의 화자 모델을 구성함으로써 잡음에 강인한 화자확인 방법을 제안한다. Non-stationary한 잡음을 가진 입력음성의 SNR을 측정하는 것은 어렵기 때문에, 각 화자에 대해 잡음이 없을 때의 화자모델에 여러 SNR에 대한 잡음 모델을 결합시킴으로써 여러 개의 잡음 첨가된 화자 모델을 구성한다. 그리고, 화자확인에서는 이렇게 구한 각 모델에 대한 입력 음성의 likelihood를 구해 그 중 가장 큰 likelihood만을 선택한다. 이 값을 이용하여 화자확인을 수행한다. 실험 결과, 제안한 방법은 입력음성의 SNR을 모르는 잡음환경에서 일반적으로 하나의 모델을 사용하는 것보다 훨씬 좋은 성능을 보였다.
PDF

Model adaptation employing DNN-based estimation of noise corruption function for noise-robust speech recognition (잡음 환경 음성 인식을 위한 심층 신경망 기반의 잡음 오염 함수 예측을 통한 음향 모델 적응 기법)

Yoon, Ki-mu;Kim, Wooil
- The Journal of the Acoustical Society of Korea
- /
- v.38 no.1
- /
- pp.47-50
- /
- 2019
This paper proposes an acoustic model adaptation method for effective speech recognition in noisy environments. In the proposed algorithm, the noise corruption function is estimated employing DNN (Deep Neural Network), and the function is applied to the model parameter estimation. The experimental results using the Aurora 2.0 framework and database demonstrate that the proposed model adaptation method shows more effective in known and unknown noisy environments compared to the conventional methods. In particular, the experiments of the unknown environments show 15.87 % of relative improvement in the average of WER (Word Error Rate).
https://doi.org/10.7776/ASK.2019.38.1.047 인용 PDF KSCI HTML

A Crowd Noise Reduction Model for Speech Signal processing (음성 신호처리를 위한 군중잡음 제거 모델)

안용운;김중환;김상철
- Proceedings of the Korean Information Science Society Conference
- /
- 2002.10d
- /
- pp.502-504
- /
- 2002
군중잡음(crowd noise)이 발생하는 환경에서 음성 통화 및 화자 인식을 할 때에는 음성에 파열음이나 마찰음과 같은 유색잡음(colored noise)이 부가되어 원래 음성이 왜곡된다. 이와 같이 왜곡된 음성 신호를 처리할 때에는 군중잡음을 제거하는 과정이 반드시 필요하다. 본 논문에서는 군중잡음의 특성을 분석하고, 그 결과를 이용하여 음성 신호처리 시에 효과적으로 군중잡음만을 제거할 수 있는 모델을 제안한다. 제안된 모델은 시간 영역에서는 침묵 구간을 검출하여 마찰음과 파열음을 제거하는 과정과 주파수 영역에서는 잡음 평균을 생성하고 이를 이용한 스펙트럼 차감법(spectral subtraction)으로 군중 잡음을 제거하는 과정으로 이루어진다.
PDF

Improved Correlation Noise Modeling for Transform-Domain Wyner-Ziv Coding (변환영역에서의 Wyner-Ziv 코딩을 위한 개선된 상관 잡음 모델)

Kim, Byung-Hee;Ko, Bong-Hyuck;Jeon, Byeung-Woo
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2008.11a
- /
- pp.81-84
- /
- 2008
최근 센서네트워크와 같은 에너지 제한 환경을 위한 경량화 부호화 기술의 필요성이 대두됨에 따라 분산 소스 부호화 기술(Distributed Source Coding)의 응용기술로 비디오 부호화 복잡도의 대부분을 차지하는 움직임 예측/보상과정을 부호화기가 아닌 복호화기에서 수행하는 분산 비디오 부호화 기술(Distributed Video Coding)에 대한 연구가 활발히 이루어져 왔다. 이에 가장 대표적인 기술인 Wyner-Ziv 코딩 기술은 채널 코드를 이용하여 원본 프레임과 이에 대한 복호화기의 예측영상인 보조정보 사이의 잡음을 제거하여 영상을 복원한다. 일반적으로 보조정보는 원본영상에 유사한 키 프레임간의 프레임 보간을 통하여 생성되며 채널 코드는 Shannon limit에 근접한 성능을 보이는 Turbo 코드나 LDPC 코드가 사용된다. 이와 같은 채널 코드의 복호화는 채널 잡음 모델에 기반하여 수행되어지며 Wyner-Ziv 코딩 기술에서는 이 채널 잡음 모델을 '상관 잡음 모델' (Correlation Noise Modeling)이라 하고 일반적으로 Laplacian이나 Gaussian으로 모델화 한다. 하지만 복호화기에는 원본 영상에 대한 정보가 없기 때문에 정확한 상관 잡음 모델을 알 수 없으며 잡음 모델에 대한 예측의 부정확성은 잡음 제거를 위한 패리티 비트의 증가를 야기해 부호화 기술의 압축 성능 저하를 가져온다. 이에 본 논문은 원본 프레임과 보조정보 사이의 잡음을 정확하게 예측하여 잡음을 정정할 수 있는 향상된 상관 잡음 모델을 제안한다. 제안 방법은 잘못된 잡음 예측에 의해 Laplacian 계수가 너무 커지는 것을 방지하면서 영상내의 잡음의 유무에 별다른 영향을 받지 않는 새로운 문턱값을 사용한다. 다양한 영상에 대한 제안 방법의 실험 결과는 평균적으로 약 0.35dB에 해당하는 율-왜곡 성능 향상을 보여주었다.
PDF

A Scalable Bias-dependent P-HEMT Noise Model with Single Drain Current Noise Source (드레인 전류 잡음원만을 고려한 스케일링이 가능한 바이어스 의존 P-HEMT 잡음모델)

윤경식
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.24 no.10A
- /
- pp.1579-1587
- /
- 1999
Bias-dependent noise models of $0.2\mu\textrm{m}$ gate length P-HEMT's which are scalable with gate width are proposed. To predict S-parameters of the P-HEMT's the intrinsic parameters except for $\tau$ subtracted the offsets introduced in this paper are normalized to the gate width and then scaled. The small-signal model parameters are expressed as fitting functions of the drain current to $\textrm{I}_{dss}$ ratio and gate width. In addition, to estimate accurately noise parameters the noise temperature $\textrm{T}_{g}$ of the intrinsic resistance, the equivalent noise conductance $\textrm{G}_{ni}$ of the gate current noise source, and the equivalent noise conductance $\textrm{G}_{no}$ of the drain current noise source are adopted as the noise model parameters. The extracted values of $\textrm{T}_{g}$ are nearly independent of drain current and gate width and their average is around the ambient temperature. The extracted values of $\textrm{G}_{ni}$ are small enough to be neglected to the circuit characteristics. From the comparison of the noise model with only $\textrm{G}_{no}$ and that having $\textrm{T}_{g}$, $\textrm{G}_{ni}$ and $\textrm{G}_{no}$ to the measured data it is fund that even the former model is in good agreement with the measured noise parameters. Thus, from a practical point of view the noise model having only the drain current noise source is confirmed as a scalable bias-dependent model.
PDF

A Study on Phase-Noise and Jitter due to the Power Supply Noise of the CMOS Ring Oscillator (CMOS 링발진기의 전원 잡음에 의한 위상잡음과 Jitter 연구)

Park Se-Hoon
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.10 no.2
- /
- pp.298-302
- /
- 2006
Models for the phase noise and jitter of the ring oscillator with the power supply noise are suggested and verified by simulations. The power supply noise is converted into the phase-noise by the narrow band phase modulation. The phase-noise appears as sideband frequencies apart from the center frequency of the ring oscillator as much as the frequency of the power supply noise. A jitter model describing the linear relation of jitter with the amplitude of the power supply noise is suggested and verified by simulation.
PDF KSCI

Performance Analysis of M-ary APSK Modulation Method in the Presence of Phase Noise (M-ary APSK 변조 방식의 위상잡음 영향 분석)

Kim, Young-Wan
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.18 no.7
- /
- pp.1511-1517
- /
- 2014
Based on the simulation model for phase noise effect, the phase noise performance of communication system using the M-ary APSK modulation method is analyzed in this paper. The phase noise effects for APSK higher order modulation method are modeled by considering the signal constellations of the modulation methods. The simulation model is verified by comparing the analytic results for lower order modulation signals with simulated results for those modulation signals. Based on the simulation model, then, the phase noise effects for M-ary APSK are analyzed. The simulated results for PSK lower order modulation signals are in agreement with the results by the analytic model in the order of 1 %. The proper phase noise effects for M-ary APSK modulation signals are deduced by the simulation model. Also, the required signal-to-noise ratios of phase noise are proposed for phase noise degradations, which are used as the design parameter that considers the phase noise effect.
https://doi.org/10.6109/jkiice.2014.18.7.1511 인용 PDF KSCI

The Use of Unsupervised Machine Learning for the Attenuation of Seismic Noise (탄성파 자료 잡음 제거를 위한 비지도 학습 연구)

Kim, Sujeong;Jun, Hyunggu
- Geophysics and Geophysical Exploration
- /
- v.25 no.2
- /
- pp.71-84
- /
- 2022
When acquiring seismic data, various types of simultaneously recorded seismic noise hinder accurate interpretation. Therefore, it is essential to attenuate this noise during the processing of seismic data and research on seismic noise attenuation. For this purpose, machine learning is extensively used. This study attempts to attenuate noise in prestack seismic data using unsupervised machine learning. Three unsupervised machine learning models, N2NUNET, PATCHUNET, and DDUL, are trained and applied to synthetic and field prestack seismic data to attenuate the noise and leave clean seismic data. The results are qualitatively and quantitatively analyzed and demonstrated that all three unsupervised learning models succeeded in removing seismic noise from both synthetic and field data. Of the three, the N2NUNET model performed the worst, and the PATCHUNET and DDUL models produced almost identical results, although the DDUL model performed slightly better.
https://doi.org/10.7582/GGE.2022.25.2.071 인용 PDF KSCI

An Efficient Approach for Noise Robust Speech Recognition by Using the Deterministic Noise Model (결정적 잡음 모델을 이용한 효율적인 잡음음성 인식 접근 방법)

정용주
- The Journal of the Acoustical Society of Korea
- /
- v.21 no.6
- /
- pp.559-565
- /
- 2002
In this paper, we proposed an efficient method that estimates the HMM (Hidden Marke Model) parameters of the noisy speech. In previous methods, noisy speech HMM parameters are usually obtained by analytical methods using the assumed noise statistics. However, as they assume some simplication in the methods, it is difficult to come closely to the real statistics for the noisy speech. Instead of using the simplication, we used some useful statistics from the clean speech HMMs and employed the deterministic noise model. We could find that the new scheme showed improved results with reduced computation cost.
PDF KSCI

Design for Crowd Noise Reduction System Using DSI and Spectral Subtraction (DSI와 스펙트럼 차감법을 이용한 군중잡음 감쇄기의 설계)

Ahn, Yong-Woon;Kim, Sang-Chul;Kim, Joong-Hwan
- Proceedings of the Korea Information Processing Society Conference
- /
- 2002.11a
- /
- pp.703-706
- /
- 2002
군중잡음(crowd noise)이 발생하는 환경에서 음성 통화 및 화자 인식을 할 때에는 음성에 파열음이나 마찰음과 같은 유색잡음(colored noise)이 부가되어 원래 음성이 왜곡된다. 이와 같이 왜곡된 음성 신호를 처리할 때에는 군중잡음을 제거하는 과정이 반드시 필요하다. 본 논문에서는 전형적인 군중잡음의 모델인 쇼핑 센터 잡음을 분석하고, 그 결과를 이용하여 음성 신호처리시에 효과적으로 군중잡음만을 제거할 수 있는 모델을 제안한다. 제안된 모델은 시간 영역에서 마찰음과 파열음을 제거하고. DSI(Digital Speech Interpolation)를 이용하여 침묵 구간을 검출한다. 이때 주파수 영역에서는 이 침묵구간을 잡음으로 간주하여 이를 이용한 스펙트럼 차감법(spectral subtraction)으로 음성 신호에 부가된 군중 잡음을 제거하는 과정을 거친다.
PDF

Search Result 1,005, Processing Time 0.03 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)