통합 검색 | Korea Science

잡음에 강한 음성 인식에서 SNR 기준 함수를 사용한 가우시안 함수 변형 및 결정에 관한 연구 (A Study on Variation and Determination of Gaussian function Using SNR Criteria Function for Robust Speech Recognition)

전선도;강철호
- 한국음향학회지
- /
- 제18권7호
- /
- pp.112-117
- /
- 1999
잡음에 강한 음성인식시스템을 위하여 주파수 차감법을 사용할 경우 음성 신호마저 차감하여 신호를 더욱 부식시키는 경우가 존재한다. 본 연구에서는 이러한 경우를 위해서 프레임 마다 추정 잡음과 차감 신호의 SNR(Signal to Noise Ratio) 함수로부터 반연속 HMM(Hidden Markov Model)의 가우시안 함수를 변형 및 결정하는 방법을 제안한다. 이 방법의 타당성을 위해 프레임마다 추정 잡음의 오류 정도가 추정 잡음의 크기와 관계함을 신호 파형 형태로써 보였으며, 이러한 이유에서 SNR을 기준으로 가우시안 함수를 변형 및 결정하게 된다. 실험에서 80㎞/h 이상의 속도로 달리는 차량 내에서 배경 잡음과 음성이 혼합되었을 때의 음성 인식율을 평가하였다. 그 결과 주파수 차감한 경우와 차감하지 않은 경우에 비해 본 논문에서 제안한 SNR에 의한 가우시안 결정 방법이 더욱 향상된 인식율을 보였다.
PDF

A New Robust Signal Recognition Approach Based on Holder Cloud Features under Varying SNR Environment

Li, Jingchao
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제9권12호
- /
- pp.4934-4949
- /
- 2015
The unstable characteristic values of communication signals along with the varying SNR (Signal Noise Ratio) environment make it difficult to identify the modulations of signals. Most of relevant literature revolves around signal recognition under stable SNR, and not applicable for signal recognition at varying SNR. To solve the problem, this research developed a novel communication signal recognition algorithm based on Holder coefficient and cloud theory. In this algorithm, the two-dimensional (2D) Holder coefficient characteristics of communication signals were firstly calculated, and then according to the distribution characteristics of Holder coefficient under varying SNR environment, the digital characteristics of cloud model such as expectation, entropy, and hyper entropy are calculated to constitute the three-dimensional (3D) digital cloud characteristics of Holder coefficient value, which aims to improve the recognition rate of the communication signals. Compared with traditional algorithms, the developed algorithm can describe the signals' features more accurately under varying SNR environment. The results from the numerical simulation show that the developed 3D feature extraction algorithm based on Holder coefficient cloud features performs better anti-noise ability, and the classifier based on interval gray relation theory can achieve a recognition rate up to 84.0%, even when the SNR varies from -17dB to -12dB.
https://doi.org/10.3837/tiis.2015.12.011 인용 PDF KSCI KPUBS HTML

Signal Enhancement of a Variable Rate Vocoder with a Hybrid domain SNR Estimator

Park, Hyung Woo
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제13권2호
- /
- pp.962-977
- /
- 2019
The human voice is a convenient method of information transfer between different objects such as between men, men and machine, between machines. The development of information and communication technology, the voice has been able to transfer farther than before. The way to communicate, it is to convert the voice to another form, transmit it, and then reconvert it back to sound. In such a communication process, a vocoder is a method of converting and re-converting a voice and sound. The CELP (Code-Excited Linear Prediction) type vocoder, one of the voice codecs, is adapted as a standard codec since it provides high quality sound even though its transmission speed is relatively low. The EVRC (Enhanced Variable Rate CODEC) and QCELP (Qualcomm Code-Excited Linear Prediction), variable bit rate vocoders, are used for mobile phones in 3G environment. For the real-time implementation of a vocoder, the reduction of sound quality is a typical problem. To improve the sound quality, that is important to know the size and shape of noise. In the existing sound quality improvement method, the voice activated is detected or used, or statistical methods are used by the large mount of data. However, there is a disadvantage in that no noise can be detected, when there is a continuous signal or when a change in noise is large.This paper focused on finding a better way to decrease the reduction of sound quality in lower bit transmission environments. Based on simulation results, this study proposed a preprocessor application that estimates the SNR (Signal to Noise Ratio) using the spectral SNR estimation method. The SNR estimation method adopted the IMBE (Improved Multi-Band Excitation) instead of using the SNR, which is a continuous speech signal. Finally, this application improves the quality of the vocoder by enhancing sound quality adaptively.
https://doi.org/10.3837/tiis.2019.02.026 인용 PDF KSCI HTML

혈관조영검사에서 매개변수 변화에 따른 Roadmap 영상의 화질평가 (Evaluation of Roadmap Image Quality by Parameter Change in Angiography)

공창기;송종남;한재복
- 한국방사선학회논문지
- /
- 제14권1호
- /
- pp.53-60
- /
- 2020
이 연구의 목적은 Roadmap 영상에서 화질에 영향을 미치는 인자들을 알아보기 위한 것으로, 조영제의 희석률, Collimation Field, Flow Rate를 변화하여 연구를 하였다. 화질의 정량적인 평가를 위해, 아크릴를 이용하여 3mm 혈관모형의 Water Phantom을 자체 제작하였고, 자체 제작한 혈관모형의 Water Phantom으로 Roadmap 영상을 획득하고, SNR(Signal to Noise Ratio)과 CNR(Contrast to Noise Ratio)을 분석하였다. CM : N/S 희석률 변화에 대한 연구에서 CM : N/S 희석률을 (100%~10% : 100%)로 변화를 주었으며, 혈관모형 Water Phantom을 이용하여 촬영한 Roadmap 영상의 SNR과 CNR의 측정 결과 CM에 N/S 희석률이 높아질수록 SNR의 측정값이 점차적으로 낮아짐을 나타났고, CNR의 측정값도 점차적으로 낮아짐을 나타났다. 결론적으로 CM : N/S의 희석률이 높아질수록 SNR과 CNR 낮아짐을 확인하였고, CM : N/S의 희석률(100%~70 : 30%)에서 유의한 이미지를 얻을 수 있음을 확인하였다. Collimation Field 변화에 대한 연구에서 혈관모형 Water Phantom을 이용하여 Colimation Field를 혈관모형 중심으로 좌, 우 2 cm 간격으로 좁히면서 0 cm, 2 cm, 4 cm, 6 cm, 8 cm 10 cm, 12 cm으로 각각 변화를 주었으며, Roadmap을 촬영한 영상의 SNR과 CNR의 측정 결과는 Collimation Field를 혈관모형 중심으로 좁힐수록 SNR과 CNR의 측정값이 증가하는 것을 확인할 수 있었다. Flow rate 변화에 대한 연구에서 Autoinjector의 Volume을 15로 일정하게 하고, Flow Rate를 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 으로 각각 변화를 주었다. 혈관모형 Water Phantom을 이용하여 Roadmap 영상을 촬영한 이미지의 SNR과 CNR의 측정 결과 Flow Rate를 증가했을 때, SNR의 측정값이 점차적으로 감소하다가 Flow Rate 9~10에서 SNR의 측정값이 점차적 증가를 보였고, CNR의 측정값도 점차적으로 감소하다가 Flow Rate 9~10에서 CNR의 측정값이 점차적으로 증가를 보였다. 그러나 ROI Mean 값과 Background Mean 값으로 SNR과 CNR의 상관관계를 확인할 수 없었다. 상관관계를 확인하기 위해 Flow Rate 변화에 따른 Roadmap 연구는 향후 더 많은 연구로 확인해야 할 것으로 사료된다. 결론적으로 Roadmap 영상의 화질에 영향을 미치는 인자들을 알아보기 위해 조영제의 희석률, Collimation Field, Flow Rate 변화에 대한 연구에서 조영제에 N/S의 희석률이 증가할수록 SNR과 CNR이 낮아져 화질과 대조도가 낮아지는 것을 확인하였으며, Collimation Field를 좁힐수록 SNR과 CNR이 증가하여 화질과 대조도가 높아지는 것을 확인하였다. 그러나 Flow Rate 변화에 대한 연구에서는 상관관계를 확인할 수 없었다. 검사 및 시술을 할 때 신장의 영향을 최소화하기 위해 적절한 조영제 농도 선택과 대조도 향상 및 피폭 감소를 위한 적절한 Collimation Field를 사용하는 것이 유용할 것으로 판단된다.
https://doi.org/10.7742/jksr.2020.14.1.53 인용 PDF KSCI HTML

잡음에 강인한 음성인식을 위한 스펙트럼 보상 방법 (A Spectral Compensation Method for Noise Robust Speech Recognition)

조정호
- 전자공학회논문지 IE
- /
- 제49권2호
- /
- pp.9-17
- /
- 2012
음성 인식 시스템의 용용에서 실제 문제점의 하나는 음성신호의 왜곡에 의한 인식성능의 저하이다. 음성신호의 왜곡에 가장 중요한 원인은 부가적인 잡음이다. 이 논문은 잡음에 강인한 음성인식을 위하여, 스펙트럼 피크 향상 기법과 효과적인 잡음 차감 기법에 기초한 스펙트럼 보상 방법을 기술한다. 제안한 방법은 음성 스펙트럼의 포먼트 구조를 향상시키고 스펙트럼 기울기를 보상하면서도 광 대역폭 스펙트럼 요소는 그대로 유지한다. 백색 가우스 잡음, 자동차 잡음, 음성 잡음 또는 지하철 잡음에 의해 왜곡된 음성을 이용한 인식실험을 수행한 결과, 새로운 방법은 스펙트럼 보상을 하지 않은 경우에 비해, 높은 SNR(Signal to Noise Ratio) 환경에서는 평균 오인식율을 약간 줄였으며, 낮은 SNR(10 dB) 환경에서는 평균 오인식율을 1/2로 크게 줄였다.
PDF KSCI

잡음 에너지 제어를 통한 지각 필터 성능 개선 (Performance Improvement of Perceptual Filter Using Noise Energy Control)

서정국;차형태
- 한국음향학회지
- /
- 제24권1호
- /
- pp.43-51
- /
- 2005
본 논문에서는 잡음 에너지 제어를 통한 지각 필터의 성능을 향상시킴으로써 잡음에 의해 열화 된 오디오 신호의 음질을 개선하는 알고리즘을 제안한다. 기존의 방식에서는 묵음 구간에서 획득한 잡음 에너지를 사용하여 필터를 구성하여 사용하지만, 신호 구간마다 달라지는 신호의 세기 및 잡음의 환경 정도에 많은 영향을 받아 잡음의 에너지가 급격하게 변화한다면 음질의 개선률이 감소함을 알 수 있다. 그러나 제안하는 방식에서는 묵음 구간에서 추정한 잡음의 에너지 제어를 통해 초기 추정 잡음보다 가까운 추정 잡음을 얻음으로써 잡음 에너지가 급격하게 변화하여도 음질 개선률에는 변화가 적음을 알 수 있었다. 또한 저 대역에 영향을 미치는 잡음의 경우에도 다른 방법들과는 달리 음질의 개선이 뚜렷하였다. 기존 방식과의 비교를 위해 다양한 신호 대 잡음 비 (signal-to-noise ratio, SNR)에서 열화 된 오디오 신호를 입력으로 사용하였다. 입력 SNR이 5dB, l0dE, 15dB와 20dB의 각각의 경우에 대하여 SSNR (Segmental SNR)과 잡음 대 마스킹 비 (Noise-to-mask ratio, NMR), 음질 테스트를 수행한 결과, 청감 테스트 (Mean Opinion Score, MOS Test)결과의 향상과 음질의 개선을 확인할 수 있었다.
PDF KSCI

Signal-to-Noise Ratio Formulas of a Scalar Gaussian Quantizer Mismatched to a Laplacian Source

이재건;나상신
- 한국통신학회논문지
- /
- 제36권6C호
- /
- pp.384-390
- /
- 2011
The paper derives formulas for the mean-squared error distortion and resulting signal-to-noise (SNR) ratio of a fixed-rate scalar quantizer designed optimally in the minimum mean-squared error sense for a Gaussian density with the standard deviation ${\sigma}_q$ when it is mismatched to a Laplacian density with the standard deviation ${\sigma}_q$. The SNR formulas, based on the key parameter and Bennett's integral, are found accurate for a wide range of $p${\equiv}\frac{\sigma_p}{\sigma_q}${\geqq}0.25$. Also an upper bound to the SNR is derived, which becomes tighter with increasing rate R and indicates that the SNR behaves asymptotically as $\frac{20\sqrt{3{\ln}2}}{{\rho}{\ln}10}\;{\sqrt{R}}$ dB.
https://doi.org/10.7840/KICS.2011.36C.6.384 인용 PDF KSCI

운율 패턴, 강도, 신호대소음비에 따른 문장 지각 변화 (Perception of sentences varying with prosody pattern, sound intensity, and signal-to-noise ratio)

장선아;장은주;장재진
- 말소리와 음성과학
- /
- 제9권2호
- /
- pp.119-124
- /
- 2017
This study investigates how perception of easy sentences varies with prosody pattern, sound intensity, and signal-to-noise ratio(SNR) in young adults with normal hearing who were in their 20's. The results showed that the presence of proper prosody pattern in the sentences increased correct perception rate of the target sentences, and that the lower the intensity and SNR, the lower the sentence perception scores. The results also showed that SNR had a greater effect on the sentence perception scores than sound intensity. There was a significant decrease of perception scores starting at the level of 15 dB and +3 SNR for the sentences with prosody pattern, while starting at the level of 18 dB and +6 SNR for the sentences without prosody pattern, ending up with a very poor perception score as sound intensity and SNR gets lower. There was a significant difference in the perception score of the sentences with prosody pattern between 20 year-old group and 21 year or older group in several listening conditions of sound intensity and SNR.
https://doi.org/10.13064/KSSS.2017.9.2.119 인용 PDF KSCI

간섭과 잡음이 존재하는 Hard-Limiting 위성채널상에서의 DS-BPSK신호의 오율특성 (Error Rate Performance of DS-BPSK Signal transmitted through a Hard-Limiting Satellite Channel in the presence of Interference and Noise)

신동일;조성준
- 한국통신학회논문지
- /
- 제11권1호
- /
- pp.64-72
- /
- 1986
동일채널간섭(cochannel interference)과 다운링크(downlink) 가우스성잡음이 존재하는 환경하에서 비선형 위성 트랜스폰더(transponder)를 통과하는 DS-BPSK(Direct Sequence Binary Phase Shift Keying)신호의 오율식을 구했다. 이 때 위성 트랜스폰더의 입력으로는 DS-BPSK신호와 스펙트럼 확산된 광대역의 동일채널간섭신호와의 합성파를 가정하였다. 구해진 오율식에 의한 계산결과는 반송파 대 간섭파 전력 비(CIR), 다운링크 신호 대 잡음 전력 비(downlink SNR) 그리고 처리이득(process gain)을 파라미터로 하여 그래프로 나타내고 분석했다. 그 결과, DS-BPSK신호와 간섭신호가 하드 리미터(hard limiter)특성의 트랜스폰더를 통과하게 되면 수신기의 복조단에서는 처리이득을 증가시키더라도 개선되어 지지 않는 협대역의 상호 변조적 성분이 생긴다는 것을 알 수 있었다. 오율면에서는 CIR이 낮을 경우(약 10dB이하) 에는 CIR의 증가에 따른 오율 개선도가 현격하지만 약 20dB이상의 경우에서는 별다른 개선 효과가 없었다. 또한 처리이득의 경우는 일정한 오율값에 대해 처리이득을 약 10배 증가시키므로써 약 10dB 정도의 다운링크 SNR개선을 얻을 수 있었다.
PDF

SNR 매핑을 이용한 환경적응 기반 음성인식 (Speech Recognition based on Environment Adaptation using SNR Mapping)

정용주
- 한국전자통신학회논문지
- /
- 제9권5호
- /
- pp.543-548
- /
- 2014
다 모델 기반의 음성인식기는 음성인식에서 매우 성공적임이 알려져 있다. 그것은 다양한 신호-대-잡음비(SNR)와 잡음종류에 해당하는 다수의 HMM을 사용함으로서 선택된 음향모델이 인식잡음음성에 매우 근접한 일치성을 가질 수 있기 때문이다. 그러나 실제 사용시에 HMM의 개수가 제한됨에 따라서 음향모델의 불일치는 여전히 문제로 남아 있다. 본 논문에서는 인식잡음음성과 HMM 간의 SNR 불일치를 줄이고자 이들 간의 최적의 SNR 매핑 (mapping)을 실험적으로 결정하였다. 인식잡음음성으로 부터 추정된 SNR 값을 사용하는 대신 제안된 SNR 매핑을 사용함으로서 향상된 인식결과를 얻을 수 있었다. 다 모델 기반인식기에 제안된 방법을 적용하여 Aurora 2 데이터베이스에 대해서 인식 실험한 결과 기존의 MTR 이나 다 모델 기반 음성인식기에 비해서 6.3%와 9.4%의 상대적 단어 오인식율 감소를 이룰 수 있었다.
https://doi.org/10.13067/JKIECS.201.9.5.543 인용 PDF KSCI

검색결과 286건 처리시간 0.025초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)