• 제목/요약/키워드: SNR(signal-to­noise rate)

Search Result 288, Processing Time 0.024 seconds

A Study on Variation and Determination of Gaussian function Using SNR Criteria Function for Robust Speech Recognition (잡음에 강한 음성 인식에서 SNR 기준 함수를 사용한 가우시안 함수 변형 및 결정에 관한 연구)

  • 전선도;강철호
    • The Journal of the Acoustical Society of Korea
    • /
    • v.18 no.7
    • /
    • pp.112-117
    • /
    • 1999
  • In case of spectral subtraction for noise robust speech recognition system, this method often makes loss of speech signal. In this study, we propose a method that variation and determination of Gaussian function at semi-continuous HMM(Hidden Markov Model) is made on the basis of SNR criteria function, in which SNR means signal to noise ratio between estimation noise and subtracted signal per frame. For proving effectiveness of this method, we show the estimation error to be related with the magnitude of estimated noise through signal waveform. For this reason, Gaussian function is varied and determined by SNR. When we test recognition rate by computer simulation under the noise environment of driving car over the speed of 80㎞/h, the proposed Gaussian decision method by SNR turns out to get more improved recognition rate compared with the frequency subtracted and non-subtracted cases.

  • PDF

A New Robust Signal Recognition Approach Based on Holder Cloud Features under Varying SNR Environment

  • Li, Jingchao
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.9 no.12
    • /
    • pp.4934-4949
    • /
    • 2015
  • The unstable characteristic values of communication signals along with the varying SNR (Signal Noise Ratio) environment make it difficult to identify the modulations of signals. Most of relevant literature revolves around signal recognition under stable SNR, and not applicable for signal recognition at varying SNR. To solve the problem, this research developed a novel communication signal recognition algorithm based on Holder coefficient and cloud theory. In this algorithm, the two-dimensional (2D) Holder coefficient characteristics of communication signals were firstly calculated, and then according to the distribution characteristics of Holder coefficient under varying SNR environment, the digital characteristics of cloud model such as expectation, entropy, and hyper entropy are calculated to constitute the three-dimensional (3D) digital cloud characteristics of Holder coefficient value, which aims to improve the recognition rate of the communication signals. Compared with traditional algorithms, the developed algorithm can describe the signals' features more accurately under varying SNR environment. The results from the numerical simulation show that the developed 3D feature extraction algorithm based on Holder coefficient cloud features performs better anti-noise ability, and the classifier based on interval gray relation theory can achieve a recognition rate up to 84.0%, even when the SNR varies from -17dB to -12dB.

Signal Enhancement of a Variable Rate Vocoder with a Hybrid domain SNR Estimator

  • Park, Hyung Woo
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.13 no.2
    • /
    • pp.962-977
    • /
    • 2019
  • The human voice is a convenient method of information transfer between different objects such as between men, men and machine, between machines. The development of information and communication technology, the voice has been able to transfer farther than before. The way to communicate, it is to convert the voice to another form, transmit it, and then reconvert it back to sound. In such a communication process, a vocoder is a method of converting and re-converting a voice and sound. The CELP (Code-Excited Linear Prediction) type vocoder, one of the voice codecs, is adapted as a standard codec since it provides high quality sound even though its transmission speed is relatively low. The EVRC (Enhanced Variable Rate CODEC) and QCELP (Qualcomm Code-Excited Linear Prediction), variable bit rate vocoders, are used for mobile phones in 3G environment. For the real-time implementation of a vocoder, the reduction of sound quality is a typical problem. To improve the sound quality, that is important to know the size and shape of noise. In the existing sound quality improvement method, the voice activated is detected or used, or statistical methods are used by the large mount of data. However, there is a disadvantage in that no noise can be detected, when there is a continuous signal or when a change in noise is large.This paper focused on finding a better way to decrease the reduction of sound quality in lower bit transmission environments. Based on simulation results, this study proposed a preprocessor application that estimates the SNR (Signal to Noise Ratio) using the spectral SNR estimation method. The SNR estimation method adopted the IMBE (Improved Multi-Band Excitation) instead of using the SNR, which is a continuous speech signal. Finally, this application improves the quality of the vocoder by enhancing sound quality adaptively.

Evaluation of Roadmap Image Quality by Parameter Change in Angiography (혈관조영검사에서 매개변수 변화에 따른 Roadmap 영상의 화질평가)

  • Kong, Chang gi;Song, Jong Nam;Han, Jae Bok
    • Journal of the Korean Society of Radiology
    • /
    • v.14 no.1
    • /
    • pp.53-60
    • /
    • 2020
  • The purpose of this study is to identify factors affecting picture quality in Roadmap images, which were studied by varying the dilution rate, collimation field and flow rate of contrast medium. For a quantitative evaluation of the quality of the picture, a 3mm vessel model Water Phantom was self-produced using acrylic, a roadmap image was acquired with a self-produced vascular model Water Phantom, and the SNR(Signal to Noise Ratio) and CNR (Contrast to Noise Ratio) were analyzed. CM:N/S In the study on the change of dilution rate, CM:N/S dilution rate changed to (100%~10%:100%), and the measurement of the roadmap image taken using the vascular model Water Phantom showed that the measurement value of SNR gradually decreased as the N/S dilution rate was increased, and the measurement of CNR was gradually reduced. It was confirmed that the higher the dilution rate of CM:N/S, the lower the SNR and CNR, and also significant image can be obtained at the dilution rate of CM:N/S (100%~70:30%). The study showed the value of SNR and CNR in Roadmap image was increased as the Collimation Field was narrowed to the center of the vascular phantom; the Collimation Field was narrowed to the center of the vessel model by 2cm intervals to 0cm through 12cm. To verify the relationship with Roadmap image and Flow Rate, volume of the autoinjector was kept constant at 15 and the flow rate was gradually increased 1, 2, 3, 4, 5, 6, 7, 8, 9, 10. The value of SNR and CNR of images taken by using water Phantom gradually decreased as the Flow Rate increased, but at Flow Rate 9 and 10, the SNR and CNR value was increase. It was not possible to confirm the relationship with SNR and CNR by ROI mean value and Background mean value. It is considered that further study is needed to evaluate the correlation about Roadmap image and Flow Rate. In conclusion, as the dilution rate of N/S in contrast medium was increased, the value of SNR and CNR was decreased. The narrower the Collimation Field, the higher image quality by increasing value of SNR and CNR. However, it is not confirmed the relationship Roadmap image and Flow Rate. It is considered that appropriate contrast medium concentration to minimize the effects of kidney and proper Collimation Field to improve contrast of image and reduce exposure X-ray during procedure is needed.

A Spectral Compensation Method for Noise Robust Speech Recognition (잡음에 강인한 음성인식을 위한 스펙트럼 보상 방법)

  • Cho, Jung-Ho
    • 전자공학회논문지 IE
    • /
    • v.49 no.2
    • /
    • pp.9-17
    • /
    • 2012
  • One of the problems on the application of the speech recognition system in the real world is the degradation of the performance by acoustical distortions. The most important source of acoustical distortion is the additive noise. This paper describes a spectral compensation technique based on a spectral peak enhancement scheme followed by an efficient noise subtraction scheme for noise robust speech recognition. The proposed methods emphasize the formant structure and compensate the spectral tilt of the speech spectrum while maintaining broad-bandwidth spectral components. The recognition experiments was conducted using noisy speech corrupted by white Gaussian noise, car noise, babble noise or subway noise. The new technique reduced the average error rate slightly under high SNR(Signal to Noise Ratio) environment, and significantly reduced the average error rate by 1/2 under low SNR(10 dB) environment when compared with the case of without spectral compensations.

Performance Improvement of Perceptual Filter Using Noise Energy Control (잡음 에너지 제어를 통한 지각 필터 성능 개선)

  • Seo Joung-Kook;Cha Hyung-Tai
    • The Journal of the Acoustical Society of Korea
    • /
    • v.24 no.1
    • /
    • pp.43-51
    • /
    • 2005
  • In this paper, we propose an algorithm that improves a tone quality of a noisy audio signal in order to enhance a Performance of perceptual filter using noise energy control. Most of the algorithms which were proposed by the other researchers usually applied a filter using the noise energy acquired from a silent range. In this case. the improvement rate of tone quality decreases if the noise energy is changed by the magnitude or environment variation in a signal frame. But the Proposed method Provides the means to find a food estimated noise through energy control of the estimated noise which is obtained from a silent range. Also we can get the enhancement of tone qualify in low frequency band unlike other methods. To show the performance of the Proposed algorithm, various input signals which had a different signal-to-noise ratio (SNR) such as 5dB, l0dB, 15dB and 20dB were used to test the proposed algorithm. With the proposed algorithm, we could confirm the enhancement of tone quality in terms of segmental SNR (SSNR). noise-to-mask ration (NMR) and mean opinion score (MOS) test.

Signal-to-Noise Ratio Formulas of a Scalar Gaussian Quantizer Mismatched to a Laplacian Source

  • Rhee, Ja-Gan;Na, Sang-Sin
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.36 no.6C
    • /
    • pp.384-390
    • /
    • 2011
  • The paper derives formulas for the mean-squared error distortion and resulting signal-to-noise (SNR) ratio of a fixed-rate scalar quantizer designed optimally in the minimum mean-squared error sense for a Gaussian density with the standard deviation ${\sigma}_q$ when it is mismatched to a Laplacian density with the standard deviation ${\sigma}_q$. The SNR formulas, based on the key parameter and Bennett's integral, are found accurate for a wide range of $p\({\equiv}\frac{\sigma_p}{\sigma_q}\){\geqq}0.25$. Also an upper bound to the SNR is derived, which becomes tighter with increasing rate R and indicates that the SNR behaves asymptotically as $\frac{20\sqrt{3{\ln}2}}{{\rho}{\ln}10}\;{\sqrt{R}}$ dB.

Perception of sentences varying with prosody pattern, sound intensity, and signal-to-noise ratio (운율 패턴, 강도, 신호대소음비에 따른 문장 지각 변화)

  • Chang, Son-A;Jang, Eunjoo;Jang, Jaejin
    • Phonetics and Speech Sciences
    • /
    • v.9 no.2
    • /
    • pp.119-124
    • /
    • 2017
  • This study investigates how perception of easy sentences varies with prosody pattern, sound intensity, and signal-to-noise ratio(SNR) in young adults with normal hearing who were in their 20's. The results showed that the presence of proper prosody pattern in the sentences increased correct perception rate of the target sentences, and that the lower the intensity and SNR, the lower the sentence perception scores. The results also showed that SNR had a greater effect on the sentence perception scores than sound intensity. There was a significant decrease of perception scores starting at the level of 15 dB and +3 SNR for the sentences with prosody pattern, while starting at the level of 18 dB and +6 SNR for the sentences without prosody pattern, ending up with a very poor perception score as sound intensity and SNR gets lower. There was a significant difference in the perception score of the sentences with prosody pattern between 20 year-old group and 21 year or older group in several listening conditions of sound intensity and SNR.

Error Rate Performance of DS-BPSK Signal transmitted through a Hard-Limiting Satellite Channel in the presence of Interference and Noise (간섭과 잡음이 존재하는 Hard-Limiting 위성채널상에서의 DS-BPSK신호의 오율특성)

  • 신동일;조성준
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.11 no.1
    • /
    • pp.64-72
    • /
    • 1986
  • The error rate equation fo DS-BPSK(Direct Sequence Binary Phase Shift Keying) signal transmitted through the nonlinear satellite transponder has been derived in the cochannel interference and downlink Gaussian noise environment. The input to the satellite transponder is the superposition of DS-BPSK signal with one interfere which is a cochannel wide-band PN signal. The error rate performance of DS-BPSK system has been evaluated and shown in figures in terms of carrier to interference power ratio(CIR), downlink signal to noise power ratio(downlink SNR) and process gain. In the analysis, it has been shown that the use of a hard limiter in DS-BPSK satellite system leads to the generation of narrow-band intermodulation products which is independent of the process gain. Also it is known that the error rate performance can be improved in the low levels (below 10dB) of CIR as the CIR increase. As the process gain varies from 10 to 100 the curve gives the about 10 dB gain in downlink SNR to maintain a fixed error rate.

  • PDF

Speech Recognition based on Environment Adaptation using SNR Mapping (SNR 매핑을 이용한 환경적응 기반 음성인식)

  • Chung, Yong-Joo
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.9 no.5
    • /
    • pp.543-548
    • /
    • 2014
  • Multiple-model based speech recognition framework (MMSR) has been known to be very successful in speech recognition. Since it uses multiple hidden Markov modes (HMMs) that corresponds to various noise types and signal-to-noise ratio (SNR) values, the selected acoustic model can have a close match with the test noisy speech. However, since the number of HMM sets is limited in practical use, the acoustic mismatch still remains as a problem. In this study, we experimentally determined the optimal SNR mapping between the test noisy speech and the HMM set to mitigate the mismatch between them. Improved performance was obtained by employing the SNR mapping instead of using the estimated SNR from the test noisy speech. When we applied the proposed method to the MMSR, the experimental results on the Aurora 2 database show that the relative word error rate reduction of 6.3% and 9.4% was achieved compared to a conventional MMSR and multi-condition training (MTR), respectively.