• Title/Summary/Keyword: 잡음에 대한 강인함

Search Result 230, Processing Time 0.035 seconds

Feature Compensation Method Based on Parallel Combined Mixture Model (병렬 결합된 혼합 모델 기반의 특징 보상 기술)

  • 김우일;이흥규;권오일;고한석
    • The Journal of the Acoustical Society of Korea
    • /
    • v.22 no.7
    • /
    • pp.603-611
    • /
    • 2003
  • This paper proposes an effective feature compensation scheme based on speech model for achieving robust speech recognition. Conventional model-based method requires off-line training with noisy speech database and is not suitable for online adaptation. In the proposed scheme, we can relax the off-line training with noisy speech database by employing the parallel model combination technique for estimation of correction factors. Applying the model combination process over to the mixture model alone as opposed to entire HMM makes the online model combination possible. Exploiting the availability of noise model from off-line sources, we accomplish the online adaptation via MAP (Maximum A Posteriori) estimation. In addition, the online channel estimation procedure is induced within the proposed framework. For more efficient implementation, we propose a selective model combination which leads to reduction or the computational complexities. The representative experimental results indicate that the suggested algorithm is effective in realizing robust speech recognition under the combined adverse conditions of additive background noise and channel distortion.

Implementation of Low Noise Current Sensor using Low Pass FIR Filter (저역통과 FIR필터를 이용한 저잡음 전류 센서 구현)

  • Kim, Jeong-Hwan;Lee, Seong-Jin;Choi, Yong-geon;Han, Seong-Gye;Kwon, Se-Ik;Kim, Nam-Ho
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2017.05a
    • /
    • pp.499-502
    • /
    • 2017
  • The needs of efficient electricity use and current measurement for electrical safety have been increased. Hence, the current sensor is used, especially non-contact current sensor which can measure the current without blocking the circuit using hall effect. However, the accurate measuring of the current sensor is inhibited due to the inflow of various noises in this current sensor. In this article, a stronger current sensor against the noise is proposed using low pass FIR filter to the existing current sensor. FIR filter was designed to block the range over the certain frequency at the output of the current sensor to eliminate the external noises, and so on. As a result, more accurate and close measurements were possible.

  • PDF

A Study on Robust Speech Emotion Feature Extraction Under the Mobile Communication Environment (이동통신 환경에서 강인한 음성 감성특징 추출에 대한 연구)

  • Cho Youn-Ho;Park Kyu-Sik
    • The Journal of the Acoustical Society of Korea
    • /
    • v.25 no.6
    • /
    • pp.269-276
    • /
    • 2006
  • In this paper, we propose an emotion recognition system that can discriminate human emotional state into neutral or anger from the speech captured by a cellular-phone in real time. In general. the speech through the mobile network contains environment noise and network noise, thus it can causes serious System performance degradation due to the distortion in emotional features of the query speech. In order to minimize the effect of these noise and so improve the system performance, we adopt a simple MA (Moving Average) filter which has relatively simple structure and low computational complexity, to alleviate the distortion in the emotional feature vector. Then a SFS (Sequential Forward Selection) feature optimization method is implemented to further improve and stabilize the system performance. Two pattern recognition method such as k-NN and SVM is compared for emotional state classification. The experimental results indicate that the proposed method provides very stable and successful emotional classification performance such as 86.5%. so that it will be very useful in application areas such as customer call-center.

Robust Speech Reinforcement Based on Gain-Modification incorporating Speech Absence Probability (음성 부재 확률을 이용한 음성 강화 이득 수정 기법)

  • Choi, Jae-Hun;Chang, Joon-Hyuk
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.47 no.1
    • /
    • pp.175-182
    • /
    • 2010
  • In this paper, we propose a robust speech reinforcement technique to enhance the intelligibility of the degraded speech signal under the ambient noise environments based on soft decision scheme incorporating a speech absence probability (SAP) with speech reinforcement gains. Since the ambient noise significantly decreases the intelligibility of the speech signal, the speech reinforcement approach to amplify the estimated clean speech signal from the background noise environments for improving the intelligibility and clarity of the corrupted speech signal was proposed. In order to estimate the robust reinforcement gain rather than the conventional speech reinforcement method between speech active periods and nonspeech periods or transient intervals, we propose the speech reinforcement algorithm based on soft decision applying the SAP to the estimation of speech reinforcement gains. The performances of the proposed algorithm are evaluated by the Comparison Category Rating (CCR) of the measurement for subjective determination of transmission quality in ITU-T P.800 under various ambient noise environments and show better performances compared with the conventional method.

Analysis of Geometrical Relations of 2D Affine-Projection Images and Its 3D Shape Reconstruction (정사투영된 2차원 영상과 복원된 3차원 형상의 기하학적 관계 분석)

  • Koh, Sung-Shik;Zin, Thi Thi;Hama, Hiromitsu
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.44 no.4 s.316
    • /
    • pp.1-7
    • /
    • 2007
  • In this paper, we analyze geometrical relations of 3D shape reconstruction from 2D images taken under anne projection. The purpose of this research is to contribute to more accurate 3-D reconstruction under noise distribution by analyzing geometrically the 2D to 3D relationship. In situation for no missing feature points (FPs) or no noise in 2D image plane, the accurate solution of 3D shape reconstruction is blown to be provided by Singular Yalue Decomposition (SVD) factorization. However, if several FPs not been observed because of object occlusion and image low resolution, and so on, there is no simple solution. Moreover, the 3D shape reconstructed from noise-distributed FPs is peturbed because of the influence of the noise. This paper focuses on analysis of geometrical properties which can interpret the missing FPs even though the noise is distributed on other FPs.

Speech Recognition Using Noise Robust Features and Spectral Subtraction (잡음에 강한 특징 벡터 및 스펙트럼 차감법을 이용한 음성 인식)

  • Shin, Won-Ho;Yang, Tae-Young;Kim, Weon-Goo;Youn, Dae-Hee;Seo, Young-Joo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.15 no.5
    • /
    • pp.38-43
    • /
    • 1996
  • This paper compares the recognition performances of feature vectors known to be robust to the environmental noise. And, the speech subtraction technique is combined with the noise robust feature to get more performance enhancement. The experiments using SMC(Short time Modified Coherence) analysis, root cepstral analysis, LDA(Linear Discriminant Analysis), PLP(Perceptual Linear Prediction), RASTA(RelAtive SpecTrAl) processing are carried out. An isolated word recognition system is composed using semi-continuous HMM. Noisy environment experiments usign two types of noises:exhibition hall, computer room are carried out at 0, 10, 20dB SNRs. The experimental result shows that SMC and root based mel cepstrum(root_mel cepstrum) show 9.86% and 12.68% recognition enhancement at 10dB in compare to the LPCC(Linear Prediction Cepstral Coefficient). And when combined with spectral subtraction, mel cepstrum and root_mel cepstrum show 16.7% and 8.4% enhanced recognition rate of 94.91% and 94.28% at 10dB.

  • PDF

Sensor Fault-tolerant Controller Design on Gas Turbine Engine using Multiple Engine Models (다중 엔진모델을 이용한 센서 고장허용 가스터빈 엔진제어기 설계)

  • Kim, Jung Hoe;Lee, Sang Jeong
    • Journal of the Korean Society of Propulsion Engineers
    • /
    • v.20 no.2
    • /
    • pp.56-66
    • /
    • 2016
  • Robustness is essential for model based FDI (Fault Detection and Isolation) and it is inevitable to have modeling errors and sensor signal noises during the process of FDI. This study suggests an improved method by applying NARX (Nonlinear Auto Regressive eXogenous) model and Kalman estimator in order to cope with problems caused by linear model errors and sensor signal noises in the process of fault diagnoses. Fault decision is made by the probability of the trend of gradually accumulated errors applying Fuzzy logic, which are robust to instantaneous sensor signal noises. Reliability of fault diagnosis is verified under various fault simulations.

A Study on Content-based Image Retrieval Technique using Texture Information (영상의 텍스쳐 정보를 이용한 내용 기반 영상 검색에 관한 연구)

  • Park, Kyung-Shik;Park, Kang-Seo;Hong, Min-Suk;Chung, Tae-Yun;Park, Sang-Hui
    • Proceedings of the KIEE Conference
    • /
    • 1999.11c
    • /
    • pp.751-753
    • /
    • 1999
  • 본 논문에서는 영상의 텍스쳐 정보를 이용하여 일반 영상에 대한 내용기반 영상 검색을 수행할 수 있는 알고리듬을 제안한다. Gabor 웨이브렛 변환을 이용하여 Gabor 필터 뱅크 내의 각 필터에 의해 필터링된 대역의 평균과 표준편차를 영상의 특징 벡터(Gabor Texture Feature)로 추출하여 영상들간의 유사성을 계산하는데 사용한다. 논문의 목적이 영상에 가해진 외적 변형, 즉 잡음 첨가, 블러링, 샤프닝 등과 같은 변형에 강인하게 동작할 수 있는 텍스쳐 특징 기반 영상 검색 기법을 제안하는 것이므로, 기존의 Gabor 필터만을 사용하여 텍스쳐 특징을 추출하여 검색의 기준으로 삼을 경우에 발생할 수 있는 주파수 성분의 변화에 대한 민감성을 Daubechies의 웨이브렛 필터를 사용하여 낮은 해상도에서 영상을 해석함으로써, 외적 변형에 대하여도 강인하게 동작할 수 있는 알고리듬을 제시하였다. 기존의 텍스쳐를 이용한 검색이 주로 텍스쳐 영역(textured region)에 대한 해석만을 하였지만, 본 논문에서는 이를 일반 영상에 적용하였으며, 일반 영상에 대해서도 효율적인 검색을 수행할 수 있음을 보였다.

  • PDF

Parameter Considering Variance Property for Speech Recognition in Noisy Environment (잡음환경에서의 음성인식을 위한 변이특성을 고려한 파라메터)

  • Park, Jin-Young;Lee, Kwang-Seok;Koh, Si-Young;Hur, Kang-In
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • v.9 no.2
    • /
    • pp.469-472
    • /
    • 2005
  • This paper propose about effective speech feature parameter that have robust character in effect of noise in realizing speech recognition system. Established MFCC that is the basic parameter used to ASR(Automatic Speech Recognition) and DCTCs that use DCT in basic parameter. Also, proposed delta-Cepstrum and delta-delta-Cepstrum parameter that reconstruct Cepstrum to have information for variation of speech. And compared recognition performance in using HMM. For dimension reduction of each parameter LDA algorithm apply and compared recognition. Results are presented reduced dimension delta-delta-Cepstrum parameter in using LDA recognition performance that improve more than existent parameter in noise environment of various condition.

  • PDF

Estimation and Weighting of Sub-band Reliability for Multi-band Speech Recognition (다중대역 음성인식을 위한 부대역 신뢰도의 추정 및 가중)

  • 조훈영;지상문;오영환
    • The Journal of the Acoustical Society of Korea
    • /
    • v.21 no.6
    • /
    • pp.552-558
    • /
    • 2002
  • Recently, based on the human speech recognition (HSR) model of Fletcher, the multi-band speech recognition has been intensively studied by many researchers. As a new automatic speech recognition (ASR) technique, the multi-band speech recognition splits the frequency domain into several sub-bands and recognizes each sub-band independently. The likelihood scores of sub-bands are weighted according to reliabilities of sub-bands and re-combined to make a final decision. This approach is known to be robust under noisy environments. When the noise is stationary a sub-band SNR can be estimated using the noise information in non-speech interval. However, if the noise is non-stationary it is not feasible to obtain the sub-band SNR. This paper proposes the inverse sub-band distance (ISD) weighting, where a distance of each sub-band is calculated by a stochastic matching of input feature vectors and hidden Markov models. The inverse distance is used as a sub-band weight. Experiments on 1500∼1800㎐ band-limited white noise and classical guitar sound revealed that the proposed method could represent the sub-band reliability effectively and improve the performance under both stationary and non-stationary band-limited noise environments.