Search | Korea Science

Classification of Pathological Voice Signal with Severe Noise Component

Li, Ta-O;Jo, Cheol-Woo
- Speech Sciences
- /
- v.10 no.4
- /
- pp.107-115
- /
- 2003
In this paper we tried to classify the pathological voice signal with severe noise component based on two different parameters, the spectral slope and the ratio of energies in the harmonic and noise components (HNR), The spectral slope is obtained by using a curve fitting method and the HNR is computed in cepstrum quefrency domain. Speech data from normal peoples and patients are collected, diagnosed and divided into three different classes (normal, relatively less noisy and severely noisy data), The mean values and the standard deviations of the spectral slope and the HNR are computed and compared with in the three kinds of data to characterize and classify the severely noisy pathological voice signals from others.
PDF

Robust Video-Based Barcode Recognition via Online Sequential Filtering

Kim, Minyoung
- International Journal of Fuzzy Logic and Intelligent Systems
- /
- v.14 no.1
- /
- pp.8-16
- /
- 2014
We consider the visual barcode recognition problem in a noisy video data setup. Unlike most existing single-frame recognizers that require considerable user effort to acquire clean, motionless and blur-free barcode signals, we eliminate such extra human efforts by proposing a robust video-based barcode recognition algorithm. We deal with a sequence of noisy blurred barcode image frames by posing it as an online filtering problem. In the proposed dynamic recognition model, at each frame we infer the blur level of the frame as well as the digit class label. In contrast to a frame-by-frame based approach with heuristic majority voting scheme, the class labels and frame-wise noise levels are propagated along the frame sequences in our model, and hence we exploit all cues from noisy frames that are potentially useful for predicting the barcode label in a probabilistically reasonable sense. We also suggest a visual barcode tracking approach that efficiently localizes barcode areas in video frames. The effectiveness of the proposed approaches is demonstrated empirically on both synthetic and real data setup.
https://doi.org/10.5391/IJFIS.2014.14.1.8 인용 PDF KSCI

Performance Comparison of Multiple-Model Speech Recognizer with Multi-Style Training Method Under Noisy Environments (잡음 환경하에서의 다 모델 기반인식기와 다 스타일 학습방법과의 성능비교)

Yoon, Jang-Hyuk;Chung, Young-Joo
- The Journal of the Acoustical Society of Korea
- /
- v.29 no.2E
- /
- pp.100-106
- /
- 2010
Multiple-model speech recognizer has been shown to be quite successful in noisy speech recognition. However, its performance has usually been tested using the general speech front-ends which do not incorporate any noise adaptive algorithms. For the accurate evaluation of the effectiveness of the multiple-model frame in noisy speech recognition, we used the state-of-the-art front-ends and compared its performance with the well-known multi-style training method. In addition, we improved the multiple-model speech recognizer by employing N-best reference HMMs for interpolation and using multiple SNR levels for training each of the reference HMM.
PDF KSCI

A study on the improvement of fuzzy ARTMAP for pattern recognition problems (Fuzzy ARTMAP 신경회로망의 패턴 인식율 개선에 관한 연구)

이재설;전종로;이충웅
- Journal of the Korean Institute of Telematics and Electronics B
- /
- v.33B no.9
- /
- pp.117-123
- /
- 1996
In this paper, we present a new learning method for the fuzzy ARTMAP which is effective for the noisy input patterns. Conventional fuzzy ARTMAP employs only fuzzy AND operation between input vector and weight vector in learning both top-down and bottom-up weight vectors. This fuzzy AND operation causes excessive update of the weight vector in the noisy input environment. As a result, the number of spurious categories are increased and the recognition ratio is reduced. To solve these problems, we propose a new method in updating the weight vectors: the top-down weight vectors of the fuzzy ART system are updated using weighted average of the input vector and the weight vector itself, and the bottom-up weight vectors are updated using fuzzy AND operation between the updated top-down weitht vector and bottom-up weight vector itself. The weighted average prevents the excessive update of the weight vectors and the fuzzy AND operation renders the learning fast and stble. Simulation results show that the proposed method reduces the generation of spurious categories and increases the recognition ratio in the noisy input environment.
PDF

A Robust Speech Recognition Method Combining the Model Compensation Method with the Speech Enhancement Algorithm (음질향상 기법과 모델보상 방식을 결합한 강인한 음성인식 방식)

Kim, Hee-Keun;Chung, Yong-Joo;Bae, Keun-Seung
- Speech Sciences
- /
- v.14 no.2
- /
- pp.115-126
- /
- 2007
There have been many research efforts to improve the performance of the speech recognizer in noisy conditions. Among them, the model compensation method and the speech enhancement approach have been used widely. In this paper, we propose to combine the two different approaches to further enhance the recognition rates in the noisy speech recognition. For the speech enhancement, the minimum mean square error-short time spectral amplitude (MMSE-STSA) has been adopted and the parallel model combination (PMC) and Jacobian adaptation (JA) have been used as the model compensation approaches. From the experimental results, we could find that the hybrid approach that applies the model compensation methods to the enhanced speech produce better results than just using only one of the two approaches.
PDF

Selective pole filtering based feature normalization for performance improvement of short utterance recognition in noisy environments (잡음 환경에서 짧은 발화 인식 성능 향상을 위한 선택적 극점 필터링 기반의 특징 정규화)

Choi, Bo Kyeong;Ban, Sung Min;Kim, Hyung Soon
- Phonetics and Speech Sciences
- /
- v.9 no.2
- /
- pp.103-110
- /
- 2017
The pole filtering concept has been successfully applied to cepstral feature normalization techniques for noise-robust speech recognition. In this paper, it is proposed to apply the pole filtering selectively only to the speech intervals, in order to further improve the recognition performance for short utterances in noisy environments. Experimental results on AURORA 2 task with clean-condition training show that the proposed selectively pole-filtered cepstral mean normalization (SPFCMN) and selectively pole-filtered cepstral mean and variance normalization (SPFCMVN) yield error rate reduction of 38.6% and 45.8%, respectively, compared to the baseline system.
https://doi.org/10.13064/KSSS.2017.9.2.103 인용 PDF KSCI

Statistical Tests for Edg Detection (에지 검출을 위한 통계적 검정법)

Im, Dong-Hun;Seong, Sin-Hui
- The Transactions of the Korea Information Processing Society
- /
- v.7 no.3
- /
- pp.1021-1024
- /
- 2000
In this paper we describe a nonparametric Wilcoxon test and a parametric Z test based on statistical hypothesis testing for the detection of edges. We use the threshold determined by specifying significance level $\alpha$, while Bovik, Huang and Munson[4] consider the range of possible values of test statistics for the threshold. From the experimental results of edge detection, the Z method performs sensitively to the noisy image, while the Wilcoxon method is robust over both noisy nd noise-free images. Comparison with our statistical tests and Sobel operator shows that our tests perform more effectively in both noisy and noise-free images.
PDF

Edge detection for noisy image (잡음 영상에서의 에지 검출)

Koo, Yun Mo;Kim, Young Ro
- Journal of Korea Society of Digital Industry and Information Management
- /
- v.8 no.3
- /
- pp.41-48
- /
- 2012
In this paper, we propose a method of edge detection for noisy image. The proposed method uses a progressive filter for noise reduction and a Sobel operator for edge detection. The progressive filter combines a median filter and a modified rational filter. The proposed method for noise reduction adjusts rational filter direction according to an edge in the image which is obtained by median filtering. Our method effectively attenuates the noise while preserving the image details. Edge detection is performed by a Sobel operator. This operator can be implemented by integer operation and is therefore relatively fast. Our proposed method not only preserves edge, but also reduces noise in uniform region. Thus, edge detection is well performed. Our proposed method could improve results using further developed Sobel operator. Experimental results show that our proposed method has better edge detection with correct positions than those by existing median and rational filtering methods for noisy image.
KSCI

Performance estimation of the noise reduction by window function on a single tone (단일 신호에 대한 창 함수의 잡음 제거 성능 평가)

Baek, Moon-Yeol;Kim, Byoung-Sam
- Journal of the Korean Society for Precision Engineering
- /
- v.13 no.5
- /
- pp.38-43
- /
- 1996
Windowing routines have as their purpose the reduction of the sidelobes of a spectral output of the FFT or DFT routines. Windowing routines accomplish this by forcing the beginning and end of any sequence to approach each other in value. Since they must work with any sequence they force the beginning and ending samples near zero. To make up for this reduction in power, windowing routines give extra weight to the values near the middle of the sequence. The difference between windows is the way in which they transition from the low weights near the edges to the higher weights neqr the middle of the sequence. Signal-to-noise ratio(SNR) can be determined by the ratio of the output noisy signal variance to the input noisy signal variance of a window. Standard deviation of noise is reduced by windowing. Thus, the windowing operation improved the SNR of the noisy signal. This paper shows a performance estimation of windowing on a single tone with added Gaussian noise and uniform noise.
PDF

Developing a Neural-Based Credit Evaluation System with Noisy Data (불량 데이타를 포함한 신경망 신용 평가 시스템의 개발)

Kim, Jeong-Won;Choi, Jong-Uk;Choi, Hong-Yun;Chuong, Yoon
- The Transactions of the Korea Information Processing Society
- /
- v.1 no.2
- /
- pp.225-236
- /
- 1994
Many research result conducted by neural network researchers claimed that the degree of generalization of the neural network system is higher or at least equal to that of statistical methods. However, those successful results could be brought only if the neural network was trained by appropriately sound data, having a little of noisy data and being large enough to control noisy data. Real data used in a lot of fields, especially business fields, were not so sound that the network have frequently failed to obtain satisfactory prediction accuracy, the degree of generalization. Enhancing the degree of generalization with noisy data is discussed in this study. The suggestion, which was obtained through a series of experiments, to enhance the degree of generalization is to remove inconsistent data by checking overlapping and inconsistencies. Furthermore, the previous conclusion by other reports is also confirmed that the learning mechanism of neural network takes average value of two inconsistent data included in training set［2］. The interim results of on-going research project are reported in this paper These are ann architecture of the neural network adopted in this project and the whole idea of developing on-line credit evaluation system,being intergration of the expert(resoning)system and the neural network(learning system.Another definite result is corroborated through this study that quickprop,being agopted as a learing algorithm, also has more speedy learning process than does back propagation even in very noisy environment.
PDF

Search Result 1,571, Processing Time 0.027 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)