통합 검색 | Korea Science

음성의 특정 주파수 범위를 이용한 잡음환경에서의 감정인식 (Noise Robust Emotion Recognition Feature : Frequency Range of Meaningful Signal)

김은호;현경학;곽윤근
- 한국정밀공학회지
- /
- 제23권5호
- /
- pp.68-76
- /
- 2006
The ability to recognize human emotion is one of the hallmarks of human-robot interaction. Hence this paper describes the realization of emotion recognition. For emotion recognition from voice, we propose a new feature called frequency range of meaningful signal. With this feature, we reached average recognition rate of 76% in speaker-dependent. From the experimental results, we confirm the usefulness of the proposed feature. We also define the noise environment and conduct the noise-environment test. In contrast to other features, the proposed feature is robust in a noise-environment.
PDF KSCI

Harmonics-based Spectral Subtraction and Feature Vector Normalization for Robust Speech Recognition

Beh, Joung-Hoon;Lee, Heung-Kyu;Kwon, Oh-Il;Ko, Han-Seok
- 음성과학
- /
- 제11권1호
- /
- pp.7-20
- /
- 2004
In this paper, we propose a two-step noise compensation algorithm in feature extraction for achieving robust speech recognition. The proposed method frees us from requiring a priori information on noisy environments and is simple to implement. First, in frequency domain, the Harmonics-based Spectral Subtraction (HSS) is applied so that it reduces the additive background noise and makes the shape of harmonics in speech spectrum more pronounced. We then apply a judiciously weighted variance Feature Vector Normalization (FVN) to compensate for both the channel distortion and additive noise. The weighted variance FVN compensates for the variance mismatch in both the speech and the non-speech regions respectively. Representative performance evaluation using Aurora 2 database shows that the proposed method yields 27.18% relative improvement in accuracy under a multi-noise training task and 57.94% relative improvement under a clean training task.
PDF

잡음에 강인한 특징점 정합 기법 (Feature Matching Algorithm Robust To Noise)

정현조;유지상
- 한국방송∙미디어공학회:학술대회논문집
- /
- 한국방송공학회 2015년도 하계학술대회
- /
- pp.9-12
- /
- 2015
본 논문에서는 FAST(Features from Accelerated Segment Test) 특징점 검출기와 SURF 특징점 표현자(descriptor)를 수정하고 조합하여 영상의 왜곡에 강인하면서 정합을 수행할 수 있는 새로운 특징점 정합 기법을 제안한다. 스케일 공간을 생성하여 스케일 변화를 고려하고 잡음에 강인하기 위해 영상에서 특징점 후보군을 결정한다. 기존의 FAST는 에지 부분에서 특징점을 많이 검출하게 되는데 이러한 단점을 주곡률(principal curvatures)을 적용하여 개선하고자 한다. 또한 영상의 회전 변화에 강인하기 위해 SURF 특징점 표현자를 사용한다. 제안하는 정합 기법은 적은 계산량으로 기존의 특징점 정합 기법보다 우수한 성능을 나타낸다. 특별히 잡음이 존재하는 영상에서의 정합에 강인함을 보여준다.
PDF

음성구간검출을 위한 비정상성 잡음에 강인한 특징 추출 (Robust Feature Extraction for Voice Activity Detection in Nonstationary Noisy Environments)

홍정표;박상준;정상배;한민수
- 말소리와 음성과학
- /
- 제5권1호
- /
- pp.11-16
- /
- 2013
This paper proposes robust feature extraction for accurate voice activity detection (VAD). VAD is one of the principal modules for speech signal processing such as speech codec, speech enhancement, and speech recognition. Noisy environments contain nonstationary noises causing the accuracy of the VAD to drastically decline because the fluctuation of features in the noise intervals results in increased false alarm rates. In this paper, in order to improve the VAD performance, harmonic-weighted energy is proposed. This feature extraction method focuses on voiced speech intervals and weighted harmonic-to-noise ratios to determine the amount of the harmonicity to frame energy. For performance evaluation, the receiver operating characteristic curves and equal error rate are measured.
https://doi.org/10.13064/KSSS.2013.5.1.011 인용 PDF

On-Line Blind Channel Normalization for Noise-Robust Speech Recognition

Jung, Ho-Young
- IEIE Transactions on Smart Processing and Computing
- /
- 제1권3호
- /
- pp.143-151
- /
- 2012
A new data-driven method for the design of a blind modulation frequency filter that suppresses the slow-varying noise components is proposed. The proposed method is based on the temporal local decorrelation of the feature vector sequence, and is done on an utterance-by-utterance basis. Although the conventional modulation frequency filtering approaches the same form regardless of the task and environment conditions, the proposed method can provide an adaptive modulation frequency filter that outperforms conventional methods for each utterance. In addition, the method ultimately performs channel normalization in a feature domain with applications to log-spectral parameters. The performance was evaluated by speaker-independent isolated-word recognition experiments under additive noise environments. The proposed method achieved outstanding improvement for speech recognition in environments with significant noise and was also effective in a range of feature representations.
PDF

폐질환 진단을 위한 잡음환경에 강건한 폐음 패턴 분류법에 관한 연구 (A Study on Robust Pattern Classification of Lung Sounds for Diagnosis of Pulmonary Dysfunction in Noise Environment)

여송필;전창익;유세근;김덕영;김성환
- 대한전기학회논문지:시스템및제어부문D
- /
- 제51권3호
- /
- pp.122-128
- /
- 2002
In this paper, a robust pattern classification of breath sounds for the diagnosis of pulmonary dysfunction in noise environment is proposed. The feature parameter extraction method by highpass lifter algorithm and PM(projection measure) algorithm are used. 17 different groups of breath sounds are experimentally classified and investigated. The classification has been performed by 6 different types of combinations with proposed methods to evaluate the performances, such as ARC with EDM and LCC with EDM, WLCC with EDM, ARC with PM, LCC with PM, WLCC with PM. Furthermore, all feature parameters are extracted to 80th orders by 5th orders step, and all experiments are evaluated in increasing noise environments by degrees SNR 24dB to 0dB. As a results, WLCC which is derived from highpass lifter algorithm, is selected for the feature parameter extraction method. Pm is more robust than EDM in noisy environments to test and compare experimental results. WLCC with PM method(WLCC/PM) has a better performance in an increasing noise environment for diagnosis of pulmonary dysfunction.
PDF KSCI

Robust Watermarking Scheme Based on Radius Weight Mean and Feature-Embedding Technique

Yang, Ching-Yu
- ETRI Journal
- /
- 제35권3호
- /
- pp.512-522
- /
- 2013
In this paper, the radius weight mean (RWM) and the feature-embedding technique are used to present a novel watermarking scheme for color images. Simulations validate that the stego-images generated by the proposed scheme are robust against most common image-processing operations, such as compression, color quantization, bit truncation, noise addition, cropping, blurring, mosaicking, zigzagging, inversion, (edge) sharpening, and so on. The proposed method possesses outstanding performance in resisting high compression ratio attacks: JPEG2000 and JPEG. Further, to provide extra hiding storage, a steganographic method using the RWM with the least significant bit substitution technique is suggested. Experiment results indicate that the resulting perceived quality is desirable, whereas the peak signal-to-noise ratio is high. The payload generated using the proposed method is also superior to that generated by existing approaches.
https://doi.org/10.4218/etrij.13.0112.0480 인용 PDF KSCI

신뢰성 높은 서브밴드 특징벡터 선택을 이용한 잡음에 강인한 화자검증 (Noise Robust Speaker Verification Using Subband-Based Reliable Feature Selection)

김성탁;지미경;김회린
- 대한음성학회지:말소리
- /
- 제63호
- /
- pp.125-137
- /
- 2007
Recently, many techniques have been proposed to improve the noise robustness for speaker verification. In this paper, we consider the feature recombination technique in multi-band approach. In the conventional feature recombination for speaker verification, to compute the likelihoods of speaker models or universal background model, whole feature components are used. This computation method is not effective in a view point of multi-band approach. To deal with non-effectiveness of the conventional feature recombination technique, we introduce a subband likelihood computation, and propose a modified feature recombination using subband likelihoods. In decision step of speaker verification system in noise environments, a few very low likelihood scores of a speaker model or universal background model cause speaker verification system to make wrong decision. To overcome this problem, a reliable feature selection method is proposed. The low likelihood scores of unreliable feature are substituted by likelihood scores of the adaptive noise model. In here, this adaptive noise model is estimated by maximum a posteriori adaptation technique using noise features directly obtained from noisy test speech. The proposed method using subband-based reliable feature selection obtains better performance than conventional feature recombination system. The error reduction rate is more than 31 % compared with the feature recombination-based speaker verification system.
PDF

Noise-Robust Speaker Recognition Using Subband Likelihoods and Reliable-Feature Selection

Kim, Sung-Tak;Ji, Mi-Kyong;Kim, Hoi-Rin
- ETRI Journal
- /
- 제30권1호
- /
- pp.89-100
- /
- 2008
We consider the feature recombination technique in a multiband approach to speaker identification and verification. To overcome the ineffectiveness of conventional feature recombination in broadband noisy environments, we propose a new subband feature recombination which uses subband likelihoods and a subband reliable-feature selection technique with an adaptive noise model. In the decision step of speaker recognition, a few very low unreliable feature likelihood scores can cause a speaker recognition system to make an incorrect decision. To overcome this problem, reliable-feature selection adjusts the likelihood scores of an unreliable feature by comparison with those of an adaptive noise model, which is estimated by the maximum a posteriori adaptation technique using noise features directly obtained from noisy test speech. To evaluate the effectiveness of the proposed methods in noisy environments, we use the TIMIT database and the NTIMIT database, which is the corresponding telephone version of TIMIT database. The proposed subband feature recombination with subband reliable-feature selection achieves better performance than the conventional feature recombination system with reliable-feature selection.
PDF

강인한 음성인식을 위한 통계적 특징벡터 추출방법의 개선 (An Improvement of Stochastic Feature Extraction for Robust Speech Recognition)

김회린;고진석
- 한국음향학회지
- /
- 제23권2호
- /
- pp.180-186
- /
- 2004
음성 신호에 존재하는 잡음은 음성 인식기의 성능을 현저하게 감소시킨다. 이것은 잡음이 훈련 조건과 인식 조건 사이의 불일치를 가져오기 때문이다. 본 논문에서는 이러한 불일치를 최소화하기 위해서 통계적 특징벡터의 추출방법을 개선하기 위한 방법을 연구하였다. 밴드 SNR에 따라 잡음 스펙트럼의 차감 레벨을 조절하는 기존의 멀티 밴드 잡음 차감법 (MSS)을 개선하기 위하여 잡음 정규화 상수를 이용하여 잡음 스펙트럼의 차감 레벨을 보다 정확하게 조절하는 방법 (M-MSS)을 제시하였다. 다음으로, 기존의 통계적 특징벡터 추출방법 (SFE)에서 잡음 차감법을 파워 스펙트럼 영역에 적용함으로써 성능을 개선하였다(M-SFE). 마지막으로, 위의 두 가지 방법의 장점을 결합하기 위해서 밴드 SNR에 근거한 통계적 특징벡터 추출방법 (MMSS-MSFE)을 제안하였다. 제안된 방법들은 다양한 잡음 환경 하에서 화자독립 고립 단어 인식으로 성능을 평가하였다. 기본적인 잡음 차감법 (SS)에 비하여 M-MSS, M-SFE와 MMSS-MSFE의 평균 에러율은 각각 18.6%, 15.1%와 33.9% 감소하였다. 위의 결과로부터 제안한 방법이 잡음에 강인한 음성인식을 위해 매우 효과적임을 입증하였다.
PDF KSCI

검색결과 155건 처리시간 0.025초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)