통합 검색 | Korea Science

파워 스펙트럼 warping을 이용한 성도 정규화 (Vocal Tract Normalization Using The Power Spectrum Warping)

유일수;김동주;노용완;홍광석
- 대한전기학회:학술대회논문집
- /
- 대한전기학회 2003년도 학술회의 논문집 정보 및 제어부문 A
- /
- pp.215-218
- /
- 2003
The method of vocal tract normalization has been known as a successful method for improving the accuracy of speech recognition. A frequency warping procedure based low complexity and maximum likelihood has been generally applied for vocal tract normalization. In this paper, we propose a new power spectrum warping procedure that can be improve on vocal tract normalization performance than a frequency warping procedure. A mechanism for implementing this method can be simply achieved by modifying the power spectrum of filter bank in Mel-frequency cepstrum feature(MFCC) analysis. Experimental study compared our Proposal method with the well-known frequency warping method. The results have shown that the power spectrum warping is better 50% about the recognition performance than the frequency warping.
PDF

화자 정규화를 위한 새로운 파워 스펙트럼 Warping 방법 (A New Power Spectrum Warping Approach to Speaker Warping)

유일수;김동주;노용완;홍광석
- 대한전자공학회논문지SP
- /
- 제41권4호
- /
- pp.103-111
- /
- 2004
화자 정규화 방법은 화자 독립 음성인식 시스템에서 음성 인식의 정확성을 높이기 위한 성공적인 방법으로 알려져 왔다. 널리 사용되는 화자 정규화 방법은 maximum likelihood 반의 주파수 warping 방법이다. 본 논문은 주파수 warping 보다 더 좋은 화자 정규화의 성능 개선을 위해 새로운 파워 스펙트럼 warping 방법을 제안한다. 파워 스펙트럼 warping은 멜 주파수 켑스트럼 분석(MFCC) 방법을 이용하며, MFCC 처리 단계에서 필터 뱅크의 파워 스펙트럼을 조절함으로써 화자 정규화를 수행하는 간단한 메커니즘으로 갖는다. 또한 본 논문은 파워 스펙트럼 warping과 주파수 warping 방법을 서로 결합한 hybrid VTN 방법을 제안한다. 본 논문의 실험은 baseline 시스템에 각 화자 정규화 방법을 적용하여 SKKU PBW DB에서 인식 성능을 비교 분석하였다. 실험 결과를 보면 baseline 시스템의 단어 인식 성능을 기준으로 주파수 warping은 2.06%, 파워 스펙트럼 warping은 3.05%, 그리고 hybrid VTN은 4.07%의 단어 에러 율의 감소를 보였다.
PDF KSCI

TLC/HPTLC에서 측정된 자외/가시부 스펙트럼의 표준화 및 검색 (Normalization and Search of the UV/VIS Spectra Measured from TLC/HPTLC)

강종성
- 약학회지
- /
- 제38권4호
- /
- pp.366-371
- /
- 1994
To improve the identification power of TLC/HPTLC the in situ reflectance spectra obtained directly from plates with commercial scanner are used. The spectrum normalization should be carried out prior to comparing and searching the spectra from library for the identification of compounds. Because the reflectance does not obey the Lambert-Beer's law, there arise some problems in normalization. These problems could be solved to some extent by normalizing the spectra with regression methods. The spectra are manipulated with the regression function of a curve obtained from the correlation plot. When the parabola was used as the manipulating function, the spectra were identified with the accuracy of 97% and this result was better than that of conventionally used the point and area normalization method.
PDF

BOES 관측데이터의 자동처리 프로그램 개발 II (DEVELOPMENT OF AN AUTOMATIC PROCESSING PROGRAM FOR BOES DATA II)

강동일;박홍서;한인우;;이병철;김강민
- 천문학논총
- /
- 제21권2호
- /
- pp.101-112
- /
- 2006
We developed a new program for automatic continuum normalization of Echelle spectrographic data. Using this algorithm, we have determined spectral continuum of almost BOES data. The first advantage of this algorithm is that we can save much time for continuum determination and normalization. The second advantage is that the result of this algorithm is very reliable for almost spectral type of spectrum. But this algorithm cannot be applied directly to the spectrum which has very strong and broad emission lines, for example Wolf-Rayet type spectrum. We implanted this algorithm to the program which was developed in the previous study. And we introduced more upgraded BOES data reduction program. This program has more convenient graphical user interface environment, so users can easily reduce BOES data. Lastly, we presented the result of study on line profile variation of magnetic Ap/Bp stars analyzed using this program.
https://doi.org/10.5303/PKAS.2006.21.2.101 인용 PDF KSCI

선스펙트럼 쌍의 복원에 의한 잡음억제 기법 (Noise Suppression Method for Restoring Line Spectrum Pair)

최재승
- 대한전자공학회논문지SP
- /
- 제47권4호
- /
- pp.112-118
- /
- 2010
본 논문에서는 시간지연신경회로망과 주파수영역의 파라미터를 가지는 선스펙트럼 쌍을 사용하여 정규화 방법에 기초한 잡음억제 시스템을 제안한다. 먼저, 시간지연신경회로망은 선형예측분석하여 구해진 잡음이 부가된 음성신호의 선스펙트럼 쌍을 학습시킨다. 그리고 제안한 시스템은 시간지연신경회로망을 학습시킴으로써 배경잡음에 의하여 열화된 잡음이 부가된 음성신호를 강조한다. 따라서 제안한 시간지연신경회로망은 학습에 의하여 잡음이 부가된 선스펙트럼 쌍의 값을 잡음이 부가되기 전의 값으로 복원하여 잡음을 억제한다. 제안한 시스템은 스펙트럼 왜곡율의 평가법을 사용하여 배경잡음에 의하여 열화된 음성 신호에 대하여 효과적인 것을 확인한다.
PDF KSCI

NORMALIZATION OF THE HAMILTONIAN AND THE ACTION SPECTRUM

OH YONG-GEUN
- 대한수학회지
- /
- 제42권1호
- /
- pp.65-83
- /
- 2005
In this paper, we prove that the two well-known natural normalizations of Hamiltonian functions on the symplectic manifold ($M,\;{\omega}$) canonically relate the action spectra of different normalized Hamiltonians on arbitrary symplectic manifolds ($M,\;{\omega}$). The natural classes of normalized Hamiltonians consist of those whose mean value is zero for the closed manifold, and those which are compactly supported in IntM for the open manifold. We also study the effect of the action spectrum under the ${\pi}_1$ of Hamiltonian diffeomorphism group. This forms a foundational basis for our study of spectral invariants of the Hamiltonian diffeomorphism in [8].
https://doi.org/10.4134/JKMS.2005.42.1.065 인용 PDF KSCI

부분 손상과 기하학적 공격에 강인한 워터마킹 방법 (A Robust Watermarking Method against Partial Damage and Geometric Attack)

김학수
- 한국멀티미디어학회논문지
- /
- 제15권9호
- /
- pp.1102-1111
- /
- 2012
본 논문에서는 워터마크된 영상이 일부 손상되었다 할지라도 기하학적 공격에 강인한 다중 비트 워터마킹 방법을 제안한다. 이 방법은 임의의 영상을 기 정의된 표준 영상으로 변형하는 표준 영상 정규화 과정과 표준 정규화 영상의 DCT 영역에 대역 확산(spread spectrum) 기법을 이용하여 워터마크를 삽입하는 과정으로 구성되어있다. 제안한 표준 영상 정규화 방법은 기존의 영상 정규화 방법을 개선한 것으로써 부분 손상과 임의 기하학적 공격에 강인한 특성을 가지고 있으며, 대역 확산 기법을 이용한 워터마크 삽입과정은 블러링, 샤프닝, 압축 등과 같은 영상 손실에 강인한 특성을 가지고 있다. 또한 제안한 워터마킹 방법은 워터마크 검출을 위해 원영상이 필요하지 않기 때문에 공개 워터마킹(public watermarking) 응용에 적합하다. 몇 가지 실험을 통해 제안한 워터마킹 방법이 부분 손상 및 기하학적 변형을 포함한 여러 가지 공격에 강인하다는 것을 보여준다.
https://doi.org/10.9717/kmms.2012.15.9.1102 인용 PDF KSCI

A Frequency-Domain Normalized MBD Algorithm with Unidirectional Filters for Blind Speech Separation

Kim Hye-Jin;Nam Seung-Hyon
- The Journal of the Acoustical Society of Korea
- /
- 제24권2E호
- /
- pp.54-60
- /
- 2005
A new multichannel blind deconvolution algorithm is proposed for speech mixtures. It employs unidirectional filters and normalization of gradient terms in the frequency domain. The proposed algorithm is shown to be approximately nonholonomic. Thus it provides improved convergence and separation performances without whitening effect for nonstationary sources such as speech and audio signals. Simulations using real world recordings confirm superior performances over existing algorithms and its usefulness for real applications.
PDF KSCI

한국어 단독 숫자음 인식을 위한 DTW 알고리즘의 비교 (Comparison of the Dynamic Time Warping Algorithm for Spoken Korean Isolated Digits Recognition)

홍진우;김순협
- 한국음향학회지
- /
- 제3권1호
- /
- pp.25-35
- /
- 1984
This paper analysis the Dynamic Time Warping algorithms for time normalization of speech pattern and discusses the Dynamic Programming algorithm for spoken Korean isolated digits recognition. In the DP matching, feature vectors of the reference and test pattern are consisted of first three formant frequencies extracted by power spectrum density estimation algorithm of the ARMA model. The major differences in the various DTW algorithms include the global path constrains, the local continuity constraints on the path, and the distance weighting/normalization used to give the overall minimum distance. The performance criterias to evaluate these DP algorithms are memory requirement, speed of implementation, and recognition accuracy.
PDF

Harmonics-based Spectral Subtraction and Feature Vector Normalization for Robust Speech Recognition

Beh, Joung-Hoon;Lee, Heung-Kyu;Kwon, Oh-Il;Ko, Han-Seok
- 음성과학
- /
- 제11권1호
- /
- pp.7-20
- /
- 2004
In this paper, we propose a two-step noise compensation algorithm in feature extraction for achieving robust speech recognition. The proposed method frees us from requiring a priori information on noisy environments and is simple to implement. First, in frequency domain, the Harmonics-based Spectral Subtraction (HSS) is applied so that it reduces the additive background noise and makes the shape of harmonics in speech spectrum more pronounced. We then apply a judiciously weighted variance Feature Vector Normalization (FVN) to compensate for both the channel distortion and additive noise. The weighted variance FVN compensates for the variance mismatch in both the speech and the non-speech regions respectively. Representative performance evaluation using Aurora 2 database shows that the proposed method yields 27.18% relative improvement in accuracy under a multi-noise training task and 57.94% relative improvement under a clean training task.
PDF

검색결과 55건 처리시간 0.034초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)