Search | Korea Science

A Study of Cepstrum Normalization Using World Model for Robust Speaker Verification (강인한 화자 확인 시스템을 위한 World 모델을 이용한 켑스트럼 정규화 연구)

Kim Yu-Jin;Chung Jae-Ho
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.55-58
- /
- 2000
본 논문에서는 화자 확인 시스템의 등록과 확인 과정의 채널 환경 불일치로 성능이 저하되는 문제를 해결하기 위한 새로운 정규화 방법에 대해 설명한다. 제안된 방법은 첫째, 입력 음성으로부터 효과적으로 채널을 추정$\cdot$보상하고 둘째, 스코어 정규화 과정에서 사칭자 모델로서 사용되는 world모델과의 차이를 채널 추정 및 화자 모델 생성에 효과적으로 사용하는 것을 목표로 한다. 이를 위해 입력 음성의 켑스트럼과 HMM world 모델의 파라메터인 평균 켑스트럼과의 차이를 통해 음소열에 종속적인 채널 켑스트럼인 Phone-Dependent Difference Cepstrum을 추정한다. 한편 입력 음성의 음소열은 world모델의 스코어를 얻는 과정에서 함께 얻어질 수 있다. 채널 추정 실험 결과를 통해서 가장 일반적인 채널 정규화방법인 CMS에 의해 추정된 채널에 비해 실제 채널과 유사하며 화자 고유의 특성을 왜곡시키지 않는 채널 추정이 가능함을 확인할 수 있었다.
PDF

Linear Stability Analysis in a Gas Turbine Combustor Using Thermoacoustic Models (열음향 해석 모델을 통한 가스터빈 연소기에서의 선형 안정성 분석)

Kim, Daesik
- Journal of the Korean Society of Combustion
- /
- v.17 no.2
- /
- pp.17-23
- /
- 2012
In this study, thermoacoustic analysis model was developed in order to predict both eigenfrequencies and initial growth rate of combustion instabilities for lean premixed gas turbine combustors. As a first step, a model combustor and nozzle were selected and analytical linear equations for thermoacoustic waves were derived for a given combustion system. Then, methods showing how the equations can be used for analysis of the combustion instability were suggested. It was found that the prediction results showed a good agreement with the measurements. However, there were some limitation in growth rate predictions, which were related with over-simplification of flame structure, acoustic boundary conditions, and temperature distribution in the combustor.
PDF KSCI

Optimal design and analysis of a Class IV Flextensional Transducer (Class IV Flextensional 트랜스듀서의 최적설계 및 특성해석)

Kang Kuk-jin;Roh Yongrae
- Proceedings of the Acoustical Society of Korea Conference
- /
- autumn
- /
- pp.311-314
- /
- 1999
본 연구에서는 저주파 대역에서 고출력 수중 음향센서로 사용되는 Class IV Flextensional 트랜스듀서의 여러 설계변수들에 따른 음압 변화 및 열 발생 경향성을 유한요소 해석법으로 해석하였다. 나아가 해석되어진 결과를 바탕으로 최대 음압을 구현하고, 열 발생이 최소인 중심 주파수 1 kHz를 가지는 Class W Flextensional 트랜스듀서의 최적구조를 설정하였다. 본 연구에서 설정한 최적구조는 기본모델에 비해 음압이 2배 이상 크고, 열 발생은 아주 작은 것으로 나타났다. 본 연구의 결과는 향후 다양한 중심 주파수 및 최대 음압을 구현하고 열 발생이 최소인 Class W Flextensional 트랜스듀서를 설계함에 있어 유용한 자료로 활용될 수 있을 것이다.
PDF

Thermoacoustic Analysis Model for Combustion Instability Prediction - Part 2 : Nonlinear Instability Analysis (연소 불안정 예측을 위한 열음향 해석 모델 - Part 2 : 비선형 안정성 해석)

Kim, Daesik;Kim, Kyu Tae
- Journal of the Korean Society of Propulsion Engineers
- /
- v.16 no.6
- /
- pp.41-47
- /
- 2012
It is very important to predict the nonlinear behavior of combustion instability such as transition phenomena and limit cycle amplitude for fully understanding and controlling the instabilities. These nonlinear instability characteristics are highly dependent upon the flames' nonlinear dynamics in a gas turbine premixed combustor. In this study, nonlinear instability TA(Thermo-acoustic) models were introduced by applying the concept of flame describing function to the thermoacoustic analysis method. As a result of model development, for a given combustor length, the growth rate of instability was greatly affected by the change in amplitude, although the instability frequency was not. Further researches under various operating conditions and model validation on limit cycle amplitude are required.
https://doi.org/10.6108/KSPE.2012.16.6.041 인용 PDF KSCI

Definition and Evaluation of Korean Phone-Like Units using Hidden Markov Network (HM-Net을 이용한 한국어 유사음소 단위의 재 정의와 평가)

Lim Young-Chun;Oh Se-Jin;Jung Ho-Youl;Chung Hyun-Yeol
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.183-186
- /
- 2002
최근 음성인식의 인식 단위로서 문맥의존 음향 모델이 널리 사용되고 있다. 이는 음소의 음향학적 특징, 즉 선행 및 후행음소에 의한 중심 음소의 변이음 모델이 문맥독립 모델보다 좀 더 정확하게 모델링 될 수 있기 때문이다. 하지만 강건한 문맥의존 음향 모델을 작성하기 위해서는 모델 파라미터의 병합(tying)과 미지의 문맥(unseen context)의 처리를 위한 좀더 정교한 해결 방법이 필요하다. 따라서 본 논문에서는 이점을 고려하여 음향학적 특징과 언어학적 특징을 결합하여 상태 분할을 수행할 수 있도록 SSS(Successive State Splitting) 알고리즘의 문맥 방향 상태 분할에 음소결정트리를 접목한 HM-Net(Hidden Markov Network) 구조 결정법을 도입하였다. 또한 HM-Net은 연속적인 상태 분할에 의해 한국어에서 많이 발생하는 변이음들을 효과적으로 모델링 할 수 있다는 점을 고려하여 본 연구실에서 기존에 사용하던 48 유사음소 단위에서 문맥의존 음향 모델 작성에 불필요한 변이음을 제거하여 39 유사음소 단위를 재 정의하였다. 도입한 방법과 새로 정의한 유사음소 단위의 유효성을 확인하기 위해 고립 단어, 4연속 숫자음, 연속 음성인식에 대해 인식 실험을 수행한 결과, 모든 실험에서 재 정의한 39 유사음소 단위가 문맥종속형 HM-Net 음향모델을 이용한 한국어 음성인식에 효과적임을 확인할 수 있었다. 특히 연속 음성인식 실험의 경우, 기존의 48 유사음소 단위보다 평균 $15.08\%$의 인식률 향상이 있었다.
PDF

Two-dimensional Localization of Array Elements Placed on a Sea Floor Using M-sequence Signal in Multipath Ocean Environment (M-계열 송신 신호를 이용한 다중 경로 해양 환경에서의 해저면 설치 선배열 센서의 2차원 위치 추정)

오택환;나정열;석동우
- The Journal of the Acoustical Society of Korea
- /
- v.21 no.8
- /
- pp.686-694
- /
- 2002
This paper proposes an algorithm for estimating positions of array elements placed on a sea floor using acoustic signal in multipath ocean environment. The positions of array elements are estimated by using the travel times of m-sequence signal influenced by the multi-paths environment. The horizontal distance between source and receiver calculated based on the ray model. The proposed paper the algorithm is verified by both simulation data and field experiment in the Bast Sea.
PDF KSCI

Improving transformer-based acoustic model performance using sequence discriminative training (Sequence dicriminative training 기법을 사용한 트랜스포머 기반 음향 모델 성능 향상)

Lee, Chae-Won;Chang, Joon-Hyuk
- The Journal of the Acoustical Society of Korea
- /
- v.41 no.3
- /
- pp.335-341
- /
- 2022
In this paper, we adopt a transformer that shows remarkable performance in natural language processing as an acoustic model of hybrid speech recognition. The transformer acoustic model uses attention structures to process sequential data and shows high performance with low computational cost. This paper proposes a method to improve the performance of transformer AM by applying each of the four algorithms of sequence discriminative training, a weighted finite-state transducer (wFST)-based learning used in the existing DNN-HMM model. In addition, compared to the Cross Entropy (CE) learning method, sequence discriminative method shows 5 % of the relative Word Error Rate (WER).
https://doi.org/10.7776/ASK.2022.41.3.335 인용 PDF KSCI

A Study On Continuous Digits Recognition Using the Neural Network (신경망을 이용한 연속 숫자음 인식에 관한 연구)

이성권;김순협
- The Journal of the Acoustical Society of Korea
- /
- v.17 no.4
- /
- pp.3-13
- /
- 1998
본 논문은 음성 다이어링 시스템을 구현하기 위한 한국어 단독 숫자음 및 연속 숫 자음 인식에 관한 것이다. 단독 숫자음의 인식은 미지의 입력 음성을 재귀 신경망을 이용하 여 모델링된 각 모델에 인가하고, 신경 회로망의 출력 노드의 상태열을 검사하여 적절한 상 태 전이를 하며 최고의 확률값을 출력하는 모델을 인식된 결과로 출력한다. 연속 숫자음의 인식은 미지의 연속 숫자음을 재귀 신경 회로망을 이용한 연속 숫자음 모델에 입력하고, 신 경 회로망의 출력에 대하여 적절한 상태 전이에 대한 검사와 레벨 빌딩(Level Building)을 수행하여 최소의 오차를 가지는 모델열을 인식된 결과로 출력한다. 재귀 신경 회로망을 이 용하여 음절 모델을 만드는 과정에서 재귀 노드는 예상치가 주어지지 않으므로 신경 회로망 의 학습에서 제외되어 현저한 학습 속도의 저하를 가져온다. 따라서 본 논문에서는 재귀 신 경 회로망의 학습 속도를 향상시키기 위한 2가지 방법을 제안 한다. 첫 번째는 재귀 신경 회로망의 재귀 노드의 예상치를 실험적으로 주어줌으로써 학습 속도의 향상을 도모하였다. 두 번째는 음절 모델의 출력노드의 개수와 음절 모델의 세그먼트 경계를 알고리듬을 이용하 여 자동적으로 조절하였다. 실험결과, 단독어의 경우 음절 '에'에 포함하는 한국어 11개의 숫 자음에 대하여 화자 종속의 경우 97.3%, 화자 독립의 경우 80.5%의 인식률을 얻었으며, 연 속 숫자음의 경우는 21종류의 연속 숫자음에 대하여 화자 종속에서 88.2%, 화자 독립의 경 우 81.3%의 인식률을 얻을 수 있었다.
PDF

The Application of an HMM-based Clustering Method to Speaker Independent Word Recognition (HMM을 기본으로한 집단화 방법의 불특정화자 단어 인식에 응용)

Lim, H.;Park, S.-Y.;Park, M.-W.
- The Journal of the Acoustical Society of Korea
- /
- v.14 no.5
- /
- pp.5-10
- /
- 1995
In this paper we present a clustering procedure based on the use of HMM in order to get multiple statistical models which can well absorb the variants of each speaker with different ways of saying words. The HMM-clustered models obtained from the developed technique are applied to the speaker independent isolated word recognition. The HMM clustering method splits off all observation sequences with poor likelihood scores which fall below threshold from the training set and create a new model out of the observation sequences in the new cluster. Clustering is iterated by classifying each observation sequence as belonging to the cluster whose model has the maximum likelihood score. If any clutter has changed from the previous iteration the model in that cluster is reestimated by using the Baum-Welch reestimation procedure. Therefore, this method is more efficient than the conventional template-based clustering technique due to the integration capability of the clustering procedure and the parameter estimation. Experimental data show that the HMM-based clustering procedure leads to $1.43\%$ performance improvements over the conventional template-based clustering method and $2.08\%$ improvements over the single HMM method for the case of recognition of the isolated korean digits.
PDF

Acoustic Field Analysis of a Combustor-nozzle System with a Premixing Chamber (예혼합실을 갖는 연소-노즐 시스템의 음향장 해석)

Yoon, Myunggon;Kim, Jina;Kim, Daesik
- Journal of the Korean Society of Propulsion Engineers
- /
- v.21 no.5
- /
- pp.46-53
- /
- 2017
This paper deals with an acoustic model for a lean premixed gas turbine combustor composed of three stages: premixing chamber, nozzle and flame tube. Our model is given as an acoustic transfer function whose input is a heat release rate perturbation and output is a velocity perturbation at a flame location. We have shown that the resonance frequencies are functions of three round-trip frequencies of acoustic wave in each stage, and area ratios between stages. By analyzing poles of the acoustic transfer function, we could characterize resonant frequencies and their dependency on various system parameters of a combustor. It was found that our analytic findings match with existing numerical and experimental results in literature.
https://doi.org/10.6108/KSPE.2017.21.5.046 인용 PDF KSCI

Search Result 110, Processing Time 0.02 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)