Search | Korea Science

The Improvement Performance of Speaker Verification System Through the Multi-Vector Quantization Codebook Structure (멀티 VQ 코드북을 이용한 화자확인 시스템의 성능개선)

Lee, Jae-Hee;Lee, Sang-Cheol;Jung, Yeon-Hai
- Proceedings of the KIEE Conference
- /
- 2005.10a
- /
- pp.176-179
- /
- 2005
In this paper, we propose the new method that separate the existing common VQ code book into two parts, one is the common VQ code book which is the half of existing common VQ code book, another is the personal speaker VQ code book which accommodate the personal speaker characteristic, variation to improve the performance of the text-dependent speaker verification system using discrete HMM. We apply the propose method m this paper to the text-dependent speaker verification system using discrete HMM and have the improvement performance of about 0.24% compared to existing method
PDF

HMM-based Speech Recognition using DMS Model and Double Spectral Feature (DMS 모델과 이중 스펙트럼 특징을 이용한 HMM에 의한 음성 인식)

Ann Tae-Ock
- Journal of the Korea Academia-Industrial cooperation Society
- /
- v.7 no.4
- /
- pp.649-655
- /
- 2006
This paper proposes a HMM-based recognition method using DMSVQ(Dynamic Multi-Section Vector Quantization) codebook by DMS model and double spectral feature, as a method on the speech recognition of speaker-independent. LPC cepstrum parameter is used as a instantaneous spectral feature and LPC cepstrum's regression coefficient is used as a dynamic spectral feature These two spectral features are quantized as each VQ codebook. HMM using DMS model is modeled by receiving instantaneous spectral feature and dynamic spectral feature by input. Other experiments to compare with the results of recognition experiments using proposed method are implemented by the various conventional recognition methods under the equivalent environment of data and conditions. Through the experiment results, it is proved that the proposed method in this paper is superior to the conventional recognition methods.
PDF

Face Recognition Using Wavelet Coefficients and Hidden Markov Model (웨이블렛 계수와 Hidden Markov Model을 이용한 얼굴인식 기법)

Lee, Kyung-Ah;Lee, Dae-Jong;Park, Jang-Hwan;Chun, Myung-Geun
- Journal of the Korean Institute of Intelligent Systems
- /
- v.13 no.6
- /
- pp.673-678
- /
- 2003
In this paper, we proposes a method for face recognition using HMM(hidden Markov model) and wavelet coefficients First, input images are compressed by using the multi-resolution analysis based on the discrete wavelet transform. And then, the wavelet coefficients obtained from each subband are used as feature vectors to construct the HMMs. In the recognition stage, we obtained higher recognition rate by summing of each recognition rate of wavelet subband. The usefulness of the proposed method was shown by comparing with conventional VQ and DCT-HMM ones. The experimental results show that the proposed method is more satisfactory than previous ones.
https://doi.org/10.5391/JKIIS.2003.13.6.673 인용 PDF KSCI

A Comparison of Discrete and Continuous Hidden Markov Models for Korean Digit Recognition (한국어 숫자음 인식을 위한 이산분포 HMM과 연속분포 HMM의 성능 비교 연구)

홍형진
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1994.06c
- /
- pp.157-160
- /
- 1994
본 논문에서는 한국어 숫자음 인식에 대한 이산분포 HMM과 연속분포 HMM의 인식 성능을 비교하였다. 일반적으로 연속분포 HMM은 많은 계산량이 필요하고, 학습시 초기값이 매우 민감하다는 단점이 있지만, 이산분포 HMM의 VQ로 인한 왜곡을 제거함으로써 인식률을 향상시킬 수 있다. 여기서는 성능비교를 위해서 mel-cepstrum의 분석차수, 이산분포 HMM의 codebook 크기, 연속분포 HMM의 miture 개수등에 따른 인식성능을 비교하였다. 실험 결과 이산분포 HMM에서는 mel-cepstrum 벡터가 14차이고, codebook 크기가 64일 때 가장 좋은 성능을 나타냈으며, 연속부포 HMM에서는 mel-cepstrum 벡터가 16차이고 miture가 3개일 때 가장 좋은 결과를 얻을 수 있었다. 특히 학습 데이터의 양이 적은 경우에는 연속분포 HMM이 이산분포 HMM보다 더 좋은 인식률을 나타내었다.
PDF

A Comparative Study of Speaker Adaptation Methods for HMM-Based Speech Recognition (HMM 음성인식 시스템을 위한 화자적응 방법들의 성능비교)

Koo, Myoung-Wan;Un, Chong-Kwan;Lee, Hwang-Soo
- The Journal of the Acoustical Society of Korea
- /
- v.10 no.3
- /
- pp.37-43
- /
- 1991
In this paper, we compare the performances of speaker adaptation which consist of two stages of processing for an HMM-based speech recognition system. We compare three kinds of VQ adaptation methods which may be used in the first stage to reduce the distortion error for a new speaker : label prototype adaptation, adaptation with a codebook from adaptation speech itself, and adaptation with a mapped codebook. We then compare the performance of four kinds of HMM parameter adaptation methods which may be used in the second stage to transform HMM parameters for a new speaker : adaptation by the Viterbi algorithm, that by the DTW algorithm, that by the iterative alignment algorithm. The results show that adaptation based on the fuzzy histogram algorithm yields the highest accuracy in an HMM-based speech recognition system.
PDF

The Method for Face Recognition using Wavelet Coefficients and Hidden Markov Model (웨이블렛 계수와 Hidden Markov Model를 이용한 얼굴인식 기법)

이경아;이대종;박장환;전명근
- Proceedings of the Korean Institute of Intelligent Systems Conference
- /
- 2003.09b
- /
- pp.162-165
- /
- 2003
본 논문에서는 웨이블렛 계수와 Hidden Markov Model(HMM)이용한 얼굴인식 알고리즘을 제안한다. 입력 영상은 이산웨이블렛을 기반으로 한 다해상도 분석기법을 사용하여 데이터 수를 압축한 후, 각각의 해상도에서 얻어진 웨이블렛 계수를 특징벡터로 사용하여 HMM의 모델을 생성한다. 인식단계 에서는 웨이블렛 변환에 의해 생성된 개별대역의 인식값을 더하여 상호 보완함으로써 인식률을 높일 수 있었다. 제안된 알고리즘의 타당성을 검증하기 위하여 기본적 알고리즘인 벡터 양자화(VQ) 기법을 적용한 경우와 기존 얼굴인식에 제안된 DCT-HMM을 이용한 기법과의 인식률 비교를 한 결과, 제안된 방법이 우수한 성능을 보임을 알 수 있었다.
PDF

Speech Recognition Using HMM Based on Fuzzy (피지에 기초를 둔 HMM을 이용한 음성 인식)

안태옥;김순협
- Journal of the Korean Institute of Telematics and Electronics B
- /
- v.28B no.12
- /
- pp.68-74
- /
- 1991
This paper proposes a HMM model based on fuzzy, as a method on the speech recognition of speaker-independent. In this recognition method, multi-observation sequences which give proper probabilities by fuzzy rule according to order of short distance from VQ codebook are obtained. Thereafter, the HMM model using this multi-observation sequences is generated, and in case of recognition, a word that has the most highest probability is selected as a recognized word. The vocabularies for recognition experiment are 146 DDD are names, and the feature parameter is 10S0thT LPC cepstrum coefficients. Besides the speech recognition experiments of proposed model, for comparison with it, we perform the experiments by DP, MSVQ and general HMM under same condition and data. Through the experiment results, it is proved that HMM model using fuzzy proposed in this paper is superior to DP method, MSVQ and general HMM model in recognition rate and computational time.
PDF

A study on the speech feature extraction based on the hearing model (청각 모델에 기초한 음성 특징 추출에 관한 연구)

김바울;윤석현;홍광석;박병철
- Journal of the Korean Institute of Telematics and Electronics B
- /
- v.33B no.4
- /
- pp.131-140
- /
- 1996
In this paper, we propose the method that extracts the speech feature using the hearing model through signal precessing techniques. The proposed method includes following procedure ; normalization of the short-time speech block by its maximum value, multi-resolution analysis using the discrete wavelet transformation and re-synthesize using thediscrete inverse wavelet transformation, differentiation after analysis and synthesis, full wave rectification and integration. In order to verify the performance of the proposed speech feature in the speech recognition task, korean digita recognition experiments were carried out using both the dTW and the VQ-HMM. The results showed that, in case of using dTW, the recognition rates were 99.79% and 90.33% for speaker-dependent and speaker-independent task respectively and, in case of using VQ-HMM, the rate were 96.5% and 81.5% respectively. And it indicates that the proposed speech feature has the potentials to use as a simple and efficient feature for recognition task.
PDF

Speech Feature Extraction Based on the Human Hearing Model

Chung, Kwang-Woo;Kim, Paul;Hong, Kwang-Seok
- Proceedings of the KSPS conference
- /
- 1996.10a
- /
- pp.435-447
- /
- 1996
In this paper, we propose the method that extracts the speech feature using the hearing model through signal processing techniques. The proposed method includes the following procedure ; normalization of the short-time speech block by its maximum value, multi-resolution analysis using the discrete wavelet transformation and re-synthesize using the discrete inverse wavelet transformation, differentiation after analysis and synthesis, full wave rectification and integration. In order to verify the performance of the proposed speech feature in the speech recognition task, korean digit recognition experiments were carried out using both the DTW and the VQ-HMM. The results showed that, in the case of using DTW, the recognition rates were 99.79% and 90.33% for speaker-dependent and speaker-independent task respectively and, in the case of using VQ-HMM, the rate were 96.5% and 81.5% respectively. And it indicates that the proposed speech feature has the potential for use as a simple and efficient feature for recognition task
PDF

Korean Speech Recognition using DHMM (DHMM을 이용한 한국어 음성 인식)

Ann, T.O.;Lee, K.S.;Yoo, H.K.;Lee, H.J.;Cho, H.J.;Byun, Y.G.;Kim, S.H.
- The Journal of the Acoustical Society of Korea
- /
- v.10 no.1
- /
- pp.52-60
- /
- 1991
This paper describes the study on isolated word recognition by using DHMM(Dynamic Hidden Markov Model) which has dynamic feature of spectrum as a parameter. This paper discusses speech recognition experiment basedon HMM which can evaluate not only instantaneous spectral features but also dynamic spectral features. LPC cepstrum parameters is used as a static feature and LPC cepstrum's regression coefficient is used as a dynamic feature. These two features are quantized by each VQ codebook. DHMM is modeled by receiving static vector and dynamic vector by input. In the whole experiment, as recognition experiment using DHMM shows 92.7% of recognition rate while the experiment using conventional HMM shows 88.8% of recognition rate, DHMM proved to be a useful model.
PDF

Search Result 34, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)