Search | Korea Science

Semi-Continuous Hidden Markov Model with the MIN Module (MIN 모듈을 갖는 준연속 Hidden Markov Model)

Kim, Dae-Keuk;Lee, Jeong-Ju;Jeong, Ho-Kyoun;Lee, Sang-Hee
- Speech Sciences
- /
- v.7 no.4
- /
- pp.11-26
- /
- 2000
In this paper, we propose the HMM with the MIN module. Because initial and re-estimated variance vectors are important elements for performance in HMM recognition systems, we propose a method which compensates for the mismatched statistical feature of training and test data. The MIN module function is a differentiable function similar to the sigmoid function. Unlike a continuous density function, it does not include variance vectors of the data set. The proposed hybrid HMM/MIN module is a unified network in which the observation probability in the HMM is replaced by the MIN module neural network. The parameters in the unified network are re-estimated by the gradient descent method for the Maximum Likelihood (ML) criterion. In estimating parameters, the variance vector is not estimated because there is no variance element in the MIN module function. The experiment was performed to compare the performance of the proposed HMM and the conventional HMM. The experiment measured an isolated number for speaker independent recognition.
PDF

A Comparative Study of Speaker Adaptation Methods for HMM-Based Speech Recognition (HMM 음성인식 시스템을 위한 화자적응 방법들의 성능비교)

Koo, Myoung-Wan;Un, Chong-Kwan;Lee, Hwang-Soo
- The Journal of the Acoustical Society of Korea
- /
- v.10 no.3
- /
- pp.37-43
- /
- 1991
In this paper, we compare the performances of speaker adaptation which consist of two stages of processing for an HMM-based speech recognition system. We compare three kinds of VQ adaptation methods which may be used in the first stage to reduce the distortion error for a new speaker : label prototype adaptation, adaptation with a codebook from adaptation speech itself, and adaptation with a mapped codebook. We then compare the performance of four kinds of HMM parameter adaptation methods which may be used in the second stage to transform HMM parameters for a new speaker : adaptation by the Viterbi algorithm, that by the DTW algorithm, that by the iterative alignment algorithm. The results show that adaptation based on the fuzzy histogram algorithm yields the highest accuracy in an HMM-based speech recognition system.
PDF

A Study on the Korean Syllable As Recognition Unit (인식 단위로서의 한국어 음절에 대한 연구)

Kim, Yu-Jin;Kim, Hoi-Rin;Chung, Jae-Ho
- The Journal of the Acoustical Society of Korea
- /
- v.16 no.3
- /
- pp.64-72
- /
- 1997
In this paper, study and experiments are performed for finding recognition unit fit which can be used in large vocabulary recognition system. Specifically, a phoneme that is currently used as recognition unit and a syllable in which Korean is well characterized are selected. From comparisons of recognition experiments, the study is performed whether a syllable can be considered as recognition unit of Korean recognition system. For report of an objective result of the comparison experiment, we collected speech data of a male speaker and processed them by hand-segmentation for phoneme boundary and labeling to construct speech database. And for training and recognition based on HMM, we used HTK (HMM Tool Kit) 2.0 of commercial tool from Entropic Co. to experiment in same condition. We applied two HMM model topologies, 3 emitting state of 5 state and 6 emitting state of 8 state, in Continuous HMM on training of each recognition unit. We also used 3 sets of PBW (Phonetically Balanced Words) and 1 set of POW(Phonetically Optimized Words) for training and another 1 set of PBW for recognition, that is "Speaker Dependent Medium Vocabulary Size Recognition." Experiments result reports that recognition rate is 95.65% in phoneme unit, 94.41% in syllable unit and decoding time of recognition in syllable unit is faster by 25% than in phoneme.
PDF

Word Verification using Similar Word Information and State-Weights of HMM using Genetic Algorithmin (유사단어 정보와 유전자 알고리듬을 이용한 HMM의 상태하중값을 사용한 단어의 검증)

Kim, Gwang-Tae;Baek, Chang-Heum;Hong, Jae-Geun
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.38 no.1
- /
- pp.97-103
- /
- 2001
Hidden Markov Model (HMM) is the most widely used method in speech recognition. In general, HMM parameters are trained to have maximum likelihood (ML) for training data. Although the ML method has good performance, it dose not take account into discrimination to other words. To complement this problem, a word verification method by re-recognition of the recognized word and its similar word using the discriminative function of the two words. To find the similar word, the probability of other words to the HMM is calculated and the word showing the highest probability is selected as the similar word of the mode. To achieve discrimination to each word the weight to each state is appended to the HMM parameter. The weight is calculated by genetic algorithm. The verificator complemented discrimination of each word and reduced the error occurred by similar word. As a result of verification the total error is reduced by about 22%
PDF

Context Recognition Using Environmental Sound for Client Monitoring System (피보호자 모니터링 시스템을 위한 환경음 기반 상황 인식)

Ji, Seung-Eun;Jo, Jun-Yeong;Lee, Chung-Keun;Oh, Siwon;Kim, Wooil
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.19 no.2
- /
- pp.343-350
- /
- 2015
This paper presents a context recognition method using environmental sound signals, which is applied to a mobile-based client monitoring system. Seven acoustic contexts are defined and the corresponding environmental sound signals are obtained for the experiments. To evaluate the performance of the context recognition, MFCC and LPCC method are employed as feature extraction, and statistical pattern recognition method are used employing GMM and HMM as acoustic models, The experimental results show that LPCC and HMM are more effective at improving context recognition accuracy compared to MFCC and GMM respectively. The recognition system using LPCC and HMM obtains 96.03% in recognition accuracy. These results demonstrate that LPCC is effective to represent environmental sounds which contain more various frequency components compared to human speech. They also prove that HMM is more effective to model the time-varying environmental sounds compared to GMM.
https://doi.org/10.6109/jkiice.2015.19.2.343 인용 PDF KSCI KPUBS HTML

A Face Recognition using the Hidden Markov Model and Karhuman Loevs Transform (Hidden Markov Model과 Karhuman Loevs Transform를 이용한 얼굴인식)

Kim, Do-Hyun;Hwang, Suen-Ki;Kang, Yong-Seok;Kim, Tae-Woo;Kim, Moon-Hwan;Bae, Cheol-Soo
- The Journal of Korea Institute of Information, Electronics, and Communication Technology
- /
- v.4 no.1
- /
- pp.3-8
- /
- 2011
The work presented in this paper describes a Hidden Markov Model(HMM)-based framework for face recognition and face detection. The observation vectors used to characterize the statics of the HMM are obtained using the coefficients of the Karhuman-Loves Transform(KLT). The face recognition method presented in this paper reduces significantly the computational complexity of previous HMM-based face recognition systems, while slightly improving the recognition rate. In addition, the suggested method is more effective than the exiting ones in face extraction in terms of accuracy and others even under complex changes to the surroundings such as lighting.
https://doi.org/10.17661/jkiiect.2011.4.1.003 인용 PDF

HSA-based HMM Optimization Method for Analyzing EEG Pattern of Motor Imagery (운동심상 EEG 패턴분석을 위한 HSA 기반의 HMM 최적화 방법)

Ko, Kwang-Eun;Sim, Kwee-Bo
- Journal of Institute of Control, Robotics and Systems
- /
- v.17 no.8
- /
- pp.747-752
- /
- 2011
HMMs (Hidden Markov Models) are widely used for biological signal, such as EEG (electroencephalogram) sequence, analysis because of their ability to incorporate sequential information in their structure. A recent trends of research are going after the biological interpretable HMMs, and we need to control the complexity of the HMM so that it has good generalization performance. So, an automatic means of optimizing the structure of HMMs would be highly desirable. In this paper, we described a procedure of classification of motor imagery EEG signals using HMM. The motor imagery related EEG signals recorded from subjects performing left, right hand and foots motor imagery. And the proposed a method that was focus on the validation of the HSA (Harmony Search Algorithm) based optimization for HMM. Harmony search algorithm is sufficiently adaptable to allow incorporation of other techniques. A HMM training strategy using HSA is proposed, and it is tested on finding optimized structure for the pattern recognition of EEG sequence. The proposed HSA-HMM can performs global searching without initial parameter setting, local optima, and solution divergence.
https://doi.org/10.5302/J.ICROS.2011.17.8.747 인용 PDF KSCI

Recognition method of stripe waves projected to bodies using HMM (인체에 투사된 스트라이프 파형의 HMM을 이용한 인식방안)

Seok Hyun-tack;Kwak Kyung-sup
- Journal of the Institute of Electronics Engineers of Korea CI
- /
- v.42 no.1
- /
- pp.51-58
- /
- 2005
we can set laser patterns with 3D information from vision camera after projected to object with laser stripes. They are very useful for 3-Dimensional informations. We researched the laser patterns of human body projected by stripes and found out three featuring patterns and made database of patterns using Fourier descriptors to recognize the patterns of bodies. The HMM method and Fourier descriptors to recognize human body were experimented. We found out HMM method can recognize human body in more efficient rate than the other.
PDF KSCI

Implementation of a Robust Speaker Recognition System in Noisy Environment Using AR HMM with Duration-term (지속시간항을 갖는 AR HMM을 이용한 잡음환경에서의 강인 화자인식 시스템 구현)

이기용;임재열
- The Journal of the Acoustical Society of Korea
- /
- v.20 no.6
- /
- pp.26-33
- /
- 2001
Though speaker recognition based on conventional AR HMM shows good performance, its lack of modeling the environmental noise makes its performance degraded in case of practical noisy environment. In this paper, a robust speaker recognition system based on AR HMM is proposed, where noise is considered in the observation signal model for practical noisy environment and duration-term is considered to increase performance. Experimental results, using the digits database from 100 speakers (77 males and 23 females) under white noise and car noise, show improved performance.
PDF

Face Recognition Method Robust to Change in Lighting Condition (조명의 변화에 강건한 얼굴인식)

Nam, Kee-Hwan;Han, Jun-Hee;Park, Ho-Sik;Lee, Young-Sik;Jung, Yen-Gil;Ra, Sang-Dong;Bae, Cheol-Soo
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- v.9 no.1
- /
- pp.1137-1140
- /
- 2005
The work presented in this paper describes a Hidden Markov Model(HMM)-based framework for face recognition and face detection. The observation vectors used to characterize the statics of the HMM are obtained using the coefficients of the Karhuman-Loves Transform(KLT). The face recognition method presented in this paper reduces significantly the computational complexity of previous HMM-based face recognition systems, while slightly improving the recognition rate. In addition, the suggested method is more effective than the exiting ones in face extraction in terms of accuracy and others even under complex changes to the surroundings such as lighting.
PDF

Search Result 963, Processing Time 0.026 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)