• Title/Summary/Keyword: Hidden Markov Network

Search Result 111, Processing Time 0.039 seconds

A Soccer Video Analysis Using Product Hierarchical Hidden Markov Model (PHHMM(Product Hierarchical Hidden Markov Model)을 이용한 축구 비디오 분석)

  • Kim, Moo-Sung;Kang, Hang-Bong
    • Proceedings of the IEEK Conference
    • /
    • 2006.06a
    • /
    • pp.681-682
    • /
    • 2006
  • 일반적으로 축구 비디오 데이터는 멀티모달과 멀티레이어 속성을 지닌다. 이러한 데이터를 다루기 적합한 모델은 동적 베이지안 네트워크(Dynamic Bayesian Network: DBN) 형태의 위계적 은닉 마르코프 모델(Hierarchical Hidden Markov Model: HHMM)이다. 이러한 HHMM 중 다중속성의 특징들이 서로 상호작용하는 PHHMM(Product Hierarchical Hidden Markov Model)이 있다. 본 논문에서는 PHHMM 을 축구 경기의 Play/Break 이벤트 검색 및 분석에 적용하였고 바람직한 결과를 얻었다.

  • PDF

Isolated-Word Recognition Using Neural Network and Hidden Markov Model (Neural-HMM을 이용한 고립단어 인식)

  • 김연수;김창석
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.17 no.11
    • /
    • pp.1199-1205
    • /
    • 1992
  • In this paper, a Korean word recognition method which usese Neural Network and Hidden Markov Models(HMM) is proposed to improve a recognition rate with a small amount of learning data. The method reduces the fluctuation due to personal differences which is a problem to a HMM recognition system. In this method, effective recognizer is designed by the complement of each recognition result of the Hidden Markov Models(HMM) and Neural Network. In order to evaluate this model, word recognition experiment is carried out for 28 cities which is DDD area names uttered by two male and a female in twenties. As a result of testing HMM with 8 state, codeword is 64, the recognition rate 91[%], as a result of testing Neural network(NN) with 64 codeword the recognition rate is 89[%]. Finally, as a result of testing NN-HMM with 64 codeword which the best condition in former tests, the recognition rate is 95[%].

  • PDF

A Study on Performance Evaluation of Hidden Markov Network Speech Recognition System (Hidden Markov Network 음성인식 시스템의 성능평가에 관한 연구)

  • 오세진;김광동;노덕규;위석오;송민규;정현열
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.4 no.4
    • /
    • pp.30-39
    • /
    • 2003
  • In this paper, we carried out the performance evaluation of HM-Net(Hidden Markov Network) speech recognition system for Korean speech databases. We adopted to construct acoustic models using the HM-Nets modified by HMMs(Hidden Markov Models), which are widely used as the statistical modeling methods. HM-Nets are carried out the state splitting for contextual and temporal domain by PDT-SSS(Phonetic Decision Tree-based Successive State Splitting) algorithm, which is modified the original SSS algorithm. Especially it adopted the phonetic decision tree to effectively express the context information not appear in training speech data on contextual domain state splitting. In case of temporal domain state splitting, to effectively represent information of each phoneme maintenance in the state splitting is carried out, and then the optimal model network of triphone types are constructed by in the parameter. Speech recognition was performed using the one-pass Viterbi beam search algorithm with phone-pair/word-pair grammar for phoneme/word recognition, respectively and using the multi-pass search algorithm with n-gram language models for sentence recognition. The tree-structured lexicon was used in order to decrease the number of nodes by sharing the same prefixes among words. In this paper, the performance evaluation of HM-Net speech recognition system is carried out for various recognition conditions. Through the experiments, we verified that it has very superior recognition performance compared with the previous introduced recognition system.

  • PDF

Semi-Continuous Hidden Markov Model with the MIN Module (MIN 모듈을 갖는 준연속 Hidden Markov Model)

  • Kim, Dae-Keuk;Lee, Jeong-Ju;Jeong, Ho-Kyoun;Lee, Sang-Hee
    • Speech Sciences
    • /
    • v.7 no.4
    • /
    • pp.11-26
    • /
    • 2000
  • In this paper, we propose the HMM with the MIN module. Because initial and re-estimated variance vectors are important elements for performance in HMM recognition systems, we propose a method which compensates for the mismatched statistical feature of training and test data. The MIN module function is a differentiable function similar to the sigmoid function. Unlike a continuous density function, it does not include variance vectors of the data set. The proposed hybrid HMM/MIN module is a unified network in which the observation probability in the HMM is replaced by the MIN module neural network. The parameters in the unified network are re-estimated by the gradient descent method for the Maximum Likelihood (ML) criterion. In estimating parameters, the variance vector is not estimated because there is no variance element in the MIN module function. The experiment was performed to compare the performance of the proposed HMM and the conventional HMM. The experiment measured an isolated number for speaker independent recognition.

  • PDF

A Study on Classification of Heart Sounds Using Hidden Markov Models (Hidden Markov Model을 이용한 심음분류에 관한 연구)

  • Kim Hee-Keun;Chung Young-Joo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.25 no.3
    • /
    • pp.144-150
    • /
    • 2006
  • Clinicians usually use stethoscopic auscultation for the diagnosis of heart diseases. However, the heart sound signal has varying characteristics due to the noise and/or the conditions of the patients. Also, it is not easy for junior clinicians to find the acoustical differences between different kinds or heart sound signals. which may result in errors in the diagnosis. Thus it will be quite useful for the clinicians to make use of an automatic classification system using signal processing techniques. In this paper, we propose to use hidden Markov models in stead of artificial neural networks which have been conventionally used for the automatic classification of heart sounds. In the experiments classifying heart sound signals. we could see that the proposed methods were quite successful in the classification accuracy.

Clustering In Tied Mixture HMM Using Homogeneous Centroid Neural Network (Homogeneous Centroid Neural Network에 의한 Tied Mixture HMM의 군집화)

  • Park Dong-Chul;Kim Woo-Sung
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.31 no.9C
    • /
    • pp.853-858
    • /
    • 2006
  • TMHMM(Tied Mixture Hidden Markov Model) is an important approach to reduce the number of free parameters in speech recognition. However, this model suffers from a degradation in recognition accuracy due to its GPDF (Gaussian Probability Density Function) clustering error. This paper proposes a clustering algorithm, called HCNN(Homogeneous Centroid Neural network), to cluster acoustic feature vectors in TMHMM. Moreover, the HCNN uses the heterogeneous distance measure to allocate more code vectors in the heterogeneous areas where probability densities of different states overlap each other. When applied to Korean digit isolated word recognition, the HCNN reduces the error rate by 9.39% over CNN clustering, and 14.63% over the traditional K-means clustering.

Recognizing Hand Digit Gestures Using Stochastic Models

  • Sin, Bong-Kee
    • Journal of Korea Multimedia Society
    • /
    • v.11 no.6
    • /
    • pp.807-815
    • /
    • 2008
  • A simple efficient method of spotting and recognizing hand gestures in video is presented using a network of hidden Markov models and dynamic programming search algorithm. The description starts from designing a set of isolated trajectory models which are stochastic and robust enough to characterize highly variable patterns like human motion, handwriting, and speech. Those models are interconnected to form a single big network termed a spotting network or a spotter that models a continuous stream of gestures and non-gestures as well. The inference over the model is based on dynamic programming. The proposed model is highly efficient and can readily be extended to a variety of recurrent pattern recognition tasks. The test result without any engineering has shown the potential for practical application. At the end of the paper we add some related experimental result that has been obtained using a different model - dynamic Bayesian network - which is also a type of stochastic model.

  • PDF

Research on aging-related degradation of control rod drive system based on dynamic object-oriented Bayesian network and hidden Markov model

  • Kang Zhu;Xinwen Zhao;Liming Zhang;Hang Yu
    • Nuclear Engineering and Technology
    • /
    • v.54 no.11
    • /
    • pp.4111-4124
    • /
    • 2022
  • The control rod drive system is critical to the reactor's reliable operation. The performance of its control system and mechanical system will gradually deteriorate because of operational and environmental stresses, thus increasing the reactor's operational risk. Currently there are few researches on the aging-related degradation of the entire control rod drive system. Because it is difficult to quantify the effect of various environmental stresses and establish an accurate physical model when multiple mechanisms superimposed in the degradation process. Therefore, this paper investigates the aging-related degradation of a control rod drive system by integrating Dynamic Object-Oriented Bayesian Network and Hidden Markov Model. Uncertainties in the degradation of the control system and mechanical system are addressed by using fuzzy theory and the Hidden Markov Model respectively. A system which consists of eight control rod drive mechanisms divided into two groups is used to demonstrate the method. The aging-related degradation of the control rod drive system is analyzed by the Bayesian inference algorithm based on the accelerated life test data, and the impact of different operating schemes on the system performance is also investigated. Meanwhile, the components or units that have major impact on the system's performance are identified at different operational phases. Finally, several essential safety measures are suggested to mitigate the risk caused by the system degradation.

Implementation of the Automatic Speech Editing System Using Keyword Spotting Technique (핵심어 인식을 이용한 음성 자동 편집 시스템 구현)

  • Chung, Ik-Joo
    • Speech Sciences
    • /
    • v.3
    • /
    • pp.119-131
    • /
    • 1998
  • We have developed a keyword spotting system for automatic speech editing. This system recognizes the only keyword 'MBC news' and then sends the time information to the host system. We adopted a vocabulary dependent model based on continuous hidden Markov model, and the Viterbi search was used for recognizing the keyword. In recognizing the keyword, the system uses a parallel network where HMM models are connected independently and back-tracking information for reducing false alarms and missing. We especially focused on implementing a stable and practical real-time system.

  • PDF

Combination Tandem Architecture with Segmental Features for Robust Speech Recognition (강인한 음성 인식을 위한 탠덤 구조와 분절 특징의 결합)

  • Yun, Young-Sun;Lee, Yun-Keun
    • MALSORI
    • /
    • no.62
    • /
    • pp.113-131
    • /
    • 2007
  • It is reported that the segmental feature based recognition system shows better results than conventional feature based system in the previous studies. On the other hand, the various studies of combining neural network and hidden Markov models within a single system are done with expectations that it may potentially combine the advantages of both systems. With the influence of these studies, tandem approach was presented to use neural network as the classifier and hidden Markov models as the decoder. In this paper, we applied the trend information of segmental features to tandem architecture and used posterior probabilities, which are the output of neural network, as inputs of recognition system. The experiments are performed on Auroral database to examine the potentiality of the trend feature based tandem architecture. From the results, the proposed system outperforms on very low SNR environments. Consequently, we argue that the trend information on tandem architecture can be additionally used for traditional MFCC features.

  • PDF