Search | Korea Science

A New Feature for Speech Segments Extraction with Hidden Markov Models (숨은마코프모형을 이용하는 음성구간 추출을 위한 특징벡터)

Hong, Jeong-Woo;Oh, Chang-Hyuck
- Communications for Statistical Applications and Methods
- /
- v.15 no.2
- /
- pp.293-302
- /
- 2008
In this paper we propose a new feature, average power, for speech segments extraction with hidden Markov models, which is based on mel frequencies of speech signals. The average power is compared with the mel frequency cepstral coefficients, MFCC, and the power coefficient. To compare performances of three types of features, speech data are collected for words with explosives which are generally known hard to be detected. Experiments show that the average power is more accurate and efficient than MFCC and the power coefficient for speech segments extraction in environments with various levels of noise.
https://doi.org/10.5351/CKSS.2008.15.2.293 인용 PDF KSCI

Human Activity Recognition Using Spatiotemporal 3-D Body Joint Features with Hidden Markov Models

Uddin, Md. Zia;Kim, Jaehyoun
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.10 no.6
- /
- pp.2767-2780
- /
- 2016
Video-based human-activity recognition has become increasingly popular due to the prominent corresponding applications in a variety of fields such as computer vision, image processing, smart-home healthcare, and human-computer interactions. The essential goals of a video-based activity-recognition system include the provision of behavior-based information to enable functionality that proactively assists a person with his/her tasks. The target of this work is the development of a novel approach for human-activity recognition, whereby human-body-joint features that are extracted from depth videos are used. From silhouette images taken at every depth, the direction and magnitude features are first obtained from each connected body-joint pair so that they can be augmented later with motion direction, as well as with the magnitude features of each joint in the next frame. A generalized discriminant analysis (GDA) is applied to make the spatiotemporal features more robust, followed by the feeding of the time-sequence features into a Hidden Markov Model (HMM) for the training of each activity. Lastly, all of the trained-activity HMMs are used for depth-video activity recognition.
https://doi.org/10.3837/tiis.2016.06.017 인용 PDF KSCI KPUBS HTML

Design of A Speech Recognition System using Hidden Markov Models (은닉 마코프 모델을 이용한 음성 인식 시스템 설계)

Lee, Chul-Won;Lim, In-Chil
- Journal of the Korean Institute of Telematics and Electronics B
- /
- v.33B no.1
- /
- pp.108-115
- /
- 1996
This paper proposes an algorithm and a model topology for the connected speech recognition using Discrete Hidden Markov Models. A proposed model uses diphone and triphone model which consider the recognition rate and recognisable vocabulary. Considering more exact inter- phoneme segmentation and execution speed of algorithm, 4 states have to exist in diphone model where the first state and the last state are keeping a steady state, the other states hold a transient state. 7 states have to exist in triphone model where 7 states are specified and improved to 3 steady states and 4 transition states. Also, the proposed speech recognition algorithm is designed to detect the inter-phoneme segmentation during the recognition processing.
PDF

Analysis of Real-Time Estimation Method Based on Hidden Markov Models for Battery System States of Health

Piao, Changhao;Li, Zuncheng;Lu, Sheng;Jin, Zhekui;Cho, Chongdu
- Journal of Power Electronics
- /
- v.16 no.1
- /
- pp.217-226
- /
- 2016
A new method is proposed based on a hidden Markov model (HMM) to estimate and analyze battery states of health. Battery system health states are defined according to the relationship between internal resistance and lifetime of cells. The source data (terminal voltages and currents) can be obtained from vehicular battery models. A characteristic value extraction method is proposed for HMM. A recognition framework and testing datasets are built to test the estimation rates of different states. Test results show that the estimation rates achieved based on this method are above 90% under single conditions. The method achieves the same results under hybrid conditions. We can also use the HMMs that correspond to hybrid conditions to estimate the states under a single condition. Therefore, this method can achieve the purpose of the study in estimating battery life states. Only voltage and current are used in this method, thereby establishing its simplicity compared with other methods. The batteries can also be tested online, and the method can be used for online prediction.
https://doi.org/10.6113/JPE.2016.16.1.217 인용 PDF KSCI KPUBS HTML

Study On the Robustness Of Four Different Face Authentication Methods Under Illumination Changes (얼굴인증 방법들의 조명변화에 대한 견인성 연구)

고대영;천영하;김진영;이주헌
- Proceedings of the IEEK Conference
- /
- 2003.07e
- /
- pp.2036-2039
- /
- 2003
This paper focuses on the study of the robustness of face authentication methods under illumination changes. Four different face authentication methods are tried. These methods are as follows; Principal Component Analysis, Gaussian Mixture Models, 1-Dimensional Hidden Markov Models, 2-Dimensional Hidden Markov Models. Experiment results involving an artificial illumination change to face images are compared with each others. Face feature vector extraction method based on the 2-Dimensional Discrete Cosine Transform is used. Experiments to evaluate the above four different face authentication methods are carried out on the Olivetti Research Laboratory(ORL) face database. For the pseudo 2D HMM, the best EER (Equal Error Rate) performance is observed.
PDF

On the Development of a Continuous Speech Recognition System Using Continuous Hidden Markov Model for Korean Language (연속분포 HMM을 이용한 한국어 연속 음성 인식 시스템 개발)

Kim, Do-Yeong;Park, Yong-Kyu;Kwon, Oh-Wook;Un, Chong-Kwan;Park, Seong-Hyun
- The Journal of the Acoustical Society of Korea
- /
- v.13 no.1
- /
- pp.24-31
- /
- 1994
In this paper, we report on the development of a speaker independent continuous speech recognition system using continuous hidden Markov models. The continuous hidden Markov model consists of mean and covariance matrices and directly models speech signal parameters, therefore does not have quantization error. Filter bank coefficients with their 1st and 2nd-order derivatives are used as feature vectors to represent the dynamic features of speech signal. We use the segmental K-means algorithm as a training algorithm and triphone as a recognition unit to alleviate performance degradation due to coarticulation problems critical in continuous speech recognition. Also, we use the one-pass search algorithm that Is advantageous in speeding-up the recognition time. Experimental results show that the system attains the recognition accuracy of $83\%$ without grammar and $94\%$ with finite state networks in speaker-indepdent speech recognition.
PDF

Combination Tandem Architecture with Segmental Features for Robust Speech Recognition (강인한 음성 인식을 위한 탠덤 구조와 분절 특징의 결합)

Yun, Young-Sun;Lee, Yun-Keun
- MALSORI
- /
- no.62
- /
- pp.113-131
- /
- 2007
It is reported that the segmental feature based recognition system shows better results than conventional feature based system in the previous studies. On the other hand, the various studies of combining neural network and hidden Markov models within a single system are done with expectations that it may potentially combine the advantages of both systems. With the influence of these studies, tandem approach was presented to use neural network as the classifier and hidden Markov models as the decoder. In this paper, we applied the trend information of segmental features to tandem architecture and used posterior probabilities, which are the output of neural network, as inputs of recognition system. The experiments are performed on Auroral database to examine the potentiality of the trend feature based tandem architecture. From the results, the proposed system outperforms on very low SNR environments. Consequently, we argue that the trend information on tandem architecture can be additionally used for traditional MFCC features.
PDF

A Study on Word Recognition using sub-model based Hidden Markov Model (HMM 부모델을 이용한 단어 인식에 관한 연구)

신원호
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1994.06c
- /
- pp.395-398
- /
- 1994
In this paper the word recognition using sub-model based Hidden Markov Model was studied. Phoneme models were composed of 61 phonemes in therms of Korean language pronunciation characteristic. Using this, word model was maded by serial concatenation. But, in case of this phoneme concatenation, the second and the third phoneme of syllable are overlapped in distribution at the same time. So considering this, the method that combines the second and the third phoneme to one model was proposed. And to prevent the increase in number of model, similar phonemes were combined to one, and finially, 57 models were created. In experiment proper model structure of sub-model was searched for, and recognition results were compared. So similar recognition results were maded, and overall recognition rates were increased in case of using parameter tying method.
PDF

Design of Robust Speech Recognition System Using Tandem Architecture (탠덤 구조를 이용한 강인한 음성 인식 시스템 설계)

Yun, Young-Sun;Lee, Yun-Keun
- Proceedings of the KSPS conference
- /
- 2007.05a
- /
- pp.323-326
- /
- 2007
The various studies of combining neural network and hidden Markov models within a single system are done with expectations that it may potentially combine the advantages of both systems. With the influence of these studies, tandem approach was presented to use neural network as the classifier and hidden Markov models as the decoder. In this paper, we applied the trend information of segmental features to tandem architecture and used posterior probabilities, which are the output of neural network, as inputs of recognition system. The experiments are performed on Aurora2 database to examine the potentiality of the trend feature based tandem architecture. The proposed method shows the better results than the baseline system on very low SNR environments.
PDF

Isolated Word Recognition Using Hidden Markov Models with Bounded State Duration (제한적 상태지속시간을 갖는 HMM을 이용한 고립단어 인식)

이기희;임인칠
- Journal of the Korean Institute of Telematics and Electronics B
- /
- v.32B no.5
- /
- pp.756-764
- /
- 1995
In this paper, we proposed MLP(MultiLayer Perceptron) based HMM's(Hidden Markov Models) with bounded state duration for isolated word recognition. The minimum and maximum state duration for each state of a HMM are estimated during the training phase and used as parameters of constraining state transition in a recognition phase. The procedure for estimating these parameters and the recognition algorithm using the proposed HMM's are also described. Speaker independent isolated word recognition experiments using a vocabulary of 10 city names and 11 digits indicate that recognition rate can be improved by adjusting the minimum state durations.
PDF

Search Result 191, Processing Time 0.023 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)