Search | Korea Science

Improved Bimodal Speech Recognition Study Based on Product Hidden Markov Model

Xi, Su Mei;Cho, Young Im
- International Journal of Fuzzy Logic and Intelligent Systems / v.13 no.3 / pp.164-170 / 2013
Recent years have seen growing demand for automatic speech recognition (ASR) systems that operate robustly in acoustically noisy environments. This paper proposes an improved product hidden Markov model (HMM) for bimodal speech recognition. A two-dimensional training model is built from dependently trained audio-HMM and visual-HMM, reflecting the asynchronous characteristics of the audio and video streams. A weight coefficient is introduced to adjust the weights of the video and audio streams automatically according to differences in the noise environment. Experimental results show that, compared with other bimodal speech recognition approaches, this approach obtains better speech recognition performance.
https://doi.org/10.5391/IJFIS.2013.13.3.164
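The abstract above does not give the form of the weight coefficient. As a minimal sketch, assuming the common stream-weighting scheme in which per-word log-likelihoods from the audio and visual models are combined linearly, the idea can be illustrated as follows. The function names and the toy scores are hypothetical and are not taken from the paper.

```python
def fuse_log_likelihoods(audio_ll, video_ll, audio_weight):
    """Combine per-word log-likelihoods from two streams.

    audio_weight is the weight on the audio stream (0..1); the video
    stream gets 1 - audio_weight. This linear form is an assumption,
    not the paper's confirmed formula.
    """
    if not 0.0 <= audio_weight <= 1.0:
        raise ValueError("audio_weight must lie in [0, 1]")
    return {
        word: audio_weight * audio_ll[word] + (1.0 - audio_weight) * video_ll[word]
        for word in audio_ll
    }


def recognize(audio_ll, video_ll, audio_weight):
    """Return the word with the highest fused score."""
    fused = fuse_log_likelihoods(audio_ll, video_ll, audio_weight)
    return max(fused, key=fused.get)


# Hypothetical scores: the audio model prefers "no", the video model "yes".
audio_scores = {"yes": -12.0, "no": -10.0}
video_scores = {"yes": -8.0, "no": -11.0}

clean_result = recognize(audio_scores, video_scores, audio_weight=0.9)
noisy_result = recognize(audio_scores, video_scores, audio_weight=0.2)
```

With a high audio weight (clean conditions) the audio stream dominates the decision; lowering the weight (noisy conditions) lets the visual stream decide.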

Correlation Analysis of PESQ and MOS Evaluation for HMM-based Synthetic Korean Speech (HMM 기반의 한국어 합성음에 대한 PESQ 및 MOS 평가의 상관도 분석)

Lin, Cang-Song;Bae, Keun-Sung
- Phonetics and Speech Sciences / v.2 no.1 / pp.71-75 / 2010
The PESQ is an objective speech quality measure known to correlate highly with subjective speech quality measures such as MOS. To examine whether it could serve as an objective quality measure for synthetic speech, we carried out subjective evaluation tests with MOS and DMOS and an objective evaluation test with PESQ on HMM-based Korean synthetic speech signals, and analyzed the correlation between them. Experimental results show that the PESQ has correlations of 0.87 with MOS and 0.92 with DMOS. This suggests that the PESQ holds much promise for evaluating the quality of synthetic Korean speech.
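The abstract does not say which correlation statistic was used. A minimal sketch, assuming the Pearson correlation coefficient between per-utterance objective and subjective scores, is shown below. The score lists are made-up placeholders, not data from the paper.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


# Placeholder scores for five synthetic utterances (illustrative only).
pesq_scores = [2.1, 2.6, 3.0, 3.3, 3.9]
mos_scores = [2.4, 2.5, 3.2, 3.1, 4.0]

r = pearson(pesq_scores, mos_scores)
```

A value of `r` close to 1 would indicate that the objective measure tracks the listeners' ratings closely.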

Implementation of Hidden Markov Model based Speech Recognition System for Teaching Autonomous Mobile Robot (자율이동로봇의 명령 교시를 위한 HMM 기반 음성인식시스템의 구현)

조현수;박민규;이민철
- Institute of Control, Robotics and Systems: Conference Proceedings (제어로봇시스템학회:학술대회논문집) / 2000.10a / pp.281-281 / 2000
This paper presents an implementation of a speech recognition system for teaching an autonomous mobile robot. Using human speech as the teaching method provides a more convenient user interface for the mobile robot. To make teaching the robot easier, this study investigates an autonomous mobile robot equipped with a speech recognition function. The speech recognition system uses an algorithm based on the HMM (Hidden Markov Model) to recognize Korean words. A filter-bank analysis model is used as the spectral analysis method for feature extraction. A recognized word is converted into a command for controlling robot navigation.

HMM-based Adaptive Frequency-Hopping Cognitive Radio System to Reduce Interference Time and to Improve Throughput

Sohn, Sung-Hwan;Jang, Sung-Jeen;Kim, Jae-Moung
- KSII Transactions on Internet and Information Systems (TIIS) / v.4 no.4 / pp.475-490 / 2010
Cognitive radio (CR) is an advanced enabling technology for the efficient utilization of vacant spectrum due to its ability to sense the spectrum environment. It is important to determine the spectrum utilization of the primary system accurately in a cognitive radio environment. To define the spectrum utilization state, many CR systems use what is known as the quiet period (QP) method. However, even when using a QP, interference can occur. This reduces system throughput and runs contrary to the basic condition of cognitive radio. To reduce the interference time, a frequency-hopping (FH) algorithm is proposed here. Additionally, to compensate for the loss of throughput in the FH, an HMM-based channel prediction algorithm and a channel allocation algorithm are proposed. Simulations were conducted while varying several parameters. The findings show that the proposed algorithm outperforms conventional channel allocation algorithms.
https://doi.org/10.3837/tiis.2010.08.002

A Study on the Implementation of an Automatic Segmentation System of Korean Speech based on the Hidden Markov Model (HMM에 의한 한국어음성의 자동분할 시스템의 구현에 관한 연구)

김윤중;김미경;이인동
- Journal of Information Technology Application / v.1 no.3_4 / pp.1-23 / 1999
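In this study, we implemented an automatic speech segmentation system that uses the HMM (Hidden Markov Model) and the level-building algorithm and takes as input a sample set of the phoneme sequences to be recognized (the training pattern set). The system generates reference phoneme models from naturally pronounced connected speech. It consists of an initialization stage, an HMM training stage, and a segmentation and clustering stage based on level building. In the initialization stage, control information is used to generate initial phoneme set groups from the training pattern set. In the segmentation and clustering stage, the phoneme models and control information are used to segment the training patterns into phoneme units, and the resulting candidate phonemes are clustered to form phoneme set groups. This process is repeated until the composition of the phoneme models no longer changes, yielding the optimal phoneme models. Experiments were conducted on connected speech patterns consisting of three or fewer digit words. To analyze the automatic segmentation of the training patterns, which is the most important step in generating reference phoneme models for connected words, the segmentation information from each iteration was plotted as graphs and examined.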

EMG Pattern Recognition based on MFCC-HMM-GMM for Prosthetic Arm Control (의수 제어를 위한 MFCC-HMM-GMM 기반의 근전도(EMG) 신호 패턴 인식)

Kim, Jung-Ho;Hong, Joon-Eui;Lee, Dong-Hoon;Choi, Heung-Ho;Kwon, Jang-Woo
- Proceedings of the IEEK Conference / 2006.06a / pp.245-246 / 2006
In this paper, we propose using MFCC coefficients (Mel-Scaled Cepstral Coefficients) and a simple but efficient classification method. Many other features (IAV, zero crossing, LPCC, ..., and their derivatives) are also tested and compared with MFCC coefficients in order to find the best combination. GMM and HMM (discrete and continuous hidden Markov models) are studied as well, in the hope that the use of continuous distributions and the temporal evolution of this set of features will improve the quality of emotion recognition.

Attack Type Discrimination for HMM-based IDS Using Viterbi Algorithm (Viterbi 알고리즘을 이용한 HMM기반 침입탐지 시스템의 침입 유형 판별)

Koo, Ja-Min;Cho, Sung-Bae
- Proceedings of the Korea Information Processing Society Conference / 2003.05c / pp.2093-2096 / 2003
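With the spread of information and communication infrastructure and the advance of technology, intrusions into computer systems and the resulting damage are increasing. Accordingly, anomaly-based intrusion detection systems are being actively studied, and in particular much work models system-call audit data with the hidden Markov model (HMM). However, this approach only detects anomalous behavior that falls below a given threshold and cannot determine what type of intrusion has occurred. To address this blind spot, this paper proposes a method that analyzes state transitions with the Viterbi algorithm and then determines which type of intrusion occurred, and shows the feasibility of the proposed system through experiments.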
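As a rough illustration of the state-sequence analysis mentioned in this abstract, the sketch below implements standard Viterbi decoding for a discrete HMM in the log domain. The two-state model, the system-call symbols, and all probabilities are hypothetical and chosen only for illustration. How the paper maps decoded state sequences to intrusion types is not stated in the abstract and is not modeled here.

```python
import math


def viterbi(observations, states, start_p, trans_p, emit_p):
    """Return (best_log_prob, best_path) for a discrete HMM.

    All probabilities used must be strictly positive, because the
    computation takes logarithms.
    """
    # delta[s]: best log-probability of any state path ending in s
    delta = {
        s: math.log(start_p[s]) + math.log(emit_p[s][observations[0]])
        for s in states
    }
    backpointers = []
    for obs in observations[1:]:
        new_delta, pointers = {}, {}
        for s in states:
            # Best predecessor of state s at this time step
            prev = max(states, key=lambda r: delta[r] + math.log(trans_p[r][s]))
            new_delta[s] = (
                delta[prev] + math.log(trans_p[prev][s]) + math.log(emit_p[s][obs])
            )
            pointers[s] = prev
        delta = new_delta
        backpointers.append(pointers)
    # Backtrack from the best final state
    last = max(states, key=lambda s: delta[s])
    path = [last]
    for pointers in reversed(backpointers):
        path.append(pointers[path[-1]])
    path.reverse()
    return delta[last], path


# Hypothetical toy model over system-call symbols.
states = ("normal", "intrusion")
start_p = {"normal": 0.9, "intrusion": 0.1}
trans_p = {
    "normal": {"normal": 0.9, "intrusion": 0.1},
    "intrusion": {"normal": 0.2, "intrusion": 0.8},
}
emit_p = {
    "normal": {"read": 0.6, "write": 0.3, "exec": 0.1},
    "intrusion": {"read": 0.1, "write": 0.2, "exec": 0.7},
}

log_prob, path = viterbi(
    ["read", "read", "exec", "exec", "exec"], states, start_p, trans_p, emit_p
)
```

For this toy input the decoded path switches from "normal" to "intrusion" at the first "exec" call. A type discriminator could then compare such decoded paths against per-attack reference patterns, which is an assumption about one possible design and not a description of the paper's method.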

The Use of MSVM and HMM for Sentence Alignment

Fattah, Mohamed Abdel
- Journal of Information Processing Systems / v.8 no.2 / pp.301-314 / 2012
In this paper, two new approaches to aligning English-Arabic sentences in bilingual parallel corpora, based on the Multi-Class Support Vector Machine (MSVM) and the Hidden Markov Model (HMM) classifiers, are presented. A feature vector is extracted from the text pair under consideration. This vector contains text features such as length, punctuation score, and cognate score values. A set of manually prepared training data was used to train the Multi-Class Support Vector Machine and the Hidden Markov Model. Another set of data was used for testing. The results of the MSVM and HMM outperform the results of the length-based approach. Moreover, these new approaches are valid for any language pair and are quite flexible, since the feature vector may contain fewer, more, or different features than the ones used in the current research, such as a lexical matching feature or Hanzi characters in Japanese-Chinese texts.
https://doi.org/10.3745/JIPS.2012.8.2.301

A Noise Reduction Method Combined with HMM Composition for Speech Recognition in Noisy Environments

Shen, Guanghu;Jung, Ho-Youl;Chung, Hyun-Yeol
- IEMEK Journal of Embedded Systems and Applications / v.3 no.1 / pp.1-7 / 2008
In this paper, an MSS-NOVO method that combines the HMM composition method with a noise reduction method is proposed for speech recognition in noisy environments. The combined method first applies modified spectral subtraction (MSS) to enhance the noisy input speech, and then applies the noise and voice composition (NOVO) method to build noise-adapted models using the noise in the non-utterance regions of the enhanced speech. To evaluate the effectiveness of the proposed method, we compare MSS-NOVO with other methods, namely SS-NOVO and MWF-NOVO. To prepare the noisy test speech, we add white noise to the KLE 452 database at SNRs ranging from 0 dB to 15 dB in 5 dB intervals. In the tests, the MSS-NOVO method shows average improvements of 66.5% and 13.6% over the existing SS-NOVO and MWF-NOVO methods, respectively. The proposed MSS-NOVO method shows an especially large improvement at low SNRs.
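The test-set construction described in this abstract (white noise added at fixed SNRs) can be sketched as follows. This is a minimal illustration under stated assumptions: Gaussian white noise, SNR defined over the whole signal, and a synthetic sine wave standing in for speech. The helper names are hypothetical and the KLE 452 data is not used.

```python
import math
import random


def power(samples):
    """Mean squared amplitude of a sequence of samples."""
    return sum(v * v for v in samples) / len(samples)


def add_white_noise(signal, snr_db, seed=0):
    """Return (noisy_signal, noise) with Gaussian white noise at snr_db."""
    rng = random.Random(seed)
    noise = [rng.gauss(0.0, 1.0) for _ in signal]
    # Scale the noise so that signal power / noise power = 10^(snr_db / 10)
    target_noise_power = power(signal) / (10 ** (snr_db / 10))
    scale = math.sqrt(target_noise_power / power(noise))
    noise = [scale * n for n in noise]
    noisy = [s + n for s, n in zip(signal, noise)]
    return noisy, noise


# Synthetic stand-in for a speech waveform: a 440 Hz tone at 16 kHz.
signal = [math.sin(2 * math.pi * 440 * t / 16000) for t in range(16000)]

# The SNR grid from the abstract: 0 dB to 15 dB at 5 dB intervals.
noisy_sets = {snr: add_white_noise(signal, snr)[0] for snr in (0, 5, 10, 15)}
```

Scaling by the measured noise power, instead of relying on the nominal variance of the generator, keeps the realized SNR at the requested value for each utterance.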

Subword-based Lip Reading Using State-tied HMM (상태공유 HMM을 이용한 서브워드 단위 기반 립리딩)

Kim, Jin-Young;Shin, Do-Sung
- Speech Sciences / v.8 no.3 / pp.123-132 / 2001
In recent years, research on HCI technology has been very active, and speech recognition is one of its typical methods. Recognition performance, however, deteriorates as surrounding noise increases. To solve this problem, multimodal HCI is being actively studied. This paper describes automated lipreading for bimodal speech recognition based on image and speech information. It employs an audio-visual database containing 1,074 words from 70 voices, with the tri-viseme as the recognition unit and a state-tied HMM as the recognition model. Recognition performance is evaluated for vocabularies of 22 to 1,000 words, and the 22-word recognizer achieves a word recognition rate of 60.5%.
