Integrated Search | Korea Science

A New Feature for Speech Segments Extraction with Hidden Markov Models

홍정우;오창혁
- Communications for Statistical Applications and Methods / Vol. 15, No. 2 / pp.293-302 / 2008
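This paper proposes average power, a new feature vector for extracting speech segments with hidden Markov models, and compares it with mel-frequency cepstral coefficients (MFCC) and the power coefficient. To compare the performance of these three feature vectors, we collect speech data for words containing plosives, which are generally known to be relatively difficult to extract, and run experiments on them. The experiments show that, when speech segments are extracted in environments with various noise levels, average power is more accurate and efficient than MFCC or the power coefficient.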
https://doi.org/10.5351/CKSS.2008.15.2.293
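
As a rough illustration of a frame-level power feature of the kind compared in the abstract above, the sketch below computes the mean squared amplitude of each frame and flags frames above a threshold as speech. The abstract does not define average power precisely, so the definition, the frame length, the hop, the threshold, and the function names `frame_average_power` and `speech_frames` are all assumptions for illustration, and plain thresholding stands in for the HMM-based decision used in the paper.

```python
def frame_average_power(samples, frame_len=4, hop=4):
    """Return the mean squared amplitude of each frame (assumed definition)."""
    powers = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = samples[start:start + frame_len]
        powers.append(sum(x * x for x in frame) / frame_len)
    return powers


def speech_frames(powers, threshold):
    """Flag frames whose average power exceeds the threshold (assumed rule)."""
    return [p > threshold for p in powers]


# Toy signal: quiet frame, loud frame, quiet frame.
signal = [0.0, 0.1, -0.1, 0.0, 1.0, -1.0, 1.0, -1.0, 0.0, 0.1, 0.0, -0.1]
powers = frame_average_power(signal)
flags = speech_frames(powers, threshold=0.5)
```

In practice the per-frame values would be fed to an HMM as observations instead of being thresholded directly.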

Human Activity Recognition Using Spatiotemporal 3-D Body Joint Features with Hidden Markov Models

Uddin, Md. Zia;Kim, Jaehyoun
- KSII Transactions on Internet and Information Systems (TIIS) / Vol. 10, No. 6 / pp.2767-2780 / 2016
Video-based human-activity recognition has become increasingly popular due to the prominent corresponding applications in a variety of fields such as computer vision, image processing, smart-home healthcare, and human-computer interactions. The essential goals of a video-based activity-recognition system include the provision of behavior-based information to enable functionality that proactively assists a person with his/her tasks. The target of this work is the development of a novel approach for human-activity recognition, whereby human-body-joint features that are extracted from depth videos are used. From silhouette images taken at every depth, the direction and magnitude features are first obtained from each connected body-joint pair so that they can be augmented later with motion direction, as well as with the magnitude features of each joint in the next frame. A generalized discriminant analysis (GDA) is applied to make the spatiotemporal features more robust, followed by the feeding of the time-sequence features into a Hidden Markov Model (HMM) for the training of each activity. Lastly, all of the trained-activity HMMs are used for depth-video activity recognition.
https://doi.org/10.3837/tiis.2016.06.017

Design of A Speech Recognition System using Hidden Markov Models

이철원;임인칠
- 전자공학회논문지B / Vol. 33B, No. 1 / pp.108-115 / 1996
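This paper proposes an algorithm and a model topology for connected speech recognition using discrete hidden Markov models. Considering the recognition rate and the recognizable vocabulary, the proposed approach uses two-phoneme-sequence and three-phoneme-sequence models. For more accurate segmentation between phonemes and faster execution, the two-phoneme-sequence model has four states, in which the first and last states are steady states and the remaining states are transition states; the three-phoneme-sequence model is refined to have seven states, consisting of three steady states and four transition states. In addition, the proposed speech recognition algorithm is designed to detect the pronunciation interval of each phoneme within the recognition process.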
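
The state layout described in the abstract above can be sketched as a left-to-right transition matrix. The abstract gives only the number of states and their roles, so the allowed transitions (self-loop or one step forward), the self-loop probability of 0.5, the placement of the steady states in the seven-state model, and the helper names are assumptions for illustration.

```python
def left_to_right_transitions(n_states, self_loop=0.5):
    """Build a transition matrix where each state stays put or advances one step."""
    matrix = [[0.0] * n_states for _ in range(n_states)]
    for i in range(n_states):
        if i == n_states - 1:
            matrix[i][i] = 1.0  # final state only loops on itself
        else:
            matrix[i][i] = self_loop
            matrix[i][i + 1] = 1.0 - self_loop
    return matrix


def state_roles(n_states, steady):
    """Label each state index as 'steady' or 'transition'."""
    return ["steady" if i in steady else "transition" for i in range(n_states)]


# Two-phoneme model: first and last states steady (from the abstract).
two_phoneme = left_to_right_transitions(4)
two_phoneme_roles = state_roles(4, steady={0, 3})

# Three-phoneme model: three steady states; their positions are assumed.
three_phoneme = left_to_right_transitions(7)
three_phoneme_roles = state_roles(7, steady={0, 3, 6})
```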

Analysis of Real-Time Estimation Method Based on Hidden Markov Models for Battery System States of Health

Piao, Changhao;Li, Zuncheng;Lu, Sheng;Jin, Zhekui;Cho, Chongdu
- Journal of Power Electronics / Vol. 16, No. 1 / pp.217-226 / 2016
A new method is proposed based on a hidden Markov model (HMM) to estimate and analyze battery states of health. Battery system health states are defined according to the relationship between internal resistance and lifetime of cells. The source data (terminal voltages and currents) can be obtained from vehicular battery models. A characteristic value extraction method is proposed for HMM. A recognition framework and testing datasets are built to test the estimation rates of different states. Test results show that the estimation rates achieved based on this method are above 90% under single conditions. The method achieves the same results under hybrid conditions. We can also use the HMMs that correspond to hybrid conditions to estimate the states under a single condition. Therefore, this method can achieve the purpose of the study in estimating battery life states. Only voltage and current are used in this method, thereby establishing its simplicity compared with other methods. The batteries can also be tested online, and the method can be used for online prediction.
https://doi.org/10.6113/JPE.2016.16.1.217

Study On the Robustness Of Four Different Face Authentication Methods Under Illumination Changes

고대영;천영하;김진영;이주헌
- 대한전자공학회:학술대회논문집 / 대한전자공학회 2003년도 하계종합학술대회 논문집 Ⅳ / pp.2036-2039 / 2003
This paper studies the robustness of face authentication methods under illumination changes. Four different face authentication methods are tried: Principal Component Analysis, Gaussian Mixture Models, 1-Dimensional Hidden Markov Models, and 2-Dimensional Hidden Markov Models. Experimental results involving an artificial illumination change applied to face images are compared with one another. A face feature vector extraction method based on the 2-Dimensional Discrete Cosine Transform is used. Experiments to evaluate the four face authentication methods are carried out on the Olivetti Research Laboratory (ORL) face database. The best EER (Equal Error Rate) performance is observed for the pseudo 2D HMM.
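
The EER metric mentioned in the abstract above is the operating point where the false acceptance rate equals the false rejection rate. The sketch below approximates it by sweeping thresholds over observed scores; the scores, the convention that higher scores mean "more likely genuine", and the function name `equal_error_rate` are assumptions for illustration and are not taken from the paper.

```python
def equal_error_rate(genuine, impostor):
    """Approximate the EER by testing every observed score as a threshold."""
    best_gap, eer = None, None
    for t in sorted(set(genuine) | set(impostor)):
        frr = sum(s < t for s in genuine) / len(genuine)      # genuine users rejected
        far = sum(s >= t for s in impostor) / len(impostor)   # impostors accepted
        gap = abs(far - frr)
        if best_gap is None or gap < best_gap:
            best_gap, eer = gap, (far + frr) / 2
    return eer


# Made-up similarity scores for illustration only.
genuine_scores = [0.9, 0.8, 0.7, 0.4]
impostor_scores = [0.1, 0.2, 0.3, 0.6]
eer = equal_error_rate(genuine_scores, impostor_scores)
```

A lower EER indicates a better verifier, which is why it is a convenient single number for comparing the four methods.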

On the Development of a Continuous Speech Recognition System Using Continuous Hidden Markov Model for Korean Language

김도영;박용규;권오욱;은종관;박성현
- 한국음향학회지 / Vol. 13, No. 1 / pp.24-31 / 1994
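This paper describes a speaker-independent continuous speech recognition system based on continuous-density hidden Markov models. A continuous-density model consists of mean and variance vectors and models the speech signal directly, which eliminates quantization distortion. The feature vector uses filter bank coefficients together with their first- and second-order derivatives to reflect the dynamic characteristics of the speech signal. The models were trained with the segmental K-means algorithm, and triphones, which take the preceding and following phonemes into account, were used as recognition units to prevent the drop in recognition rate caused by coarticulation, the most serious problem in continuous speech recognition. For search, the time-efficient one-pass search algorithm was used. In speaker-independent recognition experiments for performance evaluation, the recognition rate was 83% without a grammar and 94% when a finite state network was applied.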
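
To make the idea of appending first- and second-order derivatives to filter bank coefficients concrete, here is a minimal sketch. The abstract does not state which derivative formula was used, so the central difference with repeated edge frames, the toy coefficient values, and the function names `delta` and `with_dynamics` are assumptions for illustration.

```python
def delta(frames):
    """Central-difference derivative over time, with edge frames repeated."""
    n = len(frames)
    out = []
    for t in range(n):
        prev = frames[max(t - 1, 0)]
        nxt = frames[min(t + 1, n - 1)]
        out.append([(b - a) / 2 for a, b in zip(prev, nxt)])
    return out


def with_dynamics(frames):
    """Concatenate static, first-derivative, and second-derivative coefficients."""
    d1 = delta(frames)
    d2 = delta(d1)
    return [f + a + b for f, a, b in zip(frames, d1, d2)]


# Four frames of two made-up filter bank coefficients each.
filter_bank = [[1.0, 2.0], [2.0, 4.0], [3.0, 6.0], [4.0, 8.0]]
features = with_dynamics(filter_bank)
```

Each output frame is three times as long as the input frame: static coefficients first, then the two derivative blocks.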

Combination Tandem Architecture with Segmental Features for Robust Speech Recognition

윤영선;이윤근
- 대한음성학회지:말소리 / No. 62 / pp.113-131 / 2007
Previous studies report that recognition systems based on segmental features give better results than systems based on conventional features. Separately, various studies have combined neural networks and hidden Markov models within a single system, in the expectation of combining the advantages of both. Following these studies, the tandem approach was introduced, which uses a neural network as the classifier and hidden Markov models as the decoder. In this paper, we apply the trend information of segmental features to the tandem architecture and use the posterior probabilities output by the neural network as inputs to the recognition system. Experiments are performed on the Aurora2 database to examine the potential of the trend-feature-based tandem architecture. The results show that the proposed system performs better in very low SNR environments. Consequently, we argue that trend information in the tandem architecture can be used in addition to traditional MFCC features.

A Study on Word Recognition using sub-model based Hidden Markov Model

신원호
- 한국음향학회:학술대회논문집 / 한국음향학회 1994년도 제11회 음성통신 및 신호처리 워크샵 논문집 (SCAS 11권 1호) / pp.395-398 / 1994
In this paper, word recognition using a sub-model-based hidden Markov model was studied. The phoneme models comprised 61 phonemes, chosen in terms of the pronunciation characteristics of the Korean language. Word models were built by serial concatenation of these phoneme models. With this phoneme concatenation, however, the second and third phonemes of a syllable overlap in distribution. Taking this into account, a method that combines the second and third phonemes into one model was proposed. To prevent an increase in the number of models, similar phonemes were merged, and finally 57 models were created. In the experiments, a suitable sub-model structure was searched for and recognition results were compared. Similar recognition results were obtained, and overall recognition rates increased when the parameter tying method was used.

Design of Robust Speech Recognition System Using Tandem Architecture

윤영선;이윤근
- 대한음성학회:학술대회논문집 / 대한음성학회 2007년도 한국음성과학회 공동학술대회 발표논문집 / pp.323-326 / 2007
Various studies have combined neural networks and hidden Markov models within a single system, in the expectation of combining the advantages of both. Following these studies, the tandem approach was introduced, which uses a neural network as the classifier and hidden Markov models as the decoder. In this paper, we apply the trend information of segmental features to the tandem architecture and use the posterior probabilities output by the neural network as inputs to the recognition system. Experiments are performed on the Aurora2 database to examine the potential of the trend-feature-based tandem architecture. The proposed method shows better results than the baseline system in very low SNR environments.

Isolated Word Recognition Using Hidden Markov Models with Bounded State Duration

이기희;임인칠
- 전자공학회논문지B / Vol. 32B, No. 5 / pp.756-764 / 1995
In this paper, we propose MLP (multilayer perceptron) based HMMs (hidden Markov models) with bounded state duration for isolated word recognition. The minimum and maximum state durations for each state of an HMM are estimated during the training phase and used as parameters that constrain state transitions in the recognition phase. The procedure for estimating these parameters and the recognition algorithm using the proposed HMMs are also described. Speaker-independent isolated word recognition experiments using a vocabulary of 10 city names and 11 digits indicate that the recognition rate can be improved by adjusting the minimum state durations.
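
One plausible way to estimate per-state duration bounds of the kind described in the abstract above is to measure run lengths in state alignments of the training data. The abstract does not give the estimation procedure, so the use of aligned state sequences as input, the toy alignments, and the function name `duration_bounds` are assumptions for illustration, not the authors' method.

```python
def duration_bounds(alignments, n_states):
    """Return (min, max) run length per state over all aligned state sequences.

    Each alignment is a non-empty list of state indices, one per frame.
    States that never occur keep the value None.
    """
    bounds = [None] * n_states
    for path in alignments:
        run_state, run_len = path[0], 0
        for s in path + [None]:  # the None sentinel flushes the final run
            if s == run_state:
                run_len += 1
                continue
            lo, hi = bounds[run_state] or (run_len, run_len)
            bounds[run_state] = (min(lo, run_len), max(hi, run_len))
            run_state, run_len = s, 1
    return bounds


# Two made-up frame-level alignments for a three-state model.
alignments = [[0, 0, 1, 1, 1, 2], [0, 1, 1, 2, 2, 2]]
bounds = duration_bounds(alignments, n_states=3)
```

During recognition, a decoder could then disallow leaving a state before its minimum duration or staying past its maximum.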
