통합 검색 | Korea Science

바타차랴 알고리즘에서 HMM 특징 추출을 이용한 음성 인식 최적 학습 모델 (Speech Recognition Optimization Learning Model using HMM Feature Extraction In the Bhattacharyya Algorithm)

오상엽
- 디지털융복합연구
- /
- 제11권6호
- /
- pp.199-204
- /
- 2013
음성 인식 시스템은 정확하지 않게 입력된 음성으로부터 학습 모델을 구성하고 유사한 음소 모델로 인식하기 때문에 인식률 저하를 가져온다. 따라서 본 논문에서는 바타차랴 알고리즘을 이용한 음성 인식 최적 학습 모델 구성 방법을 제안하였다. 음소가 갖는 특징을 기반으로 학습 데이터의 음소에 HMM 특징 추출 방법을 이용하였으며 유사한 학습 모델은 바타챠랴 알고리즘을 이용하여 정확한 학습 모델로 인식할 수 있도록 하였다. 바타챠랴 알고리즘을 이용하여 최적의 학습 모델을 구성하여 인식 성능을 평가하였다. 본 논문에서 제안한 시스템을 적용한 결과 음성 인식률에서 98.7%의 인식률을 나타내었다.
https://doi.org/10.14400/JDPM.2013.11.6.199 인용 PDF

HTTP Outbound Traffic에 HMM을 적용한 웹 공격의 비정상 행위 탐지 기법 (Anomaly Detection Scheme of Web-based attacks by applying HMM to HTTP Outbound Traffic)

최병하;최승교;조경산
- 한국컴퓨터정보학회논문지
- /
- 제17권5호
- /
- pp.33-40
- /
- 2012
본 논문은 HTTP Outbound Traffic의 감시를 통해 다양한 웹 공격의 침입 경로에 대응하고, 학습 효율성을 높여 변종 또는 새로운 기법을 이용한 비정상 행위에 대한 오탐을 낮춘 기법을 제안한다. 제안 기법은 HMM(Hidden Markov Model)을 적용하여 HTML 문서속의 태그와 자바스크립트의 학습을 통한 정상 행위 모델을 생성한 후, HTTP Outbound Traffic속의 정보를 정상 행위 모델과 비교하여 웹 공격을 탐지한다. 실제 침입된 환경에서의 검증 분석을 통해, 제안기법이 웹 공격에 대해 0.0001%의 오탐율과 96%의 우수한 탐지능력을 보임을 제시한다.
https://doi.org/10.9708/jksci.2012.17.5.033 인용 PDF KSCI

다양한 기계학습 기법의 암상예측 적용성 비교 분석 (Comparative Application of Various Machine Learning Techniques for Lithology Predictions)

정진아;박은규
- 한국지하수토양환경학회지:지하수토양환경
- /
- 제21권3호
- /
- pp.21-34
- /
- 2016
In the present study, we applied various machine learning techniques comparatively for prediction of subsurface structures based on multiple secondary information (i.e., well-logging data). The machine learning techniques employed in this study are Naive Bayes classification (NB), artificial neural network (ANN), support vector machine (SVM) and logistic regression classification (LR). As an alternative model, conventional hidden Markov model (HMM) and modified hidden Markov model (mHMM) are used where additional information of transition probability between primary properties is incorporated in the predictions. In the comparisons, 16 boreholes consisted with four different materials are synthesized, which show directional non-stationarity in upward and downward directions. Futhermore, two types of the secondary information that is statistically related to each material are generated. From the comparative analysis with various case studies, the accuracies of the techniques become degenerated with inclusion of additive errors and small amount of the training data. For HMM predictions, the conventional HMM shows the similar accuracies with the models that does not relies on transition probability. However, the mHMM consistently shows the highest prediction accuracy among the test cases, which can be attributed to the consideration of geological nature in the training of the model.
https://doi.org/10.7857/JSGE.2016.21.3.021 인용 PDF KSCI KPUBS HTML

2층 구조의 입체 시각형 신경망 기반 음소인식 (Phoneme Recognition based on Two-Layered Stereo Vision Neural Network)

Kim, Sung-Ill;Kim, Nag-Cheol
- 한국멀티미디어학회논문지
- /
- 제5권5호
- /
- pp.523-529
- /
- 2002
본 연구는 입체 시각을 위한 신경망에 대한 연구 결과로서 인간의 음성을 인식하는데 적용된다. 입체 시각신경망(SVNN)에 기반한 음성인식에서, 먼저 입력된 음성 신호를 표준 모델과 비교함으로써 유사성이 얻어진다. 이 값들은 다이나믹한 처리 과정으로 주어지고 이웃한 신경소자들 사이에서 경쟁적이고 협력적인 처리를 거치게 된다. 이러한 다이나믹한 처리과정을 통해 단 하나의 가장 우수한 신경세포(winner neuron)만이 최후에 검출된다. 비교연구에서 2층 구조의 SVNN은 HMM 인식기보다 인식정확도 측면에서 7.7% 더 높았다. 평가 결과. SVNN은 기손리 HMM 인식기 성능을 능가하는 것으로 나타났다.
PDF

부분어절 조건부확률 기반 동형이의어 태깅 모델 (Korean Homograph Tagging Model based on Sub-Word Conditional Probability)

신준철;옥철영
- 정보처리학회논문지:소프트웨어 및 데이터공학
- /
- 제3권10호
- /
- pp.407-420
- /
- 2014
한국어 형태소 분석 및 태깅은 크게 2가지 단계로 나뉜다. 첫 번째 단계는 어절을 분석하여 후보들을 생성하는 것으로, 여러 의미를 가진 어절은 이 단계에서 다양한 후보들이 생성된다. 두 번째는 문맥 정보를 이용하여 후보 중에 가장 적절한 하나를 선택하는 단계로, 흔히 태깅이라 한다. 일반적으로 두 번째 단계에서는 은닉 마르코프 모델(Hidden Markov Model, 이하 HMM)을 자주 사용하지만, 본 논문에서는 처리속도를 향상시킨 부분어절 조건부확률 모델을 제안한다. 이 모델은 우선적으로 인접 어절 정보를 이용하여 현재 처리 중인 어절의 의미를 결정하고, 예외적으로 용언이 인접한 경우에만 후보 정보의 극히 일부분을 이용한다. 실험 결과 정확률은 HMM의 96.49%보다 0.07% 낮았지만, 처리 소요 시간을 약 53% 감소시켰다.
https://doi.org/10.3745/KTSDE.2014.3.10.407 인용 PDF KSCI

저작권 보호를 위한 HMM기반의 음악 식별 시스템 (HMM-based Music Identification System for Copyright Protection)

김희동;김도현;김지환
- 말소리와 음성과학
- /
- 제1권1호
- /
- pp.63-67
- /
- 2009
In this paper, in order to protect music copyrights, we propose a music identification system which is scalable to the number of pieces of registered music and robust to signal-level variations of registered music. For its implementation, we define the new concepts of 'music word' and 'music phoneme' as recognition units to construct 'music acoustic models'. Then, with these concepts, we apply the HMM-based framework used in continuous speech recognition to identify the music. Each music file is transformed to a sequence of 39-dimensional vectors. This sequence of vectors is represented as ordered states with Gaussian mixtures. These ordered states are trained using Baum-Welch re-estimation method. Music files with a suspicious copyright are also transformed to a sequence of vectors. Then, the most probable music file is identified using Viterbi algorithm through the music identification network. We implemented a music identification system for 1,000 MP3 music files and tested this system with variations in terms of MP3 bit rate and music speed rate. Our proposed music identification system demonstrates robust performance to signal variations. In addition, scalability of this system is independent of the number of registered music files, since our system is based on HMM method.
PDF

2단계 히든마코프 모델을 이용한 제스쳐의 성능향상 연구 (Improvement of Gesture Recognition using 2-stage HMM)

정훤재;박현준;김동한
- 제어로봇시스템학회논문지
- /
- 제21권11호
- /
- pp.1034-1037
- /
- 2015
In recent years in the field of robotics, various methods have been developed to create an intimate relationship between people and robots. These methods include speech, vision, and biometrics recognition as well as gesture-based interaction. These recognition technologies are used in various wearable devices, smartphones and other electric devices for convenience. Among these technologies, gesture recognition is the most commonly used and appropriate technology for wearable devices. Gesture recognition can be classified as contact or noncontact gesture recognition. This paper proposes contact gesture recognition with IMU and EMG sensors by using the hidden Markov model (HMM) twice. Several simple behaviors make main gestures through the one-stage HMM. It is equal to the Hidden Markov model process, which is well known for pattern recognition. Additionally, the sequence of the main gestures, which comes from the one-stage HMM, creates some higher-order gestures through the two-stage HMM. In this way, more natural and intelligent gestures can be implemented through simple gestures. This advanced process can play a larger role in gesture recognition-based UX for many wearable and smart devices.
https://doi.org/10.5302/J.ICROS.2015.15.0089 인용 PDF KSCI

HMM(Hidden Markov Model)을 이용한 핸드 제스처인식 (Hand Gesture Recognition Using HMM(Hidden Markov Model))

하정요;이민호;최형일
- 디지털콘텐츠학회 논문지
- /
- 제10권2호
- /
- pp.291-298
- /
- 2009
본 논문에서는 비전 기반의 실시간 손 모양 인식을 위한 알고리즘을 제안하였다. 먼저 피부색을 검출하기 위해 RGB 컬러모델을 YCbCr 컬러모델로 변환하고, 색차성분인 CbCr을 이용하여 피부색을 검출한다. 검출 후 피부색은 흰색, 그 이외의 색은 검은색으로 이진화 하였다. 이진화 후 팔 영역과 얼굴영역을 제거하고, 손 영역만 검출하여 손의 무게중심을 구하기 위해 가로, 세로로 프로젝션을 수행한다. 손의 무게중심을 찾은 후에 손의 궤적을 추적하기 위해 칼만필터를 이용하였다. 손의 궤적 추적 후에 손 모양을 인식시키기 위해 HMM(Hidden Markov Model)을 이용하여 6가지 손의 모양을 학습한 후 인식하였다. 실험을 통하여 제안한 알고리즘의 효과를 입증하였다.
PDF

스펙트럼 기반 여기신호 추출을 통한 HMM기반 음성합성기의 음질 개선 방법 (Spectrum Based Excitation Extraction for HMM Based Speech Synthesis System)

이봉진;김성우;백순호;김종진;강홍구
- 한국음향학회지
- /
- 제29권1호
- /
- pp.82-90
- /
- 2010
본 논문에서는 HMM기반 음성합성시스템에서 합성음의 음질 개선을 위한 방법으로 스펙트럼 정보에 기반한 여기신호 추출방법을 제안한다. 제안된 방법은 스펙트럼 정보와 여기신호를 함께 통계적 모델로 만든 후에 합성 과정에서 스펙트럼 정보를 기반으로 여기신호를 추출해 냄으로써 스펙트럼 파라메터에 가장 적합한 여기신호를 사용할 수 있다. 제안된 방법으로 합성음의 음질을 MUSHRA 테스트 및 WB-FESQ점수를 통해 확인해 본 결과, 비슷한 조건에서 기존에 사용되는 STRAIGHT 방법을 이용한 합성음보다 좋은 음질을 얻을 수 있었다.
https://doi.org/10.7776/ASK.2010.29.1.082 인용 PDF KSCI

상관성있는 VQ-HMM을 이용한 고립 단어 인식 (Isolated Words Recognition using Correlation VQ-HMM)

이진수
- 한국음향학회:학술대회논문집
- /
- 한국음향학회 1993년도 학술논문발표회 논문집 제12권 1호
- /
- pp.109-112
- /
- 1993
In this paper, we propose the modified VQ, applied correlation between codewords in order to reduce the error rate due to personal and speakers' temporal variation. Such a modified VQ is used in the stage of preprocessing of HMM and the temporal variation is absorbed by nonlinear Decimation and Interpolation of vowel part that we obtain higher recognition rate than not so case. The objects of experiment are Korea 142 DDD regional names and we show that the proposed method increase the recognition rate.
PDF

검색결과 963건 처리시간 0.022초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)