• Title/Summary/Keyword: Hidden Markov

Named Entity Boundary Recognition Using Hidden Markov Model and Hierarchical Information (은닉 마르코프 모델과 계층 정보를 이용한 개체명 경계 인식)

  • Lim, Heui-Seok
    • Journal of the Korea Academia-Industrial cooperation Society / v.7 no.2 / pp.182-187 / 2006
  • This paper proposes a method for recognizing the boundaries of named entities using a hidden Markov model (HMM) and ontology information for biological named entities. To alleviate the sparse data problem in the HMM, we use a smoothing method based on 31 word features and hierarchical information. The GENIA corpus version 2.1 was used to train and evaluate the proposed boundary recognition system. The experimental results show that the proposed system outperforms the previous system, which did not use hierarchical ontology information or the smoothing technique. The system also shows improved execution time for boundary recognition.

Online Selective-Sample Learning of Hidden Markov Models for Sequence Classification

  • Kim, Minyoung
    • International Journal of Fuzzy Logic and Intelligent Systems / v.15 no.3 / pp.145-152 / 2015
  • We consider an online selective-sample learning problem for sequence classification, where the goal is to learn a predictive model using a stream of data samples whose class labels can be selectively queried by the algorithm. Given that there is a limit to the total number of queries permitted, the key issue is choosing the most informative and salient samples for their class labels to be queried. Recently, several aggressive selective-sample algorithms have been proposed under a linear model for static (non-sequential) binary classification. We extend the idea to hidden Markov models for multi-class sequence classification by introducing reasonable measures for the novelty and prediction confidence of the incoming sample with respect to the current model, on which the query decision is based. For several sequence classification datasets/tasks in online learning setups, we demonstrate the effectiveness of the proposed approach.

Analysis of spatio-temporal variation on water quality using hidden Markov model (은닉 마코프 모형을 이용한 시공간적 수질 변동성 분석)

  • Jung, Min-Kyu;Cho, Hemie;Kwon, Hyun-Han
    • Proceedings of the Korea Water Resources Association Conference / 2020.06a / pp.111-111 / 2020
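  • As changes in river environments and climate make the mechanisms of water pollution more complex, uncertainty assessment studies that account for diverse factors are needed. Among river water quality problems, eutrophication in particular has become a social and political issue following development-driven changes to river environments. In this study, we examined the overall pattern of water quality change over the past seven years and used a multivariate hidden Markov model (MHMM), a machine-learning-based classification approach, to effectively account for the spatio-temporal dependence of chlorophyll-a (Chl-a) concentration. Using monthly water quality and hydrological data, we clustered the variability of Chl-a and estimated the transition probabilities of water quality states to the following month. We evaluated the relationship between Chl-a and water quality and hydrometeorological conditions. As a result, the spatio-temporal transitions of water quality states were accurately identified, and their potential causes are discussed.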

Selection of features and hidden Markov model parameters for English word recognition from Leap Motion air-writing trajectories

  • Deval Verma;Himanshu Agarwal;Amrish Kumar Aggarwal
    • ETRI Journal / v.46 no.2 / pp.250-262 / 2024
  • Air-writing recognition is relevant in areas such as natural human-computer interaction, augmented reality, and virtual reality. A trajectory is the most natural way to represent air writing. We analyze the recognition accuracy of words written in air considering five features, namely, writing direction, curvature, trajectory, orthocenter, and ellipsoid, as well as different parameters of a hidden Markov model classifier. Experiments were performed on two representative datasets, whose sample trajectories were collected using a Leap Motion Controller from a fingertip performing air writing. Dataset D1 contains 840 English words from 21 classes, and dataset D2 contains 1600 English words from 40 classes. A genetic algorithm was combined with a hidden Markov model classifier to obtain the best subset of features. The combination {trajectory, orthocenter, writing direction, curvature} provided the best feature set, achieving recognition accuracies on datasets D1 and D2 of 98.81% and 83.58%, respectively.

Assessing Misdiagnosis of Relapse in Patients with Gastric Cancer in Iran Cancer Institute Based on a Hidden Markov Multi-state Model

  • Zare, Ali;Mahmoodi, Mahmood;Mohammad, Kazem;Zeraati, Hojjat;Hosseini, Mostafa;Naieni, Kourosh Holakouie
    • Asian Pacific Journal of Cancer Prevention / v.15 no.9 / pp.4109-4115 / 2014
  • Background: Accurate assessment of disease progression requires a proper understanding of the natural disease process, which is often hidden and unobservable. For this purpose, disease status should be clearly detected, but in most diseases this is not possible. This study therefore aims to present a model that both investigates the unobservable disease process and accounts for the probability of error in diagnosing disease states. Materials and Methods: Data from 330 patients with gastric cancer who underwent surgery at the Iran Cancer Institute from 1995 to 1999 were analyzed. A hidden Markov multi-state model was employed to estimate and assess the effect of demographic, diagnostic, and clinical factors, as well as medical and post-surgical variables, on transition rates and on the probability of misdiagnosis of relapse. Results: Classification errors of patients in the alive state without a relapse ($e_{21}$) and with a relapse ($e_{12}$) were 0.22 (95% CI: 0.04-0.63) and 0.02 (95% CI: 0.00-0.09), respectively. Only age and the number of renewed treatments affected misdiagnosis of relapse. In addition, patient age and distant metastasis were among the factors affecting the occurrence of relapse (state1 $\rightarrow$ state2), while the number of renewed treatments and the type and extent of surgery had a significant effect on death hazard without relapse (state1 $\rightarrow$ state3) and death hazard with relapse (state2 $\rightarrow$ state3). Conclusions: A hidden Markov multi-state model makes it possible to estimate classification error between different states of disease. Moreover, based on this model, factors affecting the probability of this error can be identified, which helps researchers understand the mechanisms of classification error.

A hidden Markov model for predicting global stock market index (은닉 마르코프 모델을 이용한 국가별 주가지수 예측)

  • Kang, Hajin;Hwang, Beom Seuk
    • The Korean Journal of Applied Statistics / v.34 no.3 / pp.461-475 / 2021
  • The hidden Markov model (HMM) is a statistical model in which the system consists of two elements: hidden states and observable results. The HMM has been actively used in various fields, especially for time series data in the financial sector, because it has a rich mathematical structure. Based on HMM theory, this study applies the model to predict the domestic KOSPI200 stock index as well as global stock indexes such as the NIKKEI225, HSI, S&P500, and FTSE100. In addition, we compare the results of the HMM with those of support vector regression (SVR), which is frequently used to predict stock prices owing to recent developments in artificial intelligence.
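
The two-element structure described in this abstract (hidden states that emit observable results) can be illustrated with a minimal sketch of the forward algorithm, which computes the likelihood of an observation sequence under an HMM. The two states, the two observation symbols, and every probability below are made-up toy values for illustration; none of them come from the paper, and the function name `forward_likelihood` is hypothetical.

```python
def forward_likelihood(init, trans, emit, obs):
    """Return P(obs) under a discrete HMM using the forward algorithm.

    init[s]     : probability of starting in hidden state s
    trans[p][s] : probability of moving from hidden state p to state s
    emit[s][o]  : probability that hidden state s emits observation o
    obs         : list of observation indices (must be non-empty)
    """
    n_states = len(init)
    # alpha[s] = P(observations so far, current hidden state = s)
    alpha = [init[s] * emit[s][obs[0]] for s in range(n_states)]
    for o in obs[1:]:
        alpha = [
            sum(alpha[p] * trans[p][s] for p in range(n_states)) * emit[s][o]
            for s in range(n_states)
        ]
    # Summing over the final hidden state gives the sequence likelihood.
    return sum(alpha)


# Toy parameters (assumed for illustration only).
init = [0.6, 0.4]                   # two hypothetical hidden states
trans = [[0.7, 0.3], [0.4, 0.6]]    # each row sums to 1
emit = [[0.9, 0.1], [0.2, 0.8]]     # two hypothetical observation symbols

likelihood = forward_likelihood(init, trans, emit, [0, 1, 0])
print(likelihood)
```

The recursion keeps only one vector of size equal to the number of hidden states, so the cost grows linearly with the sequence length rather than exponentially as it would if every hidden path were enumerated.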

Markov Model-based Static Obstacle Map Estimation for Perception of Automated Driving (자율주행 인지를 위한 마코브 모델 기반의 정지 장애물 추정 연구)

  • Yoon, Jeongsik;Yi, Kyongsu
    • Journal of Auto-vehicle Safety Association / v.11 no.2 / pp.29-34 / 2019
  • This paper presents a new method for constructing a static obstacle map. A static obstacle map is important because it is used for path planning and decision making. Several established approaches generate a static obstacle map using a grid method and a counting algorithm. However, these approaches are occasionally ineffective because the density of the LiDAR layers is low. Our approach solves this problem by applying probability theory. First, we convert every LiDAR point to a Gaussian distribution to account for the uncertainty of the point. This Gaussian distribution represents the likelihood of an obstacle. Second, we model the dynamic transition of the static obstacle map by adopting the hidden Markov model. Because the dynamic characteristics of the vehicle relate only to the conditions of the next stage, a more accurate obstacle map can be obtained using the hidden Markov model. Experimental data obtained from test driving demonstrate that our approach is suitable for mapping static obstacles. In addition, the results show that our algorithm has an advantage in estimating not only static obstacles but also the dynamic characteristics of moving targets such as driving vehicles.

Application of Hidden Markov Chain Model to identify temporal distribution of sub-daily rainfall in South Korea

  • Chandrasekara, S.S.K;Kim, Yong-Tak;Kwon, Hyun-Han
    • Proceedings of the Korea Water Resources Association Conference / 2018.05a / pp.499-499 / 2018
  • Hydro-meteorological extremes are commonplace these days. Therefore, it is important to identify extreme hydrological events in advance to mitigate the damage they cause. In this context, exploring the temporal distribution of sub-daily extreme rainfall at multiple rain gauges would be informative for identifying different states that describe the severity of a disaster. This study proposes a hidden Markov chain model (HMM) based rainfall analysis tool to understand temporal sub-daily rainfall patterns over South Korea. Hourly and daily rainfall data between 1961 and 2017 for 92 stations were used for the study. The HMM was applied to daily rainfall series to identify the hidden states associated with rainfall frequency and intensity, and the estimated hidden states were then used to derive a temporal distribution of daily extreme rainfall. Transitions between states over time were clearly identified, because the HMM explicitly captures the temporal dependence in the daily rainfall states. The proposed HMM was a very useful tool for deriving the temporal attributes of daily rainfall in South Korea. Further, daily rainfall series were disaggregated into sub-daily rainfall sequences based on the temporal distribution of hourly rainfall data.

Development of Multi-Site Daily Rainfall Simulation Based on Homogeneous Hidden Markov Chain Model Coupled with Chow-Liu Tree Structures (Chow-Liu Tree 모형과 동질성 Hidden Markov Model을 연계한 다지점 일강수량 모의기법 개발)

  • Kwon, Hyun-Han;Kim, Tae Jeong;Kim, Oon Ki;Lee, Dong Ryul
    • Journal of Korea Water Resources Association / v.46 no.10 / pp.1029-1040 / 2013
  • This study aims to develop a multivariate daily rainfall simulation model that considers spatial coherence across a watershed. The existing hidden Markov model (HMM) has mainly been applied to single-site cases, so spatial coherence has not been properly addressed. In this regard, we propose an HMM coupled with a Chow-Liu Tree (CLT), which is designed to consider inter-dependences across rainfall networks. The proposed approach is applied to the Han-River watershed, where long-term and reliable hydrologic data are available, and a rigorous validation is conducted to verify the model's capability. The proposed model showed better performance in reproducing daily rainfall statistics as well as seasonal rainfall statistics. The correlation matrices across stations for observation and simulation were also compared and examined. The comparison confirmed that spatial coherence was well reproduced by the CLT-HMM model.

A Hybrid SVM-HMM Method for Handwritten Numeral Recognition

  • Kim, Eui-Chan;Kim, Sang-Woo
    • Institute of Control, Robotics and Systems: Conference Proceedings / 2003.10a / pp.1032-1035 / 2003
  • The field of handwriting recognition has been researched for many years. A hybrid classifier has been shown to increase the recognition rate compared with a single classifier. In this paper, we combine a support vector machine (SVM) and a hidden Markov model (HMM) for offline handwritten numeral recognition. To improve performance, we extract features adapted to each classifier and propose a modified SVM decision structure. The experimental results show that the proposed method achieves an improved recognition rate for handwritten numeral recognition.