• Title/Summary/Keyword: Finite state automata

Search Result 48, Processing Time 0.038 seconds

An Interrupted and Uninterrupted Compound Unit Recognizer using Regular Expression (정규표현을 이용한 연속 및 불연속 복합단위 인식기)

  • Yuh, Sang-Hwa;Seo, Jung-Yun
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2003.11a
    • /
    • pp.487-490
    • /
    • 2003
  • 기계번역 시스템에서 복합단위 처리는 원문의 분석 부담을 줄이고 조합적으로 대역문의 의미를 생성하지 못하는 원문의 처리를 위해 필수적이다. 본 논문에서는 정규표현(Regular Expression)을 이용하여 영어의 연속(Non-Interrupted) 및 불연속(Interrupted) 복합 단위를 인식하는 복합단위 인식기를 제안한다. 제안된 방법은, 기존에 trie 와 같은 index 의 갱신 과정이 불필요하므로, 다수의 작업자에 의해 복합단위 사전을 동시에 구축하는 경우에, 한 작업자의 결과가 실시간으로 다른 작업자의 작업에 반영되는 장점이 있으며, 복합단위 인식에 있어 정규 표현을 이용함으로써 복합단위 인식기의 성능을 선언적으로 향상시킬 수 있다. 번역 실행시의 고속 탐색을 위해서는 전체 복합단위로부터 FSA(finite State Automata) 를 자동으로 구축하여 빠른 속도로 인식 가능하도록 하였다.

  • PDF

An Information Extraction System Using Finite State Automata (유한 오토마타를 이용한 정보 추출 시스템의 구현 및 분석)

  • Oh, Hyo-Jung;Lim, Jeong-Mook;Lee, Mann-Ho;Myaeng, Sung-Hyon
    • Annual Conference on Human and Language Technology
    • /
    • 1998.10c
    • /
    • pp.97-104
    • /
    • 1998
  • 인터넷의 사용자가 폭발적으로 증가함에 따라, 인터넷을 이용한 다양한 정보 서비스가 생성되었으며, 이로 인해 일반 사용자들이 접할 수 있는 디지털 문서의 양은 기하 급수적으로 증가 되었다. 본 논문에서는 유사한 정보를 갖는 다량의 문서들로부터 사용자가 원하는 정보만을 추출하는 정보 추출 시스템의 개발 과정 및 결과를 기술한다. 개발된 시스템은 필요한 정보를 포함하는 문장들을 걸러 낸 후, 필요한 사실정보의 출현을 나타내는 패턴을 사용한 유한 오토마타를 통하여 사용자가 원하는 정보를 추출한다. 관광지 안내 텍스트를 대상으로 한 실험 및 분석 결과를 기술한다.

  • PDF

A Formal Model of Coordination for Supporting Community Computing in a Ubiquitous Environment (유비쿼터스 환경에서 커뮤니티 컴퓨팅 지원을 위한 코디네이터 개발)

  • Nam, Jin-Gyu;Kim, Hyun-Woo;Shin, Dong-Min;Park, Jae-Il;Hur, Sun
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.31 no.3
    • /
    • pp.43-51
    • /
    • 2008
  • Recent advances in mobile computing technologies and platform-independent information systems have enabled to realize a ubiquitous environment. Community computing has been developed as a useful tool for realizing collaborative services in a ubiquitous environment. In this paper, we present a formal model of a ubiquitous space that takes community concept into consideration and propose two management frameworks that prevent conflicts among communities. To demonstrate the validity of the proposed frameworks, an example for coordinating two communities is provided.

A Transformation-Based Learning Method on Generating Korean Standard Pronunciation

  • Kim, Dong-Sung;Roh, Chang-Hwa
    • Proceedings of the Korean Society for Language and Information Conference
    • /
    • 2007.11a
    • /
    • pp.241-248
    • /
    • 2007
  • In this paper, we propose a Transformation-Based Learning (TBL) method on generating the Korean standard pronunciation. Previous studies on the phonological processing have been focused on the phonological rule applications and the finite state automata (Johnson 1984; Kaplan and Kay 1994; Koskenniemi 1983; Bird 1995). In case of Korean computational phonology, some former researches have approached the phonological rule based pronunciation generation system (Lee et al. 2005; Lee 1998). This study suggests a corpus-based and data-oriented rule learning method on generating Korean standard pronunciation. In order to substituting rule-based generation with corpus-based one, an aligned corpus between an input and its pronunciation counterpart has been devised. We conducted an experiment on generating the standard pronunciation with the TBL algorithm, based on this aligned corpus.

  • PDF

Supervisory Control of Dynamic Oligopolistic Markets: How can Firms Reach Profit-Maximization? (동적 과점시장의 관리제어: 기업들은 어떻게 이윤극대화에 이를 수 있는가?)

  • Park, Seong-Jin
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.17 no.4
    • /
    • pp.304-312
    • /
    • 2011
  • In an oligopolistic market, only a few firms account for most or all of total production, e.g., automobile, steel, and computer industries. For a dynamic oligopolistic market with two firms competing in quantities, we show that supervisory control theory of discrete event systems provides a novel approach to solve the dynamic oligopoly problem with the aim of maximizing the profits of both firms. Specifically, we show that the controllability, observability, and nonblocking property (which are the core concepts in supervisory control theory) are the necessary and sufficient conditions for two oligopolistic firms in disequilibrium to eventually reach equilibrium states of maximizing the profits of both firms.

Connected Digit Recognition Using Phonetical Features (음성학적 특징을 이용한 연속 숫자음인식)

  • 김민정
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1998.06d
    • /
    • pp.72-75
    • /
    • 1998
  • 본 논문에서는 숫자음 인식시스템의 인식률 향상을 위한 연구로서 4연속 숫자음을 대상으로 연음 현상 및 경음화 현상등과 같은 음성학적 특징을 고려하여 숫자음에 강건한 모델을 작성하는 방법을 제안하고 인식실험을 통하여 그 유효성을 확인하고자 한다. 이를 위하여 음성자료로서는 국어공학센터(KLE)에서 채록한 4연속 숫자음을 사용하며 인식의 기본단위로서 음향학적 특징을 고려한 19개의 연속분포 HMM을 유사음소 단위(Phoneme Like Units ; PLUS) 로 사용한다. 또한 , 인식실험에 있어서는 기존의 방법으로 모델을 작성한 경우와 연음 현상과 경음화 현상 등과 같은 음성학적 특징을 고려하여 모델을 작성한 경우에 대해서 유한상태 오토마타(finite State Automata ; FSA)에 의한 구문제어를 통한 OPDP(One Pass Dynamic Programming)법으로 인식실험을 수행하여 그 결과를 비교 검토하였다. 그 결과, 기존이 방법의 경우 64.6%, 음성학적 특징을 고려한 경우 68.6%의 인식률을 보여, 음성학적 특징을 고려한 경우가 4.0% 향상된 인식률을 얻어 제안한 방법의 유효성을 확인하였다.

  • PDF

90/150 RCA Corresponding to Maximum Weight Polynomial with degree 2n (2n 차 최대무게 다항식에 대응하는 90/150 RCA)

  • Choi, Un-Sook;Cho, Sung-Jin
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.13 no.4
    • /
    • pp.819-826
    • /
    • 2018
  • The generalized Hamming weight is one of the important parameters of the linear code. It determines the performance of the code when the linear codes are applied to a cryptographic system. In addition, when the block code is decoded by soft decision using the lattice diagram, it becomes a measure for evaluating the state complexity required for the implementation. In particular, a bit-parallel multiplier on finite fields based on trinomials have been studied. Cellular automata(CA) has superior randomness over LFSR due to its ability to update its state simultaneously by local interaction. In this paper, we deal with the efficient synthesis of the pseudo random number generator, which is one of the important factors in the design of effective cryptosystem. We analyze the property of the characteristic polynomial of the simple 90/150 transition rule block, and propose a synthesis algorithm of the reversible 90/150 CA corresponding to the trinomials $x^2^n+x^{2^n-1}+1$($n{\geq}2$) and the 90/150 reversible CA(RCA) corresponding to the maximum weight polynomial with $2^n$ degree by using this rule block.

A state transition based situation modeling and its application to design of SAC(Situation-Action Converter) for situation-aware control for embedded systems (임베디드 시스템에서의 상황인식 제어를 위한 상태전이 기반 상황 모델링과 이를 응용한 상황-동작 변환기 (SAC)의 설계)

  • Heo Gil;Park Joshua;Cho We-Duke;Choi Jae-Young
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.33 no.9
    • /
    • pp.642-649
    • /
    • 2006
  • In order to recognize a situation from a environment which provides an intelligent service, we propose state-transition based situation modeling which is suitable for a low computing power and restricted resources like embedded systems, and we designed its application to a situation-action converter(SAC)which is consist of two parts; situation detector recognized wanted situations and action generator generated various control actions. Then, we implemented a situation manager for smart scheduler service by using a SAC which is installed to a ARM processor based embedded Linux evaluation board.

Stochastic Pronunciation Lexicon Modeling for Large Vocabulary Continous Speech Recognition (확률 발음사전을 이용한 대어휘 연속음성인식)

  • Yun, Seong-Jin;Choi, Hwan-Jin;Oh, Yung-Hwan
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.2
    • /
    • pp.49-57
    • /
    • 1997
  • In this paper, we propose the stochastic pronunciation lexicon model for large vocabulary continuous speech recognition system. We can regard stochastic lexicon as HMM. This HMM is a stochastic finite state automata consisting of a Markov chain of subword states and each subword state in the baseform has a probability distribution of subword units. In this method, an acoustic representation of a word can be derived automatically from sample sentence utterances and subword unit models. Additionally, the stochastic lexicon is further optimized to the subword model and recognizer. From the experimental result on 3000 word continuous speech recognition, the proposed method reduces word error rate by 23.6% and sentence error rate by 10% compare to methods based on standard phonetic representations of words.

  • PDF

A Study on Spoken Digits Analysis and Recognition (숫자음 분석과 인식에 관한 연구)

  • 김득수;황철준
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.6 no.3
    • /
    • pp.107-114
    • /
    • 2001
  • This paper describes Connected Digit Recognition with Considering Acoustic Feature in Korea. The recognition rate of connected digit is usually lower than word recognition. Therefore, speech feature parameter and acoustic feature are employed to make robust model for digit, and we could confirm the effect of Considering. Acoustic Feature throughout the experience of recognition. We used KLE 4 connected digit as database and 19 continuous distributed HMM as PLUs(Phoneme Like Units) using phonetical rules. For recognition experience, we have tested two cases. The first case, we used usual method like using Mel-Cepstrum and Regressive Coefficient for constructing phoneme model. The second case, we used expanded feature parameter and acoustic feature for constructing phoneme model. In both case, we employed OPDP(One Pass Dynamic Programming) and FSA(Finite State Automata) for recognition tests. When appling FSN for recognition, we applied various acoustic features. As the result, we could get 55.4% recognition rate for Mel-Cepstrum, and 67.4% for Mel-Cepstrum and Regressive Coefficient. Also, we could get 74.3% recognition rate for expanded feature parameter, and 75.4% for applying acoustic feature. Since, the case of applying acoustic feature got better result than former method, we could make certain that suggested method is effective for connected digit recognition in korean.

  • PDF