Search | Korea Science

TAKTAG: Two phase learning method for hybrid statistical/rule-based part-of-speech disambiguation (TAKTAG: 통계와 규칙에 기반한 2단계 학습을 통한 품사 중의성 해결)

Shin, Sang-Hyun;Lee, Geun-Bae;Lee, Jong-Hyeok
- Annual Conference on Human and Language Technology
- /
- 1995.10a
- /
- pp.169-174
- /
- 1995
품사 태깅은 형태소 분석 이후 발생한 모호성을 제거하는 것으로, 통계적 방법과 규칙에 기 반한 방법이 널리 사용되고 있다. 하지만, 이들 방법론에는 각기 한계점을 지니고 있다. 통계적인 방법인 은닉 마코프 모델(Hidden Markov Model)은 유연성(flexibility)을 지니지만, 교착어(agglutinative language)인 한국어에 있어서 제한된 윈도우로 인하여, 중의성 해결의 실마리가 되는 어휘나 품사별 제대로 참조하지 못하는 경우가 있다. 반면, 규칙에 기반한 방법은 차체가 품사에 영향을 받으므로 인하여, 새로운 태그집합(tagset)이나 언어에 대하여 유연성이나 정확성을 제공해 주지 못한다. 이러한 각기 서로 다른 방법론의 한계를 극복하기 위하여, 본 논문에서는 통계와 규칙을 통합한 한국어 태깅 모델을 제안한다. 즉 통계적 학습을 통한 통계 모델이후에 2차적으로 규칙을 자동학습 하게 하여, 통계모델이 다루지 못하는 범위의 규칙을 생성하게 된다. 이처럼 2단계의 통계와 규칙의 자동 학습단계를 거치게 됨으로써, 두개 모델의 단점을 보강한 높은 정확도를 가지는 한국어 태거를 개발할 수 있게 하였다.
PDF

A Study on Improving Prediction Accuracy by Modeling Multiple Similar Time Series (다중 유사 시계열 모델링 방법을 통한 예측정확도 개선에 관한 연구)

Cho, Young-Hee;Lee, Gye-Sung
- The Journal of the Institute of Internet, Broadcasting and Communication
- /
- v.10 no.6
- /
- pp.137-143
- /
- 2010
A method for improving prediction accuracy through processing time series data has been studied in this research. We have designed techniques to model multiple similar time series data and avoided the shortcomings of single prediction model. We predicted the future changes by effective rules derived from these models. The methods for testing prediction accuracy consists of three types: fixed interval, sliding, and cumulative method. Among the three, cumulative method produced the highest accuracy.
PDF KSCI

An Interface Agent for Creating Information Extraction Rules and Ontology in Electronic Commerce (전자상거래 정보추출 규칙과 Ontology 생성을 위한 인터페이스 에이전트)

서희경;양재영;구남숙;최중민
- Proceedings of the Korean Information Science Society Conference
- /
- 1999.10b
- /
- pp.30-32
- /
- 1999
인터넷의 증가로 온라인 상점들의 수는 매우 빠르게 증가하고 있다. 상점의 수가 늘어날수록 사용자가 이러한 상점들에서 원하는 정보를 찾는 일은 쉽지 않다. 사용자의 어려움을 줄이고자 여러 쇼핑몰의 정보들을 통합해서 보여주는 전자상거래 통합 시스템들이 생겨나고 있지만, 새로운 쇼핑몰이 추가될 때마다 관리자가 추가되는 쇼핑몰의 정보를 추출하기 위한 규칙이나, Ontology등을 수동으로 만들거나 확장해야 하기 때문에 사람이 소비해야 하는 시간과 노력이 많고, 시스템을 관리하는 사람에 다라 정보추출의 정확도도 다르다. 따라서 사람이 소비하는 시간을 줄이고, 좀 더 정확한 정보추출을 위해 쇼핑몰마다 만들어야 하는 규칙과 그러한 규칙 생성에 필요한 Ontology를 자동으로 생성하는 방법과 이 방법에서 요구되는 사용자의 입력을 최소한 줄인 인터페이스 에이전트를 제안한다.
PDF

빈발 패턴 네트워크에서 연관 규칙 발견을 위한 아이템 클러스터링

O, Gyeong-Jin;Jeong, Jin-Guk;Jo, Geun-Sik
- Proceedings of the Korea Inteligent Information System Society Conference
- /
- 2007.05a
- /
- pp.321-328
- /
- 2007
데이터마이닝은 대용량의 데이터에 숨겨진 의미있고 유용한 패턴과 상관관계를 추출하여 의사결정에 활용하는 작업이다. 그 중에서도 고객 트랜잭션의 데이터베이스에서 아이템 사이에 존재하는 연관규칙을 찾는 것은 중요한 일이 되었다. Apriori 알고리즘 이후 연관규칙을 찾기 위해 대용량 데이터베이스로부터 압축된 의미있는 정보를 저장하기 위한 데이터 구조와 알고리즘들이 제안되어 왔다. 본 논문에서는 정점으로 아이템을 표현하고, 간선으로 두 아이템집합을 표현하는 빈발 패턴 네트워크(FPN)이라 불리는 새 자료 구조를 제안한다. 빈발 패턴 네트워크에서 아이템 사이의 연관 관계를 발견하기 위해 이 구조를 어떻게 효율적으로 사용 하느냐에 초점을 두고 있다. 구조의 효율적인 사용을 위하여 한 아이템이 클러스터 내의 아이템과는 유사도가 높고, 다른 클러스터의 아이템과는 유사도가 낮도록 네트워크의 정점을 클러스터링하는 방법을 사용한다. 실험은 신뢰도, 상관관계 그리고 간선 가중치 유사도를 이용하여 네트워크에서 아이템 클러스터링의 정확도를 보여준다. 본 논문의 실험 결과를 통해 신뢰도 유사도가 네트워크의 정점을 클러스터링할 때 클러스터의 정확성에 가장 많은 영향을 미친다는 것을 알 수 있었다.
PDF

Rule-based and Probabilistic Access Method for Event Recognition of Independent Objects (독립 객체의 이벤트 인식을 위한 규칙 기반 및 확률적 접근 방법)

Ko, Jae-Hyung;Choi, Chang-Gyu;Cho, Jeong-Hyun;Kim, Sung-Ho
- Proceedings of the Korean Information Science Society Conference
- /
- 2007.06c
- /
- pp.488-493
- /
- 2007
본 논문에서는 실내 또는 실외 환경에서의 비전 감시 시스템에 응용할 객체의 이벤트 인식 방법을 정의한다. 객체의 이벤트를 실시간으로 적용하기 위해서는 연산 시간이 적어야하며 이벤트 인식 정확률 또한 높아야 한다. 기존의 규칙 또는 확률 기반 이벤트 인식 방법은 객체의 움직임 이벤트에 대해 한정적이거나 객체의 특징 벡터 추출이 규칙에 어긋나는 경우 오류가 발생하여 이벤트 인식의 정확도가 떨어지는 문제점이 있다. 본 논문에서는 시간의 흐름에 따른 독립 객체의 특징 벡터와 확률 정보를 통해 정해진 규칙에 가장 가까운 상황에 대해 계속적으로 이루어지는 이벤트 인식 방법을 제안한다.
PDF

한글 단어를 발음 기호로 변환 시키는 인공신경망에 관한 연구

Yang, Jae-U;Kim, Doo-Hyeon
- ETRI Journal
- /
- v.10 no.3
- /
- pp.113-124
- /
- 1988
본 논문에서는 한글 단어를 발음 기호로 변환시키는 인공신경망의 설계와 이를 시뮬레이션한 결과에 대하여 논한다. 이 인공신경망은 multi-layer perceptron 구조를 가지며 error back-propagation 학습 알고리즘을 사용하였다. 이 인공신경망에 한글 발음 사전의 일부를 반복적으로 제시하여 학습시킨 결과, 학습한 단어에 대하여 최고 97%의 정확도로 변환 작업을 수행하였고 학습하지 않은 단어에 대해서는 91%의 정확도를 보였다. 이는 설계된 인공신경망이 발음 사전 내에 포괄적으로 내재되어 있는 발음규칙을 스스로 학습하였음을 나타낸다. 아울러 신경망의 학습 성취도와 입력 코드와의 관계도 연구하였는데, 한글단어를 발음기호로 변환하는 데에 있어서 compact 코드 보다 local 코드일 때 학습 성취도가 높은 것이 실험을 통해 밝혀졌다.
PDF

Improving Recall for Context-Sensitive Spelling Correction Rules using Conditional Probability Model with Dynamic Window Sizes (동적 윈도우를 갖는 조건부확률 모델을 이용한 한국어 문맥의존 철자오류 교정 규칙의 재현율 향상)

Choi, Hyunsoo;Kwon, Hyukchul;Yoon, Aesun
- Journal of KIISE
- /
- v.42 no.5
- /
- pp.629-636
- /
- 2015
The types of errors corrected by a Korean spelling and grammar checker can be classified into isolated-term spelling errors and context-sensitive spelling errors (CSSE). CSSEs are difficult to detect and to correct, since they are correct words when examined alone. Thus, they can be corrected only by considering the semantic and syntactic relations to their context. CSSEs, which are frequently made even by expert wiriters, significantly affect the reliability of spelling and grammar checkers. An existing Korean spelling and grammar checker developed by P University (KSGC 4.5) adopts hand-made correction rules for correcting CSSEs. The KSGC 4.5 is designed to obtain very high precision, which results in an extremely low recall. Our overall goal of previous works was to improve the recall without considerably lowering the precision, by generalizing CSSE correction rules that mainly depend on linguistic knowledge. A variety of rule-based methods has been proposed in previous works, and the best performance showed 95.19% of average precision and 37.56% of recall. This study thus proposes a statistics based method using a conditional probability model with dynamic window sizes. in order to further improve the recall. The proposed method obtained 97.23% of average precision and 50.50% of recall.
https://doi.org/10.5626/JOK.2015.42.5.629 인용 KSCI

A Music Recommendation System based on Context-awareness using Association Rules (연관규칙을 이용한 상황인식 음악 추천 시스템)

Oh, Jae-Taek;Lee, Sang-Yong
- Journal of Digital Convergence
- /
- v.17 no.9
- /
- pp.375-381
- /
- 2019
Recently, the recommendation system has attracted the attention of users as customized recommendation services have been provided focusing on fashion, video and music. But these services are difficult to provide users with proper service according to many different contexts because they do not use contextual information emerging in real time. When applied contextual information expands dimensions, it also increases data sparsity and makes it impossible to recommend proper music for users. Trying to solve these problems, our study proposed a music recommendation system to recommend proper music in real time by applying association rules and using relationships and rules about the current location and time information of users. The accuracy of the recommendation system was measured according to location and time information through 5-fold cross validation. As a result, it was found that the accuracy of the recommendation system was improved as contextual information accumulated.
https://doi.org/10.14400/JDC.2019.17.9.375 인용 PDF KSCI

A Personalized Clothing Recommender System Based on the Algorithm for Mining Association Rules (연관 규칙 생성 알고리즘 기반의 개인화 의류 추천 시스템)

Lee, Chong-Hyeon;Lee, Suk-Hoon;Kim, Jang-Won;Baik, Doo-Kwon
- Journal of the Korea Society for Simulation
- /
- v.19 no.4
- /
- pp.59-66
- /
- 2010
We present a personalized clothing recommender system - one that mines association rules from transaction described in ontologies and infers a recommendation from the rules. The recommender system can forecast frequently changing trends of clothing using the Onto-Apriori algorithm, and it makes appropriate recommendations for each users possible through the inference marked as meta nodes. We simulates the rule generator and the inferential search engine of the system with focus on accuracy and efficiency, and our results validate the system.
https://doi.org/10.9709/JKSS.2010.19.4.059 인용 PDF KSCI

Korean Compound Noun Decomposition Only Using Syllabic Information (음절 정보만 이용한 한국어 복합 명사 분해)

Park, Seong-Bae;Zhang, Byoung-Tak
- Annual Conference on Human and Language Technology
- /
- 2003.10d
- /
- pp.33-39
- /
- 2003
한국어에서는 복합 명사 생성이 매우 자유스럽다. 즉, 독립된 명사를 연속으로 붙여 쓰는 것이 가능하다. 하지만, 기계번역이나 정보 검색과 같이 복합 명사를 처리하는 시스템에서 정확한 분석을 위해서는 복합 명사를 다시 단일 명사들로 분해하는 과정이 필요하다. 본 논문에서는 한국어 복합 명사 분해를 위해 GECORAM(GEneralized Combination of Rule-based learning And Memory-based learning) 알고리듬을 제시한다. 규칙 학습 알고리듬의 장점은 생성된 학습 결과를 사람이 쉽게 이해할 수 있다는 점이지만, 다른 지도학습 알고리듬에 비해 성능이 떨어진다는 단점이 있다. 본 논문에서는 이를 위해 규칙 학습 알고리듬과 기억기반 학습을 결합하는 방법을 제시한다. 실험 결과, GECORAM 알고리듬은 규칙 기반 학습이나 기억 기반 학습을 단독으로 쓰는 경우보다 높은 정확도를 보였다.
PDF

Search Result 289, Processing Time 0.024 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)