[KSCI] Korea Science Citation Index Service

Eojeol Syntactic Tag Prediction of Korean Text using Entropy Guided CRF

Oh, Jin-Young (창원대학교 컴퓨터공학과)
Cha, Jeong-Won (창원대학교 컴퓨터공학과)

Publication Information

Journal of KIISE:Computing Practices and Letters / v.15, no.5, 2009 , pp. 395-399 More about this Journal

Abstract

In this work, we describe the syntactic tag prediction system for Korean using the decision tree and CRFs. Generally they select features by their intuition. It depends on their prior knowledge. In this works, we combine features systematically using the decision tree. We also analyze errors and optimize features for the best performance. From the result of experiments, we can see that the proposed method is effective for the syntactic tag estimation and will be helpful for the syntactic analysis.

Keywords

Eojeol Syntactic Tag; Decision Tree; CRFs;

Citations & Related Records

Times Cited By KSCI : 2 (Citation Analysis)

Reference
Cited By KSCI

1	차정원, "힌국어 결합범주문법을 위한 통계적 구문분석", 포항공대 박사학위 논문, 2002
2	Milidiu, R. L., C. N. d. Santos, et al., Pharse Chunking Using Entropy Guided Transformation Learning. Proceedings of ACL-08: HLT. Colum-bus, Ohio, Association for Computational Linguistics: pp. 647-655, 2008
3	A. L. Berger, V. J. Della Pietra, S. A. Della Pietra, "A maximum entropy approach to natural language processing," Computational Linguistics, Vol.22, No.1, pp. 39-71, 1996
4	박의규, 나동열, "한국어 구문분석을 위한 구묶음 기반 의존명사 처리", 한국인지과학회 논문지 제 17권, 제2호, pp. 119-138, 2006 과학기술학회마을 ScienceOn
5	J. Lafferty, A. McCallum, F. Pereira. "Conditional Random Fields: Probabilistic Models for Segmen-ting and Labeling Sequence Data," Proceedings of International Conference on Machine Learning, ICML-01, pp. 282-289, 2001
6	박성배, 장병탁, "한국어 구 단위화를 위한 규칙 기반방법과 기억 기반 학습의 결함", 정보과학회논문지, 소프트웨어 및 응용 제 31권, 제3호, pp. 369-378, 2004 과학기술학회마을 ScienceOn
7	김미영, 강신재, 이종혁, "단위(chunks)분석과 의존문법에 기반한 한국어 구문분석". 한국정보과학회 봄 학술발표논문집, Vol.27, No.1, pp. 327-329, 2000 과학기술학회마을
8	신효필, "최소자원 최대효과 구문분석", 한국 정보 과학회 언어 공학연구회, 학술대회지(한글 및 한국어정보처리), pp. 242-248, 1999
9	Bangalore, S. and A. K. Joshi, "Supertagging: An Approach to Almost Parsing," Computational Linguistics 25: pp. 237-265, 1999
10	박성배, 장병탁, 김영탁, "k-NN으로 확장된 한국어 단위화", 한국정보과학회 가을 학술발표논문집, Vol.27, No.2, pp. 182-184, 2004 과학기술학회마을
11	세종계획 21, http://www.sejong.or.kr/
12	Abney, S. and S. P. Abney. Parsing by Chunks. Principle-Based Parsing. R. C. Berwick, S. P. Abney and C. Tenny, Kluwer Academic Publi-shers: pp. 257-278, 1991
13	황영숙, 정후중, 박소영, 곽용재, 임해창, "자질집합선택 기반의 기계학습을 통한 한국어 기본구 인식의 성능향상", 정보과학회논문지, 소프트웨어 및 응용 제 29권, 제9호, pp. 654-668, 2002 과학기술학회마을 ScienceOn

KSCI

Eojeol Syntactic Tag Prediction of Korean Text using Entropy Guided CRF 엔트로피 지도 CRF를 이용한 한국어 어절 구문태그 예측

Eojeol Syntactic Tag Prediction of Korean Text using Entropy Guided CRF