Browse > Article
http://dx.doi.org/10.3745/KIPSTB.2003.10B.4.389

Implementing Korean Partial Parser based on Rules  

Lee, Kong-Joo (이화여자대학교 컴퓨터학과)
Kim, Jae-Hoon (한국해양대학교 컴퓨터공학과)
Abstract
In this paper, we present a Korean partial parser based on rules, which is used for running applications such as a grammar checker and a machine translation. Basically partial parsers construct one or more morphemes and/or words into one syntactical unit, but not complete syntactic trees, and accomplish some additional operations for syntactical parsing. The system described in this paper adopts a set of about 140 manually-written rules for partial parsing. Each rule consists of conditional statements and action statement that defines which one is head node and also describes an additional action to do if necessary. To observe that this approach can improve the efficiency of overall processing, we make simple experiments. The experimental results have shown that the average number of edges generated in processing without the partial parser is about 2 times more than that with the partial parser.
Keywords
Korean partial parser; canonicalized morphological structure;
Citations & Related Records
Times Cited By KSCI : 3  (Citation Analysis)
연도 인용수 순위
1 Cardie, C., Ng, V., Pierce, D., Buckley, C. 'Examining the role of statistical and linguistic knowledge sources in a general-knowledge question-answering system,' Proceedings of the Sixth Applied Natural Language Processing Conference(ANLP-2000), pp.180-187, 2000   DOI
2 Church ,K., 'A stochastic PARTS program and nour phrase parser for unrestricted texts,' Proceedings of ANLP-88, Austin, Texas, 1988
3 Chen K.-H. and Chen H. H., 'Extracting noun phrase phrases from large scale texts : Hybrid approach and its automatic evaluation,' Proceedings of ACL-94, pp. 234-241, 1994   DOI
4 Brants, T., 'Cascaded Markov Models,' Proceedings of EACL-99, Bergen, Norway, 1999   DOI
5 Dagan, I. and Krymolowski, Y.,Compositional partial Parsing by memory-based sequence learning, Data-oriented Parsing,' Rens Bod, Remko Scha, and Khalil Sima'an, CSLI publications, 2001
6 Hobbs, J., Appelt, D., Bear, J., Israel, D., Karmeyama, M., Stickel, M. and Tyson, M., 'FASTUS : a cascaded finite-state transducer for extracting information from natural-language text ,' Finite State Devices for Natural Lanaguage Processing, E. Roche and Y. Schabes, eds., Cambridge MA : MIT Press, 1996
7 Hindle, D., User manual for Fidditch, Technical Memorandum, #7590-142, Naval Research Laboratory, 1993
8 Daelemans, W., Buchholz, S. and Veenstra, J., 'Memory-Based Shallow Parsing,' Proceedings of CoNLL-99, Bergen, Norway, 1999
9 http://msdn.microsoft.com/library/default.asp?url=/library/en-us/dnofftalk/html/office01022003.asp
10 Joshi, A., Hopely, P., 'A parser from antiquity : an early application of finite state transducers to natural language parsing,' Extended Finite State Models of Language, Kornai, A. eds, Cambridge University Press, pp.6-15, 1999
11 INUI,T. and INUI K. 'An application of Probabilistic Partial Parsing : Detection of Syntactic-Tag Errors in Treebanks,' IPSJ SIGNotes Natural Language Abstract, No.134-003, 1999
12 Skut, W. and Brants, T. 'A maximum-entropy partial parser for unrestricted text,' Proceedings of the Sixth Workshop on Very Large Corpora. Montreal, Canada., 1998a
13 Voutilainen, A., 'NPtool, a detector of English noun phrases,' The computation and Language E-Print Archive (http://arXiv.org/), cmp-lg/9502010, 1995
14 Zhang,T., Damerau, F. and Johnson, D. 'Text Chunking based on a Generalization of Winnow,' Journal of Machine Learning Research, Vol.2, pp.615-637, Mar., 2002   DOI
15 Rawshaw, L. A. and Marcus, M. P., 'Text chunking using transformation-based learning,' Proceedings of the 3rd Workshop.on Very Large Corpora, MIT, pp.82-94, 1995
16 Skut, W. and Brants, T., 'Chunk tagger-statistical recognition of noun phrases,' Proceedings of the ESSLLI Workshop on Automated Acquisition of Syntax and Parsing. Saarbrcken, Germany, 1998
17 Tjong Kim Sang, 'Noun phase representation by system combination,' Proceedings of ANLP-NAACL2000, Seattle, Washington, USA, 2000
18 Voutilainen, A. and Padro, L., 'Developing a hybrid NP parser,' Proceedings of ANLP-97, 1997   DOI
19 박수호, 권혁철, '확장된 어휘적 중의성 제거 규칙에 따른 부분 문장 분석에 기반한 한국어 문법검사기', 제13회 한글 및 한국어 정보처리 학술대회 발표논문집, pp.516-522, 2001
20 국어어문규정집
21 안동언, 기계번역을 위한 한국어 해석에서 형태소로부터 구문요소의 형성에 관한 연구, 한국과학기술원, 전산학과, 석사학위논문, 1987
22 Abney, S. 'Partial Parsing via Finite-State Cascades,' J. of Natural Language Engineering, 2(4), pp.337-344, 1996   DOI
23 이중영, 신병훈, 이공주, 김지은, 안상규, COM 기반의 다목적 형태소 분석기를 이용한 명사 추출기, 제1회 형태소 분석기 및 품사태거 평가 워크숍 논문지, pp.167-172, 1999   과학기술학회마을
24 김재훈, 부분 구문분석 방법론, 정보처리학회지, 제7권 제6호, pp.83-96, 2000   과학기술학회마을
25 김재훈, 한국어 부분 구문분석의 단위와 그 표지, 한국해양대학교, 컴퓨터 공학과, KMU-NLP-TR-2000-006, 2000
26 김홍규 외, 현대국어 기초 말뭉치 개발, 문화공보부, 2002
27 Abney, S., 'Chunk and dependencies : Bringing processing evidence to bear on syntxt,' 'Conputational Linguistics and the Foundations of Linguistic Theory, CLSI, 1995
28 Abney, S. Chunk Stylebook, http://sfs.npil.unituebingen.de/~abney/Papers.html #98i, 1996
29 At-Mohtar, S. and Chanod, J. P, 'Incremental Finite-State Parsing,' Proceedings of ANLP '97, Washington, pp. 72-79, 1997
30 Bourigault, D., 'Surface grammatical analysis for the extraction of terminological noun phrases,' Proceedings of COLING-92, pp.977-981, 1992   DOI
31 Cardie, C. and Pierce, D., 'Error-driven pruning of treebank grammars for base noun phrase identification,' Proceedings of COLING-ACL-98, 1998   DOI