KKMA : A Tool for Utilizing Sejong Corpus based on Relational Database  

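Lee, Dong-Joo (School of Computer Science and Engineering, Seoul National University)
Yeon, Jong-Heum (School of Computer Science and Engineering, Seoul National University)
Hwang, In-Beom (School of Computer Science and Engineering, Seoul National University)
Lee, Sang-Goo (School of Computer Science and Engineering, Seoul National University)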
Abstract
Corpora are widely used as a fundamental resource for various purposes in linguistic studies. Several large corpora exist in Korea, such as the Sejong corpus. However, it is hard to find a tool for working with such large corpora. In this paper, we propose a method of utilizing the Sejong corpus based on a relational database. We designed a relational database schema to store the corpus and implemented a Web-based application so that many researchers can easily access and use the Sejong corpus.
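To make the idea of storing a morphologically tagged corpus in relational tables concrete, the following is a minimal sketch only. The three-table layout (`sentence`, `eojeol`, `morpheme`), every column name, and the helper functions are assumptions made for illustration, not the schema described in the paper; SQLite stands in for whatever database the authors used.

```python
import sqlite3

# Hypothetical schema: one row per sentence, per eojeol (space-delimited
# word unit), and per tagged morpheme. Names are illustrative assumptions.
SCHEMA = """
CREATE TABLE sentence (
    id   INTEGER PRIMARY KEY,
    text TEXT NOT NULL
);
CREATE TABLE eojeol (
    id          INTEGER PRIMARY KEY,
    sentence_id INTEGER NOT NULL REFERENCES sentence(id),
    position    INTEGER NOT NULL,
    surface     TEXT NOT NULL
);
CREATE TABLE morpheme (
    id        INTEGER PRIMARY KEY,
    eojeol_id INTEGER NOT NULL REFERENCES eojeol(id),
    position  INTEGER NOT NULL,
    form      TEXT NOT NULL,
    tag       TEXT NOT NULL
);
CREATE INDEX idx_morpheme_form_tag ON morpheme(form, tag);
"""


def build_db():
    """Create an in-memory database with the illustrative schema."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    return conn


def add_sentence(conn, text, analysis):
    """Store one sentence; analysis is [(surface, [(form, tag), ...]), ...]."""
    sentence_id = conn.execute(
        "INSERT INTO sentence (text) VALUES (?)", (text,)
    ).lastrowid
    for pos, (surface, morphs) in enumerate(analysis):
        eojeol_id = conn.execute(
            "INSERT INTO eojeol (sentence_id, position, surface) VALUES (?, ?, ?)",
            (sentence_id, pos, surface),
        ).lastrowid
        conn.executemany(
            "INSERT INTO morpheme (eojeol_id, position, form, tag) VALUES (?, ?, ?, ?)",
            [(eojeol_id, i, form, tag) for i, (form, tag) in enumerate(morphs)],
        )
    return sentence_id


def find_sentences(conn, form, tag):
    """Return the text of every sentence containing the given form/tag pair."""
    rows = conn.execute(
        """
        SELECT DISTINCT s.id, s.text
        FROM morpheme AS m
        JOIN eojeol   AS e ON e.id = m.eojeol_id
        JOIN sentence AS s ON s.id = e.sentence_id
        WHERE m.form = ? AND m.tag = ?
        ORDER BY s.id
        """,
        (form, tag),
    )
    return [text for _id, text in rows]


def tag_frequencies(conn):
    """Count morphemes per part-of-speech tag."""
    return dict(conn.execute("SELECT tag, COUNT(*) FROM morpheme GROUP BY tag"))


if __name__ == "__main__":
    conn = build_db()
    add_sentence(
        conn,
        "나는 학교에 갔다.",
        [
            ("나는", [("나", "NP"), ("는", "JX")]),
            ("학교에", [("학교", "NNG"), ("에", "JKB")]),
            ("갔다.", [("가", "VV"), ("았", "EP"), ("다", "EF"), (".", "SF")]),
        ],
    )
    print(find_sentences(conn, "학교", "NNG"))
    print(tag_frequencies(conn))
```

With a layout along these lines, concordance-style lookups and frequency counts reduce to ordinary SQL joins and aggregates, which is the kind of access a Web front end can expose to many users at once.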
Keywords
Corpus Linguistics; Sejong Corpus; Relational Database; Corpus Utility