Heuristic-based Korean Coreference Resolution for Information Extraction

Euisok Chung;Soojong Lim;Yun, Bo-Hyun;

한국언어정보학회:학술대회논문집 (Proceedings of the Korean Society for Language and Information Conference)

한국언어정보학회 2002년도 Language, Information, and Computation Proceedings of The 16th Pacific Asia Conference
/
Pages.50-58
/
2002

한국언어정보학회 (Korean Society for Language and Information)

Heuristic-based Korean Coreference Resolution for Information Extraction

Euisok Chung (Human Information Processing Dept., Electronics and Telecommunications Research Institute, 161, Kajong-Dong, Yusong-Gu, Daejon, 305-350, KOREA) ;
Soojong Lim (Human Information Processing Dept., Electronics and Telecommunications Research Institute, 161, Kajong-Dong, Yusong-Gu, Daejon, 305-350, KOREA) ;
Yun, Bo-Hyun (Human Information Processing Dept., Electronics and Telecommunications Research Institute, 161, Kajong-Dong, Yusong-Gu, Daejon, 305-350, KOREA)

발행 : 2002.02.01

PDF

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

The information extraction is to delimit in advance, as part of the specification of the task, the semantic range of the output and to filter information from large volumes of texts. The most representative word of the document is composed of named entities and pronouns. Therefore, it is important to resolve coreference in order to extract the meaningful information in information extraction. Coreference resolution is to find name entities co-referencing real-world entities in the documents. Results of coreference resolution are used for name entity detection and template generation. This paper presents the heuristic-based approach for coreference resolution in Korean. We constructed the heuristics expanded gradually by using the corpus and derived the salience factors of antecedents as the importance measure in Korean. Our approach consists of antecedents selection and antecedents weighting. We used three kinds of salience factors that are used to weight each antecedent of the anaphor. The experiment result shows 80% precision.

한국언어정보학회:학술대회논문집 (Proceedings of the Korean Society for Language and Information Conference)

Heuristic-based Korean Coreference Resolution for Information Extraction

초록

키워드

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)