Semi-Automatic Annotation Tool to Build Large Dependency Tree-Tagged Corpus

Park, Eun-Jin;Kim, Jae-Hoon;Kim, Chang-Hyun;Kim, Young-Kill;

한국언어정보학회:학술대회논문집 (Proceedings of the Korean Society for Language and Information Conference)

한국언어정보학회 (Korean Society for Language and Information)

Semi-Automatic Annotation Tool to Build Large Dependency Tree-Tagged Corpus

발행 : 2007.11.01

PDF

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

Corpora annotated with lots of linguistic information are required to develop robust and statistical natural language processing systems. Building such corpora, however, is an expensive, labor-intensive, and time-consuming work. To help the work, we design and implement an annotation tool for establishing a Korean dependency tree-tagged corpus. Compared with other annotation tools, our tool is characterized by the following features: independence of applications, localization of errors, powerful error checking, instant annotated information sharing, user-friendly. Using our tool, we have annotated 100,904 Korean sentences with dependency structures. The number of annotators is 33, the average annotation time is about 4 minutes per sentence, and the total period of the annotation is 5 months. We are confident that we can have accurate and consistent annotations as well as reduced labor and time.

한국언어정보학회:학술대회논문집 (Proceedings of the Korean Society for Language and Information Conference)

Semi-Automatic Annotation Tool to Build Large Dependency Tree-Tagged Corpus

초록

키워드

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)