Language- Independent Sentence Boundary Detection with Automatic Feature Selection

Lee, Do-Gil;

Journal of the Korean Data and Information Science Society

제19권4호
/
Pages.1297-1304
/
2008
/
1598-9402(pISSN)

한국데이터정보과학회 (The Korean Data and Information Science Society)

Language- Independent Sentence Boundary Detection with Automatic Feature Selection

Lee, Do-Gil

발행 : 2008.11.30

PDF

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

This paper proposes a machine learning approach for language-independent sentence boundary detection. The proposed method requires no heuristic rules and language-specific features, such as part-of-speech information, a list of abbreviations or proper names. With only the language-independent features, we perform experiments on not only an inflectional language but also an agglutinative language, having fairly different characteristics (in this paper, English and Korean, respectively). In addition, we obtain good performances in both languages. We have also experimented with the methods under a wide range of experimental conditions, especially for the selection of useful features.

Journal of the Korean Data and Information Science Society

Language- Independent Sentence Boundary Detection with Automatic Feature Selection

초록

키워드

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)