Improvements on Phrase Breaks Prediction Using CRF (Conditional Random Fields)

CRF를 이용한 운율경계추성 성능개선

  • 김승원 (포항공과대학교 컴퓨터공학과) ;
  • 이근배 (포항공과대학교 컴퓨터공학과) ;
  • 김병창 (대구가톨릭대학교 컴퓨터정보통신공학부)
  • Published : 2006.03.01

Abstract

In this paper, we present a phrase break prediction method using CRF(Conditional Random Fields), which has good performance at classification problems. The phrase break prediction problem was mapped into a classification problem in our research. We trained the CRF using the various linguistic features which was extracted from POS(Part Of Speech) tag, lexicon, length of word, and location of word in the sentences. Combined linguistic features were used in the experiments, and we could collect some linguistic features which generate good performance in the phrase break prediction. From the results of experiments, we can see that the proposed method shows improved performance on previous methods. Additionally, because the linguistic features are independent of each other in our research, the proposed method has higher flexibility than other methods.

Keywords