Browse > Article

A Statistical Prediction Model of Speakers' Intentions in a Goal-Oriented Dialogue  

Kim, Dong-Hyun (㈜다이퀘스트연구소)
Kim, Hark-Soo (강원대학교 컴퓨터정보통신공학)
Seo, Jung-Yun (서강대학교 컴퓨터공학과)
Abstract
Prediction technique of user's intention can be used as a post-processing method for reducing the search space of an automatic speech recognizer. Prediction technique of system's intention can be used as a pre-processing method for generating a flexible sentence. To satisfy these practical needs, we propose a statistical model to predict speakers' intentions that are generalized into pairs of a speech act and a concept sequence. Contrary to the previous model using simple n-gram statistic of speech acts, the proposed model represents a dialogue history of a current utterance to a feature set with various linguistic levels (i.e. n-grams of speech act and a concept sequence pairs, clue words, and state information of a domain frame). Then, the proposed model predicts the intention of the next utterance by using the feature set as inputs of CRFs (Conditional Random Fields). In the experiment in a schedule management domain, The proposed model showed the precision of 76.25% on prediction of user's speech act and the precision of 64.21% on prediction of user's concept sequence. The proposed model also showed the precision of 88.11% on prediction of system's speech act and the Precision of 87.19% on prediction of system's concept sequence. In addition, the proposed model showed 29.32% higher average precision than the previous model.
Keywords
Intention prediction; speech act prediction; concept sequence prediction;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Reithinger, N., "Some Experiments in Speech Act Prediction," Proceedings of Empirical Methods in Discourse Interpretation and Generation, 1995
2 Lee, H., Kim, H., and Seo, J., Efficient Domain Action Classification Using Neural Networks, Lecture Notes in Computer Science, Vol.4233, pp.150-158, 2006
3 Fei, S. and Pereira, F., "Shallow Parsing with Conditional Random Fields," Proceedings of HLT and NAACL, 2003
4 Lambert, L. and Caberry, S., "A Tripartite Plan- based Model of Dialogue," Proceedings of ACL, pp.47-54, 1991
5 Wahlster, W. "Verbmobil-Translation of Face-to- Face Dialogs," Proceedings of MT Summit IV, 1993
6 오종건, 작업수행영역에서 계획에 기반한 대화 시스템의 설계, 석사학위논문, 서강대학교, 1999
7 Smith, R. W. and Hipp, D. R., Spoken Natural Language Dialogue Systems: A Practical Approach, Oxford University Press, 1994
8 은종민, 이성욱, 서정연, 지지벡터기계를 이용한 한국어 화행분석, 한국정보처리학회 논문지, Vol.12B, No.3, pp.365-368, 2005   과학기술학회마을   DOI
9 Yang, Y. and Pedersen, J. O., "A Comparative Study on Feature Selection in Text Categorization," Proceedings of ICML, 1997
10 Langley, C. "Analysis for Speech Translation Using Grammar-based Parsing and Automatic Classification," Proceedings of the ACL Student Research Workshop, 2002
11 김용재, 데이타베이스 검색을 위한 한국어 대화 인터페이스 시스템의 설계, 석사학위논문, 서강대학교, 1997
12 Levin, L., Langley, C., Lavie, A., Gates, D., Wallace, D., and Peterson, K., "Domain Specific Speech Acts for Spoken Language Translation," Proceedings of 4th SIGdial Workshop on Discourse and Dialogue, 2003
13 Goddeau, D., Meng, H., Polifroni, J., Seneff, S. and Busayapongchai, S., "A Form-based Dialogue Manager for Spoken Language Applications," Proceedings of International Conference on Spoken Language Processing, pp.701-704, 1996
14 Lafferty, J., McCallum, A., and Pereira, F., "Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data", Proceedings of ICML, pp.282-289, 2001