Browse > Article

An Effective Adaptive Dialogue Strategy Using Reinforcement Loaming  

Kim, Won-Il (삼성전자 영상디스플레이 사업부)
Ko, Young-Joong (동아대학교 컴퓨터공학과)
Seo, Jung-Yun (서강대학교 컴퓨터학과)
Abstract
In this paper, we propose a method to enhance adaptability in a dialogue system using the reinforcement learning that reduces response errors by trials and error-search similar to a human dialogue process. The adaptive dialogue strategy means that the dialogue system improves users' satisfaction and dialogue efficiency by loaming users' dialogue styles. To apply the reinforcement learning to the dialogue system, we use a main-dialogue span and sub-dialogue spans as the mathematic application units, and evaluate system usability by using features; success or failure, completion time, and error rate in sub-dialogue and the satisfaction in main-dialogue. In addition, we classify users' groups into beginners and experts to increase users' convenience in training steps. Then, we apply reinforcement learning policies according to users' groups. In the experiments, we evaluated the performance of the proposed method on the individual reinforcement learning policy and group's reinforcement learning policy.
Keywords
Dialogue System; Adaptive Dialogue Strategy; Reinforcement Learning; Main-dialogue and Sub-dialogue; Q-learning;
Citations & Related Records
연도 인용수 순위
  • Reference
1 S. Moller, 'A new Taxonomy of the Quality of Telephone Service Based on Spoken Dialogue System,' In proceedings of the 3 th SIGdial Workshop on Discourse and Dialogue, Philadel phia, PA. pp. 142-153, 2002
2 은지현, 최준기, 장두성, 김현정, 구명완, '마르코프 의사결정 과정에 기반한 대화 관리 시스템', In Proceedings of the HCI 2007, pp. 475-480, 2007
3 M. McTear, Spoken Dialogue Technology Toward the Conversational User Interface, Springer-Verlag London, 2004
4 J.W. Wallis, E.H. Shortliffe, 'Customized Explanations Using Causal Knowledge,' In B.C. Buchanan, E.H. Shortliffe(eds): Rule-based Expert System: The MYCIN Experiments of the standford Heuristic Programming Project, Addison-Wesley Publishing Company, pp. 371-388, 1985
5 T. Dean J.Allen, Y. Aloimomos, Artificial Intelligence Theory and Practice, Addison-Wesley, 1995
6 K. Jokinen, M. Kaipainen, T. Jauhuainen, G. Wilcock, M. Turunen, J. Akulinen, J. Kussis, K. Lagu, 'Adaptive Dialogue System Interaction with interact,' In Proceedings of the 3rd SIGdial Workshop on Discourse and Dialogue, Philadelphia, PA, pp. 64-73, 2002
7 D. Litman, S. Pan, 'Empirically Evaluating an adaptable spoken dialogue systems,' In Proceedings of the 7th International Conference on User Modeling(UM'99), pp. 55-64, 1999
8 K. Jokinen, K. Kanto, 'User Expertise Modeling and Adaptive in a Speech-Based E-mail System,' In proceedings of Annual Meeting of the Association of Computational Linguistics 2004(ACL 2004), pp. 87-94, 2004
9 R. S. Sutton, A. G. Barto, Reinforcement Learning An Introduction, MIT Press, 1998
10 M. A. Walker, J. Wright, I. Langkilde, 'Using natural Language Processing and Discourse features to identify understanding errors in a spoken dialogue system,' In Proceedings of the 17th International Conference on Machine Learning, Palo Alto, CA. pp. 1111-1118, 2000
11 M. A. Walker, D. Litman, C. A. Kamm, A. Abella 'PARADISE: A Framework for Evaluating Spoken Dialogue Agents,' In Proceedings of the 35th Annual Meeting of the Association of Computational Linguistics(ACL 97), pp. 271-280, 1997
12 I. Zukerman, D. Litman, 'Natural Language Processing and User Modeling: Synergies and Limitations,' User Modeling and User-Adapted Interaction Vol.11, pp. 129-158, 2001   DOI   ScienceOn