Browse > Article

Decision of the Korean Speech Act using Feature Selection Method  

김경선 (다이퀘스트 연구소)
서정연 (서강대학교 컴퓨터학과)
Abstract
Speech act is the speaker's intentions indicated through utterances. It is important for understanding natural language dialogues and generating responses. This paper proposes the method of two stage that increases the performance of the korean speech act decision. The first stage is to select features from the part of speech results in sentence and from the context that uses previous speech acts. We use x$^2$ statistics(CHI) for selecting features that have showed high performance in text categorization. The second stage is to determine speech act with selected features and Neural Network. The proposed method shows the possibility of automatic speech act decision using only POS results, makes good performance by using the higher informative features and speed up by decreasing the number of features. We tested the system using our proposed method in Korean dialogue corpus transcribed from recording in real fields, and this corpus consists of 10,285 utterances and 17 speech acts. We trained it with 8,349 utterances and have test it with 1,936 utterances, obtained the correct speech act for 1,709 utterances(88.3%). This result is about 8% higher accuracy than without selecting features.
Keywords
Speech Act Decision; Feature Selection; Neural Network; CHI Statistics;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Jae-hoon Kim, Jungyun Seo, Gilchang Kim, Estimating Membership Functions in a Fuzzy Network Model for Part-Of-Speech Tagging, Journal of Intelligent and Fuzzy Systems, Vol. 4, pp.309-320, 1996
2 Wiener, E., J.O. Pedersen and A.S. Weigend. A neural network approach to topic spotting. In Proceedings of the Fourth Annual Symposium on Document Analysis and Information Retrieval (SDAIR'95), 1995
3 Yang, Yiming and Jan O. Pedersen. A comparative study on Feature selection in text categorization. In proceedings of the 14th Inter national conference on Machine Learning, 1997
4 Samuel, K., S. Carberry, and K. Vijay-Shanker. Dialogue Act Tagging with Transformation-Based Learning. In Proceddings of the $17^{th}$ International Conference on computational Linguistics and the 36th Annual Meeting of the Association for computational Linguistics, 1998. pp 1150-1156   DOI
5 Samuel, K., S. Carberry, and K. Vijay-Shanker. An Investigation of Transformation-Based Learning in Discourse. Machine Learning: Proceedings of the $15_{th}$ International Conference. 1998
6 Schutze, H., D.A. Hull, and J.O. Pedersen. A comparison of classifiers and document representations for the routing problem. In 18th Annual international ACM SIGIR conference on Research and Development in Information Retrieval (SIGIR 95), 1995
7 Lee, Songwook, and Jungyun Seo. Korean Speech Act Analysis Using Decision Tree. In Proceedings of the Conference on Hangul and Korean Language Information Processing, 1999. pp. 377-381
8 Lweis, D.D. and M. Ringuette, 1994. Comparison of two learning algorims for text categorization. In Proceedings of the Third Annual Symposium on Document Analysis and Information Retrieval (SDAIR'94), 1994
9 Kim, jin Ah, et al. A response generation in dialogue system based on dialogue flow diagrams. In Proceedings of NLPHS, 1995
10 Choi, Won Seug, Jeong-Mi Cho, and Jungyun Seo. Analysis System of Speech Acts and Discourse Structures Using Maximum Entropy Model. In Proceedings of the 37th Annual Meeting of the Association for computational Lin guistics, 1999, pp. 230-237   DOI
11 Chu-Carroll, J. and S. Carberry. Response Generation in Collaborative Negotiation. ACL-95, 1995   DOI
12 Lambert, L. and S. Caberry. A Tripatite Plan-Based Model of Dialogue. In Proceedings of ACL , 1991. pp. 47-54
13 Samuel, K. and S. Carberry and K. Vijay-Shanker. Automatically Selecting Useful Phrases for Dialogue Act Tagging. In Proceedings of the Fourth Conference of the Pacific Association for Computational Linguistics, 1999
14 Lee, Jae-won, Jungyun Seo, Gilchang Kim. A dialogue analysis Model with statistical speech act processing for Dialogue Machine Translation, Proceedings of Spoken Language Translation (Workshop in conjunction with (E)ACL 97, page 10-15, 1997
15 Lee, Hyunjung, Jae-Won Lee, Jungyun Seo. Speech Act Analysis Model of Korean Utterances for Automatic Dialog Translation, Journal of Korea Information Science Society (B): Software and Applications, 25(10) : 1433-1552, 1998
16 Rumelhart, D.E. and J.L. McClelland. Parallel Distributed Processing, volume 1. MIT Press. 1986