Browse > Article

Efficient Semantic Structure Analysis of Korean Dialogue Sentences using an Active Learning Method  

Kim, Hark-Soo (강원대학교 컴퓨터정보통신공학)
Abstract
In a goal-oriented dialogue, speaker's intention can be approximated by a semantic structure that consists of a pair of a speech act and a concept sequence. Therefore, it is very important to correctly identify the semantic structure of an utterance for implementing an intelligent dialogue system. In this paper, we propose a model to efficiently analyze the semantic structures based on an active teaming method. To reduce the burdens of high-level linguistic analysis, the proposed model only uses morphological features and previous semantic structures as input features. To improve the precisions of semantic structure analysis, the proposed model adopts CRFs(Conditional Random Fields), which show high performances in natural language processing, as an underlying statistical model. In the experiments in a schedule arrangement domain, we found that the proposed model shows similar performances(92.4% in speech act analysis and 89.8% in concept sequence analysis) to the previous models although it uses about a third of training data.
Keywords
active learning; semantic structure; speech act; concept sequence;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 Lee, S. and Seo, J., "An analysis of Korean speech act using hidden Markov model with decision trees," in Proceedings of ICCPOL 2001, pp. 397- 400, 2001
2 Samuel, K., Caberry, S., and Vijay-Shanker, K., "Computing dialogue acts from features with transform-based learning," in Proceedings of the AAAI Spring Symposium, pp. 90-97, 1998
3 이현정, 한국어 대화체 문장의 화행 분석. 석사학위논문, 서강대학교, 1997
4 Goddeau, D., Meng, H., Polifroni, J., Seneff, S., and Busayapongchai, S., "A form-based dialogue manager for spoken language applications," in Proceedings of International Conference on Spoken Language Processing, pp. 701-704, 1996
5 Lee, H., Kim, H., and Seo, J., Efficient domain action classification using neural networks. Lecture Notes in Computer Science, Vol.4233, pp. 150-158, 2006
6 Lafferty, J., McCallum, A., and Pereira, F., "Conditional random fields: Probabilistic models for segmenting and labeling sequence data," in Proceedings of ICML 2001, 2001
7 Pinto, D., McCallum, A., Wei, X., and Croft, W. B., "Table extraction using conditional random fields," in Proceedings of SIGIR 2003, 2003
8 Riccardi, G. and Hankkani-Tur, D., Active learning: theory and applications to automatic speech recognition, IEEE Transactions on Speech and Audio Processing, Vol.13, No.4, pp. 504-511, 2005   DOI
9 Cohn, D., Ghahramani, Z., Jordan, M., Active learning with statistical models. Journal of Artificial Intelligence Research, Vol.4, pp. 129-145, 1999
10 Fei, S. and Pereira, F., "Shallow parsing with conditional random fields," in Proceedings of HLT & NAACL 2003, 2003
11 Kim, H., A dialogue-based NLIDB system in a schedule management domain. Lecture Notes in Computer Science, Vol.4362, pp. 869-877, 2007
12 Yang, Y. and Pedersen, J. O., "A comparative study on feature selection in text categorization," in Proceedings of ICML 1997, 1997
13 Lambert, L. and Caberry, S., "A tripartite plan- based model of dialogue," in Proceedings of ACL 1991, pp. 47-54, 1991
14 Caberry, S., A pragmatics-based approach to ellipsis resolution. Computational Linguistics, Vol.15, No.2, pp. 75-96, 1989
15 김경선, 서정연, 자질 선택 기법을 이용한 한국어 화행 결정, 한국정보과학회 논문지, 제30권 3호, pp. 278- 284, 2003
16 Litman, D. J. and Allen, J. F., A plan recognition model for subdialogues in conversations, Cognitive Science, Vol.11, pp. 163-200, 1987   DOI   ScienceOn
17 Levin, L., Langley, C., Lavie, A., Gates, D. Wallace, D., and Peterson, K., "Domain specific speech acts for spoken language translation," in Proceedings of 4th SIGdial Workshop on Discourse and Dialogue, 2003
18 Langley, C., "Analysis for speech translation using grammar-based parsing and automatic classification," in Proceedings of the ACL Student Research Workshop, 2002
19 Kim, H., Seon, C., and Seo, J., A dialogue-based information retrieval assistant using shallow NLP techniques in online sales domains, IEICE Information and Systems, Vol.E88D, No.5. pp. 801-808, 2005
20 은종민, 이성욱, 서정연, 지지벡터기계(support vector machines)를 이용한 한국어 화행분석, 한국정보처리학회 논문지, 제12B권 3호, pp. 365-368, 2005   과학기술학회마을   DOI