Review of Korean Speech Act Classification: Machine Learning Methods

Kim, Hark-Soo;Seon, Choong-Nyoung;Seo, Jung-Yun;

doi:10.5626/JCSE.2011.5.4.288

Journal of Computing Science and Engineering

Volume 5 Issue 4
/
Pages.288-293
/
2011
/
1976-4677(pISSN)
/
2093-8020(eISSN)

Korean Institute of Information Scientists and Engineers (한국정보과학회)

DOI QR Code

Review of Korean Speech Act Classification: Machine Learning Methods

Kim, Hark-Soo (Department of Computer and Communications Engineering, Kangwon National University) ;
Seon, Choong-Nyoung (Department of Computer Science and Engineering, Sogang University) ;
Seo, Jung-Yun (Department of Computer Science and Engineering/Interdisciplinary Program of Integrated Biotechnology, Sogang University)

Received : 2011.06.17
Accepted : 2011.11.08
Published : 2011.12.30

https://doi.org/10.5626/JCSE.2011.5.4.288 Citation PDF KPUBS

Download PDF

⟨ Previous Next ⟩

Abstract

To resolve ambiguities in speech act classification, various machine learning models have been proposed over the past 10 years. In this paper, we review these machine learning models and present the results of experimental comparison of three representative models, namely the decision tree, the support vector machine (SVM), and the maximum entropy model (MEM). In experiments with a goal-oriented dialogue corpus in the schedule management domain, we found that the MEM has lighter hardware requirements, whereas the SVM has better performance characteristics.

Keywords

References

J. Allen, Natural Language Understanding, Menlo Park, CA: Benjamin/Cummings, 1995.
L. Lambert and S. Carberry, "A tripartite plan-based model of dialogue," Proceedings of the 29th Annual Meeting on Association for Computational Linguistics, Berkeley, CA, 1991, June 18- 21, pp. 47-54.
D. J. Litman and J. F. Allen, "A plan recognition model for subdialogues in conversations," Cognitive Science, vol. 11, no. 2, pp. 163-200, 1987. https://doi.org/10.1207/s15516709cog1102_4
M. Nagata and T. Morimoto, "First steps towards statistical modeling of dialogue to predict the speech act type of the next utterance," Speech Communication, vol. 15, no. 3-4, pp. 193-203, 1994. https://doi.org/10.1016/0167-6393(94)90071-X
J. Lee, G. C. Kim, and J. Seo, "A dialogue analysis model with statistical speech act processing for dialogue machine translation," Proceedings of Spoken Language Translation Workshop in conjunction with ACL/EACL, Madrid, Spain, 1997, July 11, pp. 10-15.
N. Reithinger and M. Klesen, "Dialogue act classification using language models," Proceedings of EuroSpeech, Rhodos, Greece, 1997, pp. 2235-2238.
K. Samuel, S. Carberry, and K. Vijay-Shanker, "Dialogue act tagging with transformation-based learning," Proceedings of the 17th International Conference on Computational Linguistics, Montreal, QC, 1998, pp. 1150-1156.
A. Stolcke, N. Coccaro, R. Bates, P. Taylor, C. Van Ess-Dykema, K. Ries, E. Shriberg, D. Jurafsky, R. Martin, and M. Meteer, "Dialogue act modeling for automatic tagging and recognition of conversational speech," Computational Linguistics, vol. 26, no. 3, pp. 339-373, 2000. https://doi.org/10.1162/089120100561737
C. T. Langley, "Analysis for speech translation using grammarbased parsing and automatic classification," Proceedings of Student Research Workshop at the 40th Annual Meeting of the Association of Computational Linguistics, Philadelphia, PA, 2002, July.
H. Kim and J. Seo, "An efficient trigram model for speech act analysis in small training corpus," Journal of Cognitive Science, vol. 4, no. 1, pp. 107-120, 2003.
N. Webb, M. Hepple, and Y. Wilks, "Dialogue act classification based on intra-utterance features," Proceedings of the AAAI Workshop on Spoken Language Understanding, Pittsburgh, PA, 2005, July 9-10.
W. S. Choi, H. Kim, and J. Seo, "An integrated dialogue analysis model for determining speech acts and discourse structures," IEICE Transactions on Information and Systems, vol. E88-D, no. 1, pp. 150-157, 2005.
H. Lee, H. Kim, and J. Seo, "Domain action classification using a maximum entropy model in a schedule management domain," AI Communications, vol. 21, no. 4, pp. 221-229, 2008.
S. Kang, H. Kim, and J. Seo, "A reliable multidomain model for speech act classification," Pattern Recognition Letters, vol. 31, no. 1, pp. 71-74, 2010. https://doi.org/10.1016/j.patrec.2009.08.013
M. J. Kim, J. H. Park, S. B. Kim, H. C. Rim, and D. G. Lee, "A comparative study on optimal feature identification and combination for korean dialogue act classification," Journal of Korean Institute of Information Scientists and Engineers: Software and Applications, vol. 35, no. 11, pp. 681-691, Nov 2008.
K. Kim, H. Kim, and J. Seo, "A neural network model with feature selection for Korean speech act classification," International Journal of Neural Systems, vol. 14, no. 6, pp. 407-414, 2004. https://doi.org/10.1142/S0129065704002157
Y. Yang and J. O. Pedersen, "A comparative study on feature selection in text categorization," Proceedings of the 14th International Conference on Machine Learning, Nashville, TN, 1997, July, pp. 412-420.
E. Charniak, Statistical Language Learning, Cambridge, MA: MIT Press, 1993.
S. Lee and J. Seo, "Korean speech act analysis system using hidden markov model with decision trees," International Journal of Computer Processing of Oriental Languages, vol. 15, no. 3, pp. 231-243, 2002. https://doi.org/10.1142/S0219427902000625
D. Surendran and G. A. Levow, "Dialogue act tagging with support vector machines and hidden markov models," Proceedings of Interspeech/ICSLP, Pittsburgh, PA, 2006, Sep.
H. Lee, H. Kim, and J. Seo, "Efficient domain action classification using neural networks," Neural Information Processing. Lecture Notes in Computer Science Vol. 4223, I. King, J. Wang, L. W. Chan, and D. Wang, Eds., Heidelberg, Germany: Springer Berlin, 2006, pp. 150-158.
D. Kim, H. Kim, and J. Seo, "A statistical prediction model of speakers' intentions in a goal-oriented dialogue," Journal of Korean Institute of Information Scientists and Engineers: Software and Applications, vol. 35, no. 9, pp. 554-561, Sep 2008.
J. D. Lafferty, A. McCallum, and F. C. N. Pereira, "Conditional random fields: probabilistic models for segmenting and labeling sequence data," Proceedings of the Eighteenth International Conference on Machine Learning, Williamstown, MA, 2001, June 28-July 1, pp. 282-289.
J. C. Reynar and A. Ratnaparkhi, "A maximum entropy approach to identifying sentence boundaries," Proceedings of the 5th Conference on Applied Natural Language Processing, Washington, DC, 1997, March 31-April 3, pp. 16-19.
J. R. Quinlan, C4.5: Programs for Machine Learning, San Mateo, CA: Morgan Kaufmann Publishers, 1993.
T. Joachims, "SVMLight: Support Vector Machine Version: 6.02," http://svmlight.joachims.org/.
E. S. Ristad, Maximum entropy modeling toolkit, Technical Report, Department of Computer Science, Princeton University, 1996.

Cited by

A novel density-based clustering method using word embedding features for dialogue intention recognition vol.19, pp.4, 2016, https://doi.org/10.1007/s10586-016-0649-7
Two-phase reanalysis model for understanding user intention vol.42, 2014, https://doi.org/10.1016/j.patrec.2013.12.015
Cross-Lingual Annotation Projection for Weakly-Supervised Relation Extraction vol.13, pp.1, 2014, https://doi.org/10.1145/2529994
Hierarchical speech-act classification for discourse analysis vol.34, pp.10, 2013, https://doi.org/10.1016/j.patrec.2013.03.008
Improving domain action classification in goal-oriented dialogues using a mutual retraining method vol.45, 2014, https://doi.org/10.1016/j.patrec.2014.03.021
An Integrated Neural Network Model for Domain Action Determination in Goal-Oriented Dialogues vol.9, pp.2, 2013, https://doi.org/10.3745/JIPS.2013.9.2.259
Post-error Correction in Automatic Speech Recognition Using Discourse Information vol.14, pp.2, 2014, https://doi.org/10.4316/AECE.2014.02009
An Efficient Framework for Development of Task-Oriented Dialog Systems in a Smart Home Environment vol.18, pp.5, 2018, https://doi.org/10.3390/s18051581

Journal of Computing Science and Engineering

Review of Korean Speech Act Classification: Machine Learning Methods

Abstract

Keywords

References

Cited by

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)