DOI QR코드

DOI QR Code

Review of Korean Speech Act Classification: Machine Learning Methods

  • Kim, Hark-Soo (Department of Computer and Communications Engineering, Kangwon National University) ;
  • Seon, Choong-Nyoung (Department of Computer Science and Engineering, Sogang University) ;
  • Seo, Jung-Yun (Department of Computer Science and Engineering/Interdisciplinary Program of Integrated Biotechnology, Sogang University)
  • Received : 2011.06.17
  • Accepted : 2011.11.08
  • Published : 2011.12.30

Abstract

To resolve ambiguities in speech act classification, various machine learning models have been proposed over the past 10 years. In this paper, we review these machine learning models and present the results of experimental comparison of three representative models, namely the decision tree, the support vector machine (SVM), and the maximum entropy model (MEM). In experiments with a goal-oriented dialogue corpus in the schedule management domain, we found that the MEM has lighter hardware requirements, whereas the SVM has better performance characteristics.

Keywords

References

  1. J. Allen, Natural Language Understanding, Menlo Park, CA: Benjamin/Cummings, 1995.
  2. L. Lambert and S. Carberry, "A tripartite plan-based model of dialogue," Proceedings of the 29th Annual Meeting on Association for Computational Linguistics, Berkeley, CA, 1991, June 18- 21, pp. 47-54.
  3. D. J. Litman and J. F. Allen, "A plan recognition model for subdialogues in conversations," Cognitive Science, vol. 11, no. 2, pp. 163-200, 1987. https://doi.org/10.1207/s15516709cog1102_4
  4. M. Nagata and T. Morimoto, "First steps towards statistical modeling of dialogue to predict the speech act type of the next utterance," Speech Communication, vol. 15, no. 3-4, pp. 193-203, 1994. https://doi.org/10.1016/0167-6393(94)90071-X
  5. J. Lee, G. C. Kim, and J. Seo, "A dialogue analysis model with statistical speech act processing for dialogue machine translation," Proceedings of Spoken Language Translation Workshop in conjunction with ACL/EACL, Madrid, Spain, 1997, July 11, pp. 10-15.
  6. N. Reithinger and M. Klesen, "Dialogue act classification using language models," Proceedings of EuroSpeech, Rhodos, Greece, 1997, pp. 2235-2238.
  7. K. Samuel, S. Carberry, and K. Vijay-Shanker, "Dialogue act tagging with transformation-based learning," Proceedings of the 17th International Conference on Computational Linguistics, Montreal, QC, 1998, pp. 1150-1156.
  8. A. Stolcke, N. Coccaro, R. Bates, P. Taylor, C. Van Ess-Dykema, K. Ries, E. Shriberg, D. Jurafsky, R. Martin, and M. Meteer, "Dialogue act modeling for automatic tagging and recognition of conversational speech," Computational Linguistics, vol. 26, no. 3, pp. 339-373, 2000. https://doi.org/10.1162/089120100561737
  9. C. T. Langley, "Analysis for speech translation using grammarbased parsing and automatic classification," Proceedings of Student Research Workshop at the 40th Annual Meeting of the Association of Computational Linguistics, Philadelphia, PA, 2002, July.
  10. H. Kim and J. Seo, "An efficient trigram model for speech act analysis in small training corpus," Journal of Cognitive Science, vol. 4, no. 1, pp. 107-120, 2003.
  11. N. Webb, M. Hepple, and Y. Wilks, "Dialogue act classification based on intra-utterance features," Proceedings of the AAAI Workshop on Spoken Language Understanding, Pittsburgh, PA, 2005, July 9-10.
  12. W. S. Choi, H. Kim, and J. Seo, "An integrated dialogue analysis model for determining speech acts and discourse structures," IEICE Transactions on Information and Systems, vol. E88-D, no. 1, pp. 150-157, 2005.
  13. H. Lee, H. Kim, and J. Seo, "Domain action classification using a maximum entropy model in a schedule management domain," AI Communications, vol. 21, no. 4, pp. 221-229, 2008.
  14. S. Kang, H. Kim, and J. Seo, "A reliable multidomain model for speech act classification," Pattern Recognition Letters, vol. 31, no. 1, pp. 71-74, 2010. https://doi.org/10.1016/j.patrec.2009.08.013
  15. M. J. Kim, J. H. Park, S. B. Kim, H. C. Rim, and D. G. Lee, "A comparative study on optimal feature identification and combination for korean dialogue act classification," Journal of Korean Institute of Information Scientists and Engineers: Software and Applications, vol. 35, no. 11, pp. 681-691, Nov 2008.
  16. K. Kim, H. Kim, and J. Seo, "A neural network model with feature selection for Korean speech act classification," International Journal of Neural Systems, vol. 14, no. 6, pp. 407-414, 2004. https://doi.org/10.1142/S0129065704002157
  17. Y. Yang and J. O. Pedersen, "A comparative study on feature selection in text categorization," Proceedings of the 14th International Conference on Machine Learning, Nashville, TN, 1997, July, pp. 412-420.
  18. E. Charniak, Statistical Language Learning, Cambridge, MA: MIT Press, 1993.
  19. S. Lee and J. Seo, "Korean speech act analysis system using hidden markov model with decision trees," International Journal of Computer Processing of Oriental Languages, vol. 15, no. 3, pp. 231-243, 2002. https://doi.org/10.1142/S0219427902000625
  20. D. Surendran and G. A. Levow, "Dialogue act tagging with support vector machines and hidden markov models," Proceedings of Interspeech/ICSLP, Pittsburgh, PA, 2006, Sep.
  21. H. Lee, H. Kim, and J. Seo, "Efficient domain action classification using neural networks," Neural Information Processing. Lecture Notes in Computer Science Vol. 4223, I. King, J. Wang, L. W. Chan, and D. Wang, Eds., Heidelberg, Germany: Springer Berlin, 2006, pp. 150-158.
  22. D. Kim, H. Kim, and J. Seo, "A statistical prediction model of speakers' intentions in a goal-oriented dialogue," Journal of Korean Institute of Information Scientists and Engineers: Software and Applications, vol. 35, no. 9, pp. 554-561, Sep 2008.
  23. J. D. Lafferty, A. McCallum, and F. C. N. Pereira, "Conditional random fields: probabilistic models for segmenting and labeling sequence data," Proceedings of the Eighteenth International Conference on Machine Learning, Williamstown, MA, 2001, June 28-July 1, pp. 282-289.
  24. J. C. Reynar and A. Ratnaparkhi, "A maximum entropy approach to identifying sentence boundaries," Proceedings of the 5th Conference on Applied Natural Language Processing, Washington, DC, 1997, March 31-April 3, pp. 16-19.
  25. J. R. Quinlan, C4.5: Programs for Machine Learning, San Mateo, CA: Morgan Kaufmann Publishers, 1993.
  26. T. Joachims, "SVMLight: Support Vector Machine Version: 6.02," http://svmlight.joachims.org/.
  27. E. S. Ristad, Maximum entropy modeling toolkit, Technical Report, Department of Computer Science, Princeton University, 1996.

Cited by

  1. A novel density-based clustering method using word embedding features for dialogue intention recognition vol.19, pp.4, 2016, https://doi.org/10.1007/s10586-016-0649-7
  2. Two-phase reanalysis model for understanding user intention vol.42, 2014, https://doi.org/10.1016/j.patrec.2013.12.015
  3. Cross-Lingual Annotation Projection for Weakly-Supervised Relation Extraction vol.13, pp.1, 2014, https://doi.org/10.1145/2529994
  4. Hierarchical speech-act classification for discourse analysis vol.34, pp.10, 2013, https://doi.org/10.1016/j.patrec.2013.03.008
  5. Improving domain action classification in goal-oriented dialogues using a mutual retraining method vol.45, 2014, https://doi.org/10.1016/j.patrec.2014.03.021
  6. An Integrated Neural Network Model for Domain Action Determination in Goal-Oriented Dialogues vol.9, pp.2, 2013, https://doi.org/10.3745/JIPS.2013.9.2.259
  7. Post-error Correction in Automatic Speech Recognition Using Discourse Information vol.14, pp.2, 2014, https://doi.org/10.4316/AECE.2014.02009
  8. An Efficient Framework for Development of Task-Oriented Dialog Systems in a Smart Home Environment vol.18, pp.5, 2018, https://doi.org/10.3390/s18051581