DOI QR코드

DOI QR Code

A comparison of grammatical error detection techniques for an automated english scoring system

  • Lee, Songwook (Department of Computer Science and Information Engineering, Korea National University of Transportation) ;
  • Lee, Kong Joo (Dept. of Information Communications Engineering, Chungnam National University)
  • 투고 : 2013.09.11
  • 심사 : 2013.10.17
  • 발행 : 2013.11.30

초록

Detecting grammatical errors from a text is a long-history application. In this paper, we compare the performance of two grammatical error detection techniques, which are implemented as a sub-module of an automated English scoring system. One is to use a full syntactic parser, which has not only grammatical rules but also extra-grammatical rules in order to detect syntactic errors while paring. The other one is to use a finite state machine which can identify an error covering a small range of an input. In order to compare the two approaches, grammatical errors are divided into three parts; the first one is grammatical error that can be handled by both approaches, and the second one is errors that can be handled by only a full parser, and the last one is errors that can be done only in a finite state machine. By doing this, we can figure out the strength and the weakness of each approach. The evaluation results show that a full parsing approach can detect more errors than a finite state machine can, while the accuracy of the former is lower than that of the latter. We can conclude that a full parser is suitable for detecting grammatical errors with a long distance dependency, whereas a finite state machine works well on sentences with multiple grammatical errors.

키워드

참고문헌

  1. Yongmei Shi and Lina Zhou. "Error detection using linguistic features." Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, pp. 41-48 2005.
  2. George E. Heidorn, Intelligent Writing Assistance, Handbook of Natural Language Processing. Robert Dale, Hermann Moisl and Harold Somers (Eds), CRC Press, 2000.
  3. Lisa N. Michaud, Kathleen F. McCoy, and Christopher A. Pennington. "An intelligent tutoring system for deaf learners of written english." Proceedings of the Fourth International ACM Conference on Assistive Technologies, pp. 92-100, 2000.
  4. Emily M. Bender, Dan Flickinger, Stephan Oepen, Annemarie Walsh, and Timothy Baldwin. "Arboretum: Using a precision grammar for grammar checking in call." Proceedings of InSTIL/ICALL Symposium on Computer Assisted Learning., pp. 83-86, 2004.
  5. Wael H. Gomaa and Aly A. Fahmy, "Short answer grading using string similarity and corpus-based similarity," International Journal of Advanced Computer Science and Application, vol. 3, no. 11, pp. 115-121, 2012.
  6. Sylvana Sofkova Hashemi, Robin Cooper, and Robert Andersson, "Positive grammar checking: A finite state approach," Computational Linguistics and Intelligent Text Processing Lecture Notes in Computer Science, vol. 2588, pp. 635-646, 2003.
  7. Kong Joo Lee, "Compositional rules of Korean auxiliary predicates for sentiment analysis," Journal of the Korean Society of Marine Engineering, vol. 37, no. 3, pp. 291-299, 2013. https://doi.org/10.5916/jkosme.2013.37.3.291
  8. Kong Joo Lee, Songwook Lee, and Jee Eun Kim, "A bidirectional Korean-Japanese statistical machine translation system by using MOSES," Journal of the Korean Society of Marine Engineering, vol. 36, no. 5, pp. 683-693, 2012. https://doi.org/10.5916/jkosme.2012.36.5.683
  9. Emmanuel Roche and Yves Schabes, Finite-State Language Processing, Cambridge, The MIT Press, 1997.
  10. Kong Joo Lee, Yong-Seok Choi, and Jee Eun Kim, "Building an automated English sentence evaluation system for students learning English as a second language," Computer Speech & Language, vol. 25, no. 2, pp. 246-260, 2011. https://doi.org/10.1016/j.csl.2010.05.001
  11. Guihua Sun, Xiaohua Liu, Gao Cong, Ming Zhou, Zhonguang Xiong, John Lee, and Chin-Yew Lin, "Detecting erroneous sentences using automatically mined sequential patterns," Processings of the 45th Annual Meeting of the Association of Computational Linguistics, pp. 81-88, 2007.
  12. Jian Pei, Jiawei Han, Behzad Mortazavi-Asl, and Helen Pinto, "Prefixspan: Mining sequential patterns efficiently by prefix-projected pattern growth," Proceedings of Proceedings of International Conference on Data Engineering, pp. 215-224, 2001.
  13. Michael Gamon, Claudia Leacock, Chris Brocket, W. B. Dolan, Jianfeng Gao, Dmitriy Belenko, and Alexander Klementiev, "Using statistical techniques and web search to correct ESL errors," Journal of The Computer Assisted Language Instruction Consortium, vol. 26, no. 3, pp. 491-511, 2009.

피인용 문헌

  1. 기계학습을 이용한 중등 수준의 단문형 영어 작문 자동 채점 시스템 구현 vol.41, pp.11, 2014, https://doi.org/10.5626/jok.2014.41.11.911
  2. 영작문 자동 채점 시스템을 위한 문맥 고려 단어 오류 검사기 vol.4, pp.1, 2013, https://doi.org/10.3745/ktsde.2015.4.1.45