Browse > Article
http://dx.doi.org/10.3745/KTSDE.2015.4.1.45

Context-sensitive Word Error Detection and Correction for Automatic Scoring System of English Writing  

Choi, Yong Seok (충남대학교 정보통신공학과)
Lee, Kong Joo (충남대학교 정보통신공학과)
Publication Information
KIPS Transactions on Software and Data Engineering / v.4, no.1, 2015 , pp. 45-56 More about this Journal
Abstract
In this paper, we present a method that can detect context-sensitive word errors and generate correction candidates. Spelling error detection is one of the most widespread research topics, however, the approach proposed in this paper is adjusted for an automated English scoring system. A common strategy in context-sensitive word error detection is using a pre-defined confusion set to generate correction candidates. We automatically generate a confusion set in order to consider the characteristics of sentences written by second-language learners. We define a word error that cannot be detected by a conventional grammar checker because of part-of-speech ambiguity, and propose how to detect the error and generate correction candidates for this kind of error. An experiment is performed on the English writings composed by junior-high school students whose mother tongue is Korean. The f1 value of the proposed method is 70.48%, which shows that our method is promising comparing to the current-state-of-the art.
Keywords
Context-sensitive Word Error; Error Detection/Correction; Confusion Set; English Automatic Scoring; Grammar-level Word Error;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 Xu Sun, Jianfeng Gao, Daniel Micol, and Chris Quirk, "Learning Phrase-Based Spelling Error Models from Clickthrough Data," pp.266-274. The Association for Computer Linguistics, 2010.
2 GyoungHo Lee and Kong Joo Lee, "Developing an Automated English Sentence Scoring System for Middle-school Level Writing Test by Using Machine Learning Techniques," Journal of KISE, Vol.41, No.11, 2014.   DOI
3 Andrew R. Golding, "A Bayesian hybrid method for context-sensitive spelling correction," In Proceedings of the Third Workshop on Very Large Corpora, pp.39-53, 1995.
4 Shane Bergsma, Dekang Lin, Randy Goebel, "Web-Scale N-gram Models for Lexical Disambiguation," In Proceedings of the Twenty-First International Joint Conference on Artificial Intelligence (IJCAI-09), pp.1507-1512, 2009.
5 Hermet, M., Desilets, A., Szpakowicz, S., "Using the Web as a Linguistic Resource to Automatically Correct Lexico-Syntactic Errors," In The 6th Edition of the Language Resources and Evaluation Conference (LREC 08). May 28-30, 2008. Marrakech, Morocco.
6 Michael Gamon, Claudia Leacock, Chris Brockett, Jianfeng Gao, and Alexander Klementiev, "Using Statistical Techniques and Web Search to Correct ESL Errors," 2009, CALICO Journal, Vol.26, No.3, pp.491-511.
7 Lawrence Phillips, "The Double Metaphone Search Algorithm", C/C++ Users Journal, Vol.18, No.6, June, 2000.
8 Mitchell P. Marcus, Mary Ann Marcinkiewicz, and Beatrice Santorini, Building a large annotated corpus of English: the penn treebank, Journal Computational Linguistics-Special issue on using large corpora: II archive Vol.19, Issue.2, June, 1993, Pp.313-330.
9 https://catalog.ldc.upenn.edu/LDC2006T13
10 A Winnow-Based Approach to Context-Sensitive Spelling Correction, Andrew R. Golding and Dan Roth, 1999.
11 Kong Joo Lee and Sonwook Lee, "A comparison of grammatical error detection techniques for an automated English scoring system," Journal of the Korean Society of Marine Engineering 01/2013; 37(7).   과학기술학회마을   DOI