http://dx.doi.org/10.9708/jksci.2011.16.12.083

Detecting Spelling Errors by Comparison of Words within a Document  

Kim, Dong-Joo (Dept. of Computer Engineering, Anyang University)
Abstract
Typographical errors caused by an author's mistyping occur frequently in documents prepared with word processors, unlike in formally published material. In such documents, the most common orthographical errors are spelling errors that result from striking a key adjacent to the intended one on the keyboard. Typical spelling checkers detect and correct these errors with a morphological analyzer: the morphological analysis module of the speller checks the well-formedness of each input word, and every word rejected by the analyzer is regarded as misspelled. However, if the morphological analyzer accepts a mistyped word, the checker treats it as correctly spelled. In this paper, I propose a simple method capable of detecting and correcting errors that previous methods cannot detect. The proposed method is based on the observation that typographical errors are generally not repeated and therefore tend to have very low frequency within a document. If the words generated by applying deletion, exchange, and transposition operations to each phoneme of a low-frequency word appear in the list of high-frequency words, some of them are considered to be the correctly spelled words. Some heuristic rules are also presented to reduce the number of candidates. The proposed method can detect some semantic errors, though not syntactic errors, and is also useful for scoring candidates.
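The frequency-based comparison described in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: it operates on characters rather than Korean phonemes, it omits the heuristic rules for pruning candidates, and the function names and the thresholds `low_max` and `high_min` are hypothetical values chosen only for illustration.

```python
from collections import Counter


def edit_candidates(word, alphabet):
    """Strings one deletion, exchange, or adjacent transposition away from `word`."""
    candidates = set()
    for i in range(len(word)):
        # Deletion: drop the unit at position i.
        candidates.add(word[:i] + word[i + 1:])
        # Exchange: replace the unit at position i with a different one.
        for ch in alphabet:
            if ch != word[i]:
                candidates.add(word[:i] + ch + word[i + 1:])
        # Transposition: swap the units at positions i and i + 1.
        if i + 1 < len(word):
            candidates.add(word[:i] + word[i + 1] + word[i] + word[i + 2:])
    candidates.discard(word)
    candidates.discard("")
    return candidates


def suspect_misspellings(words, low_max=1, high_min=3):
    """Map each low-frequency word to the high-frequency words reachable by one edit.

    `low_max` and `high_min` are assumed thresholds, not values from the paper.
    """
    counts = Counter(words)
    alphabet = set("".join(counts))  # units observed in this document only
    frequent = {w for w, c in counts.items() if c >= high_min}
    suspects = {}
    for word, count in counts.items():
        if count > low_max:
            continue  # only low-frequency words are treated as suspects
        matches = edit_candidates(word, alphabet) & frequent
        if matches:
            # Most frequent candidate first; ties broken alphabetically.
            suspects[word] = sorted(matches, key=lambda w: (-counts[w], w))
    return suspects


text = "the form is long the form is short the form is fine but the from is wrong"
print(suspect_misspellings(text.split()))
```

In this toy input, "from" is a valid word that a dictionary-based checker would accept, yet it occurs once and lies one transposition away from the frequent word "form", so it is flagged. For Korean text, one possible adaptation (an assumption, not something stated in the abstract) is to decompose Hangul syllables into jamo before comparison, for example with `unicodedata.normalize("NFD", word)`.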
Keywords
Mistyping; Typographical Error; Morphological Analysis; Spell Checker