Automatic Selection of Similar Sentences for Teaching Writing in Elementary School

초등 글쓰기 교육을 위한 유사 문장 자동 선별

  • Park, Youngki (School of Computer Science and Engineering, Seoul National University)
  • Received: 2016.06.15
  • Accepted: 2016.06.24
  • Published: 2016.08.31

Abstract

When elementary students write their own sentences, it is often educationally beneficial for them to compare those sentences with similar sentences written by other people. However, this is impractical in most classrooms, because it is burdensome for teachers to review every sentence that students write. To address this problem, we propose a novel approach for the automatic selection of similar sentences based on a three-step process: 1) extracting subword units from the word-level sentences, 2) training a model with the encoder-decoder architecture, and 3) using an approximate k-nearest neighbor search algorithm to find similar sentences. Experimental results show that the proposed approach achieves an accuracy of 75% on our test data.
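Step 1 of the process above, extracting subword units, can be illustrated with a minimal sketch. The abstract does not say which segmentation method the paper uses; the code below assumes byte-pair encoding (BPE), one common way to obtain subword units, and the function names (`learn_bpe`, `segment`) and the `</w>` end-of-word marker are illustrative choices, not taken from the paper.

```python
from collections import Counter


def get_pair_counts(vocab):
    """Count adjacent symbol pairs over all words, weighted by word frequency."""
    pairs = Counter()
    for symbols, freq in vocab.items():
        for a, b in zip(symbols, symbols[1:]):
            pairs[(a, b)] += freq
    return pairs


def merge_pair(pair, vocab):
    """Return a new vocabulary in which every occurrence of `pair` is fused."""
    merged = {}
    for symbols, freq in vocab.items():
        out = []
        i = 0
        while i < len(symbols):
            if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
                out.append(symbols[i] + symbols[i + 1])
                i += 2
            else:
                out.append(symbols[i])
                i += 1
        key = tuple(out)
        merged[key] = merged.get(key, 0) + freq
    return merged


def learn_bpe(words, num_merges):
    """Learn up to `num_merges` merge rules from a list of words (assumed BPE)."""
    # Each word starts as a sequence of characters plus an end-of-word marker.
    vocab = Counter(tuple(w) + ("</w>",) for w in words)
    merges = []
    for _ in range(num_merges):
        pairs = get_pair_counts(vocab)
        if not pairs:
            break
        best = max(pairs, key=pairs.get)  # most frequent adjacent pair
        vocab = merge_pair(best, vocab)
        merges.append(best)
    return merges


def segment(word, merges):
    """Split a word into subword units by replaying the learned merges in order."""
    symbols = list(word) + ["</w>"]
    for pair in merges:
        i = 0
        while i < len(symbols) - 1:
            if (symbols[i], symbols[i + 1]) == pair:
                symbols[i:i + 2] = [symbols[i] + symbols[i + 1]]
            else:
                i += 1
    return symbols
```

For example, learning two merges from the words `low`, `low`, `lower`, `lowest` fuses `l`+`o` and then `lo`+`w`, so the unseen word `lowly` is segmented into `low`, `l`, `y`, and the end-of-word marker. Because the procedure operates on Unicode characters, the same sketch applies to Korean text, although how the paper actually decomposes Korean words is not stated in the abstract.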

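Examining sentences similar to one's own is an effective method for teaching writing in elementary school, but it is difficult to use in practice because it requires a teacher's guidance every time a student writes. To overcome this limitation, this paper proposes a method in which a computer automatically selects, in real time, sentences similar to the one a student has written. The method consists of three steps: splitting words into their constituent units, training an encoder-decoder model with the split words as input, and searching with the abstracted sentence representations obtained from the model. In experiments, the method achieved an accuracy of 75% on a small dataset, which indicates strong potential for practical use. With this method, students who want to correct an awkward sentence or learn new expressions can easily refer to good example sentences written by others, which is expected to be of great help in improving their writing ability.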
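The search step described above can be sketched as a nearest-neighbor lookup over sentence vectors. This is a sketch under stated assumptions, not the paper's algorithm: the paper uses an approximate k-nearest neighbor search, whereas the code below substitutes an exact brute-force search for clarity. It also assumes that the encoder yields one fixed-length vector per sentence and that cosine similarity is the similarity measure; neither detail is given in the abstract, and the names `cosine` and `k_nearest` are hypothetical.

```python
import heapq
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def k_nearest(query_vec, corpus_vecs, k):
    """Return the k most similar corpus entries as (index, similarity) pairs.

    Exact brute-force search: every corpus vector is compared with the query.
    An approximate method would trade some accuracy for lower search time.
    """
    scored = ((cosine(query_vec, vec), idx) for idx, vec in enumerate(corpus_vecs))
    return [(idx, score) for score, idx in heapq.nlargest(k, scored)]
```

Given a student's sentence vector as `query_vec` and the vectors of stored example sentences as `corpus_vecs`, the returned indices identify the example sentences to show. Brute force costs time linear in the corpus size per query, which is the cost an approximate search is meant to reduce when results are needed in real time.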
