Vocabulary Analyzer Based on CEFR-J Wordlist for Self-Reflection (VACSR) Version 2

Yukiko Ohashi; Noriaki Katagiri; Takao Oshikiri

doi:10.22925/apjcr.2023.4.2.75

Asia Pacific Journal of Corpus Research (아시아태평양코퍼스연구)

Volume 4, Issue 2, pp. 75-87, 2023, eISSN 2733-8096

Institute for Corpus Research (국립인천대학교 코퍼스연구소)

Yukiko Ohashi (Yamazaki University of Animal Health Technology)
Noriaki Katagiri (Hokkaido University of Education)
Takao Oshikiri (Bunkyo Gakuin University)

Received: 2023.10.01
Accepted: 2023.12.10
Published: 2023.12.31

Abstract

This paper presents VACSR v.2.0, a revised version of the vocabulary analyzer for self-reflection (VACSR). The initial version automatically analyzes the occurrences and levels of vocabulary items in transcribed texts, reporting their frequency, the vocabulary items left unused, and the items not belonging to either scale. However, it overlooked words with multiple parts of speech because their headword representations were identical, and its result tables needed to be more explanatory when comparing different corpora. VACSR v.2.0 overcomes both limitations. First, unlike VACSR v.1, it distinguishes words by part of speech through syntactic parsing with Stanza, an open-source Python library, so that the same lexical item can be categorized separately under each of its parts of speech. Second, it addresses the limited clarity of VACSR v.1 by providing precise result output tables. The updated software compares the occurrence of vocabulary items in classroom corpora against each level of the Common European Framework of Reference-Japan (CEFR-J) wordlist. In a pilot study, two English classes taught by a preservice English teacher were converted into corpora, and VACSR v.2.0 showed that the headwords used mostly corresponded to CEFR-J level A1. In practice, VACSR v.2.0 will promote users' reflection on their vocabulary usage and can be applied to teacher training.
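The part-of-speech-aware lookup described in the abstract can be illustrated with a minimal sketch. It is not the VACSR implementation: the wordlist entries, their levels, the function name `profile`, and the shape of the result are all assumptions made for illustration, and the input is a hand-written list of (lemma, part of speech) pairs standing in for the output of a parser such as Stanza. The point is only that keying the wordlist on both the headword and its part of speech lets "book" as a noun and "book" as a verb be counted under different levels.

```python
from collections import Counter

# Hypothetical miniature wordlist: (headword, part of speech) -> level.
# Entries and levels are illustrative only, not taken from the CEFR-J wordlist.
TOY_WORDLIST = {
    ("book", "NOUN"): "A1",
    ("book", "VERB"): "A2",
    ("open", "VERB"): "A1",
    ("page", "NOUN"): "A1",
    ("teacher", "NOUN"): "A1",
}


def profile(tagged_tokens, wordlist):
    """Count tokens per level and list unused and off-list items.

    tagged_tokens: iterable of (lemma, pos) pairs, assumed to come from
    a tagger that has already lemmatized and POS-tagged the transcript.
    """
    level_counts = Counter()
    used = set()
    off_list = []
    for lemma, pos in tagged_tokens:
        key = (lemma.lower(), pos)
        level = wordlist.get(key)
        if level is None:
            # Token is not covered by the wordlist at any level.
            off_list.append(key)
        else:
            level_counts[level] += 1
            used.add(key)
    # Wordlist entries that never occurred in the transcript.
    unused = sorted(set(wordlist) - used)
    return {
        "level_counts": dict(level_counts),
        "unused": unused,
        "off_list": off_list,
    }


# Hand-tagged stand-in for a short classroom transcript.
sample = [
    ("open", "VERB"), ("book", "NOUN"), ("page", "NOUN"),
    ("book", "VERB"), ("book", "NOUN"), ("zebra", "NOUN"),
]
result = profile(sample, TOY_WORDLIST)
print(result)
```

With a headword-only key, the three occurrences of "book" would collapse into a single entry, which is the limitation of VACSR v.1 that the abstract describes.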

References

Anthony, L. (2022). AntConc (Version 4.2.0) [Computer Software]. Tokyo, Japan: Waseda University. Available from https://www.laurenceanthony.net/software
Coxhead, A. (1998). The development and evaluation of an academic word list (Master's thesis, Victoria University of Wellington, New Zealand).
Coxhead, A. (2000). A new academic word list. TESOL Quarterly, 34(2), 213-238. https://doi.org/10.2307/3587951
Kilgarriff, A., Baisa, V., Busta, J., Jakubicek, M., Kovar, V., Michelfeit, J., Rychly, P., & Suchomel, V. (2014). The sketch engine: Ten years on. Lexicography, 1, 7-36. https://doi.org/10.1007/s40607-014-0009-9
Negishi, M., Takada, T., & Tono, Y. (2013). A progress report on the development of the CEFR-J. In E. D. Galaczi & C. J. Weir (Eds.), Exploring language frameworks. Proceedings of the ALTE Krakow Conference, July 2001, 135-163.
Ohashi, Y., & Katagiri, N. (2020a). The Ratios of CEFR-J vocabulary usage compared with GSL and AWL in elementary EFL classrooms and suggestions of vocabulary items to be taught. Asia Pacific Journal of Corpus Research, 1(1), 35-65.
Ohashi, Y., Katagiri, N., & Oshikiri, T. (2022b). Developing classroom corpus tagger for teachers' reflective practice: A spoken language tagger to compile classroom corpora. English Corpus Studies, 29, 41-62.
Ohashi, Y., Katagiri, N., & Oshikiri, T. (2022). Vocabulary analyzer based on CEFR-J wordlist for self-reflection (VACSR): From classroom corpus compilation to self-reflection. International Journal of Language Learning and Applied Linguistics World, 31(1), 1-15.
Qi, P., Zhang, Y., Zhang, Y., Bolton, J., & Manning, C. (2020). Stanza: A Python Natural Language Processing Toolkit for Many Human Languages. https://nlp.stanford.edu/pubs/qi2020stanza.pdf
Schmitt, N. (2010). Researching Vocabulary: A Research Manual. Basingstoke: Palgrave Macmillan.
West, M. (1953). A General Service List of English Words. Longman, London.
Penn Treebank P.O.S. Tags. (n.d.). www.ling.upenn.edu. https://www.ling.upenn.edu/courses/Fall_2003/ling001/penn_treebank_pos.html
