• Title/Summary/Keyword: Corpus-based Study

Search Result 204, Processing Time 0.022 seconds

Novice Corpus Users' Gains and Views on Corpus-based Lexical Development: A Case Study of COVID-19-related Expressions

  • Chen, Mei-Hua
    • Asia Pacific Journal of Corpus Research
    • /
    • v.2 no.1
    • /
    • pp.1-11
    • /
    • 2021
  • Recently, corpus assisted vocabulary instruction has been attracting a lot of interest. Most studies have focused on understanding language learners' receptive vocabulary knowledge. Limited attention has been paid to learners' productive competence. To fill this gap, this study attended to learners' productive lexical development in terms of form, meaning and use respectively. This study introduced EFL learners to the corpus-based language pedagogy to learn COVID-19 theme-based vocabulary. To investigate the gains and views of 33 EFL first-year college students, a sentence completion task and a questionnaire were developed. Learners' productive performances in the three lexical knowledge aspects (i.e., form, meaning and use) were particularly targeted. The results revealed that the students achieved significant gains in all aspects regardless of their proficiency level. In particular, the less proficient students achieved greater knowledge retention compared with their highly proficient counterparts. Meanwhile, students showed positive attitudes towards the corpus-based approach to vocabulary learning.

The effects of corpus-based vocabulary tasks on high school students' English vocabulary learning and attitude (코퍼스를 기반으로 한 어휘 과제가 고등학생의 영어 어휘 학습과 태도에 미치는 영향)

  • Lee, Hyun Jin;Lee, Eun-Joo
    • English Language & Literature Teaching
    • /
    • v.16 no.4
    • /
    • pp.239-265
    • /
    • 2010
  • This study investigates the effects of corpus-based vocabulary tasks on the acquisition of English vocabulary in an attempt to explore the influence of corpus use on EFL pedagogy. For this to be realized, a total of 40 Korean high school students participated in the study over a 4-week period. An experimental group used a set of corpus-based tasks for vocabulary learning, whereas a control group carried out a traditional task (i.e., the L1-L2 translation) for vocabulary learning. To assess learning gains, the students were asked to complete the pre- and post-treatment tests measuring the word form, meaning, and use aspects of target lexical items. Results of the study indicate that in the experimental group the corpus-based vocabulary tasks were beneficial for the learning of word forms and use. In particular, corpus-based benefits were greatest in the low-proficiency EFL learners' collocational aspects of vocabulary use. On the other hand, in the control group, the traditional vocabulary tasks benefited the meaning aspects of target vocabulary items the most. In addition, survey results revealed that most students were positive about the corpus-based learning experience although some expressed reservations about the heavy cognitive load and the time-consuming nature of the analysis of corpus data primarily due to learners' lack of language proficiency.

  • PDF

A Corpus-Based Study on Language Features and Literary Themes in the Yellow Wall-Paper and Herland by Charlotte Perkins Gilman

  • Lu, Hui-Chuan;Liu, Kai-Ling;Yeh, Chien-Ting;Chen, Ya-Jie
    • Asia Pacific Journal of Corpus Research
    • /
    • v.3 no.1
    • /
    • pp.21-34
    • /
    • 2022
  • This study aims to apply corpus-based approach to analyze The Yellow Wall-Paper and Herland written by Charlotte Perkins Gilman, a women's rights activist in the late nineteenth-century America. Although both works have attracted feminists' attention to the woman question that concerned Gilman, discussion on her language features and their relation to the literary themes of these two works is still in need. In this corpus-based analysis, we argue that the main themes of different literary works can be revealed through linguistic patterns identified by number and gender features of nouns and pronouns in the contrast of two works and a balanced corpus. The linguistic features (number and gender) have been related with two themes, the 'group and individual' and the 'feminine and masculine', and are further interpreted in terms of mothering and feminine consciousness. By adopting linguistic approach, our study provides quantitative and qualitative evidence to verify the established themes and arguments of these literary texts.

A Corpus-based Lexical Analysis of the Speech Texts: A Collocational Approach

  • Kim, Nahk-Bohk
    • English Language & Literature Teaching
    • /
    • v.15 no.3
    • /
    • pp.151-170
    • /
    • 2009
  • Recently speech texts have been increasingly used for English education because of their various advantages as language teaching and learning materials. The purpose of this paper is to analyze speech texts in a corpus-based lexical approach, and suggest some productive methods which utilize English speaking or writing as the main resource for the course, along with introducing the actual classroom adaptations. First, this study shows that a speech corpus has some unique features such as different selections of pronouns, nouns, and lexical chunks in comparison to a general corpus. Next, from a collocational perspective, the study demonstrates that the speech corpus consists of a wide variety of collocations and lexical chunks which a number of linguists describe (Lewis, 1997; McCarthy, 1990; Willis, 1990). In other words, the speech corpus suggests that speech texts not only have considerable lexical potential that could be exploited to facilitate chunk-learning, but also that learners are not very likely to unlock this potential autonomously. Based on this result, teachers can develop a learners' corpus and use it by chunking the speech text. This new approach of adapting speech samples as important materials for college students' speaking or writing ability should be implemented as shown in samplers. Finally, to foster learner's productive skills more communicatively, a few practical suggestions are made such as chunking and windowing chunks of speech and presentation, and the pedagogical implications are discussed.

  • PDF

Issues of Discourse Studies in Korean Language Education (한국어교육학에서의 담화 연구 분석)

  • Kang, Hyounhwa
    • Journal of Korean language education
    • /
    • v.23 no.1
    • /
    • pp.219-256
    • /
    • 2012
  • The aim of this study is to observe the trend of discourse study in language education and analyze the main issues by investigating the literatures related to discourse in Korean language education in the last ten years. This study observed the discourse study conducted in Korean language education from the perspectives of study subject, study method and study data. Moreover, based on the results, it estimated the achievements and effectiveness of the discourse study conducted in Korean language education. The subject of discourse study was mainly dealt with discourse function, discourse pattern, discourse marker, discourse structure. In the study methods, analysis of corpus and survey were mainly used as the study methods, and spoken corpus, written corpus and semi-spoken corpus were used as study materials. In particular, the semi-spoken corpus was used at a very high rate among them. This showed that discourse study in Korean language education was mainly focused on spoken corpus study. This study divided the detailed field of Korean language education into four fields of linguistic knowledge, communication function, teaching activities and learning activities, and observed the trends of discourse study in each field. Overall, it was recognized that relatively many studies were focused on linguistic knowledge, particularly in pragmatic perspective. It can be said that the study based on discourse has a language educational effectiveness in that it is based on actual data and improves practical communication skills in the environment of various languages.

A Study on the Diachronic Evolution of Ancient Chinese Vocabulary Based on a Large-Scale Rough Annotated Corpus

  • Yuan, Yiguo;Li, Bin
    • Asia Pacific Journal of Corpus Research
    • /
    • v.2 no.2
    • /
    • pp.31-41
    • /
    • 2021
  • This paper makes a quantitative analysis of the diachronic evolution of ancient Chinese vocabulary by constructing and counting a large-scale rough annotated corpus. The texts from Si Ku Quan Shu (a collection of Chinese ancient books) are automatically segmented to obtain ancient Chinese vocabulary with time information, which is used to the statistics on word frequency, standardized type/token ratio and proportion of monosyllabic words and dissyllabic words. Through data analysis, this study has the following four findings. Firstly, the high-frequency words in ancient Chinese are stable to a certain extent. Secondly, there is no obvious dissyllabic trend in ancient Chinese vocabulary. Moreover, the Northern and Southern Dynasties (420-589 AD) and Yuan Dynasty (1271-1368 AD) are probably the two periods with the most abundant vocabulary in ancient Chinese. Finally, the unique words with high frequency in each dynasty are mainly official titles with real power. These findings break away from qualitative methods used in traditional researches on Chinese language history and instead uses quantitative methods to draw macroscopic conclusions from large-scale corpus.

Metadiscourse in the Bank Negara Malaysia Governor's Speech Texts

  • Aziz, Roslina Abdul;Baharum, Norzie Diana
    • Asia Pacific Journal of Corpus Research
    • /
    • v.2 no.2
    • /
    • pp.1-15
    • /
    • 2021
  • The study aims to explore the use of metadiscourse in the Bank Negara Malaysia Governor's speeches based on Hyland's Interpersonal Model of Metadiscourse. The corpus data consist of 343 speech texts, which were extracted from the Malaysian Corpus of Financial English (MacFE), amounting to 688,778 tokens. Adopting both quantitative and qualitative approaches to data analysis the study investigates (1) the overall use of metadiscourse in the Bank Negara Governor's speech texts and (2) the functions of the most prominent metadiscourse resources used and their functions in the speech texts. The findings reveal that the Governor's speech texts to be interactional rather than interactive, revealing a rich distribution of interactional metadiscourse resources, namely engagement markers, self-mention, hedges, boosters and attitude markers throughout the texts. The interactional metadiscourse resources function to establish speaker-audience engagement and alignment of views, as well as to express degree of uncertainty and certainty and attitudes. The study concludes that the speech texts are not merely informational or propositional, but rather interpersonal.

A Transformation-Based Learning Method on Generating Korean Standard Pronunciation

  • Kim, Dong-Sung;Roh, Chang-Hwa
    • Proceedings of the Korean Society for Language and Information Conference
    • /
    • 2007.11a
    • /
    • pp.241-248
    • /
    • 2007
  • In this paper, we propose a Transformation-Based Learning (TBL) method on generating the Korean standard pronunciation. Previous studies on the phonological processing have been focused on the phonological rule applications and the finite state automata (Johnson 1984; Kaplan and Kay 1994; Koskenniemi 1983; Bird 1995). In case of Korean computational phonology, some former researches have approached the phonological rule based pronunciation generation system (Lee et al. 2005; Lee 1998). This study suggests a corpus-based and data-oriented rule learning method on generating Korean standard pronunciation. In order to substituting rule-based generation with corpus-based one, an aligned corpus between an input and its pronunciation counterpart has been devised. We conducted an experiment on generating the standard pronunciation with the TBL algorithm, based on this aligned corpus.

  • PDF

The Acquisition of Spanish Clitic Pronouns as a Third Language: A Corpus-based Study

  • Lu, Hui-Chuan;Cheng, An Chung;Chu, Yu-Hsin
    • Asia Pacific Journal of Corpus Research
    • /
    • v.1 no.2
    • /
    • pp.15-26
    • /
    • 2020
  • This corpus-based study investigated third language acquisition by Taiwanese college students in learning Spanish clitic pronouns at beginning and intermediate levels. It examined the acquisition sequences of Spanish clitic pronouns of the Chinese-speaking learners whose second language was English and third language was Spanish. The results indicated that indirect object pronouns (OP) preceded direct OP (case), first person preceded third person OP (person), masculine preceded feminine OP (gender), and animate preceded inanimate OP (animacy). The findings presented similar patterns as those of previous studies on English-speaking learners of Spanish. In further comparisons of the target forms in Chinese, English, and Spanish, the results suggested that L1 Chinese had strong influence on L3 Spanish, which accounts for the challenges that Taiwanese learners of Spanish face as they learn the Spanish clitic pronouns in the beginning stage.

Using Corpora for Studying English Grammar

  • Kwon, Heok-Seung
    • Korean Journal of English Language and Linguistics
    • /
    • v.4 no.1
    • /
    • pp.61-81
    • /
    • 2004
  • This paper will look at some grammatical phenomena which will illustrate some of the questions that can be addressed with a corpus-based approach. We will use this approach to investigate the following subjects in English grammar: number ambiguity, subject-verb concord, concord with measure expressions, and (reflexive) pronoun choice in coordinated noun phrases. We will emphasize the distinctive features of the corpus-based approach, particularly its strengths in investigating language use, as opposed to traditional descriptions or prescriptions of structure in English grammar. This paper will show that a corpus-based approach has made it possible to conduct new kinds of investigations into grammar in use and to expand the scope of earlier investigations. Native speakers rarely have accurate information about frequency of use. A large representative corpus (i.e., The British National Corpus) is one of the most reliable sources of frequency information. It is important to base an analysis of language on real data rather than intuition. Any description of grammar is more complete and accurate if it is based on a body of real data.

  • PDF