• Title/Summary/Keyword: Corpus-based Study

Search Result 204, Processing Time 0.023 seconds

Non-Discourse Marker Uses of So in EFL Writings: Functional Variability among Asian Learners

  • Sato, Shie
    • Asia Pacific Journal of Corpus Research
    • /
    • v.1 no.2
    • /
    • pp.27-39
    • /
    • 2020
  • This paper examines the frequency and distribution of the so-called "non-discourse marker functions" of so in essay writings produced by 200 L1 English speakers and 1,300 EFL learners in China, Japan, Korea, and Taiwan. Based on the data drawn from the International Corpus Network of Asian Learners of English, this study compares EFL learners and L1 English speakers' uses of so, identifying four grammatical uses, as (1) an adverb, (2) part of a fixed phrase, (3) a pro-form, and (4) a conjunction phrase specifying purpose. This study aims to show the wide variability among EFL learners with different L1s, identifying the tendency of usage both common among and specific to the sub-groups of EFL learners. The findings suggest that the learners demonstrate patterns distinctively different from those of L1 English speakers, indicating an underuse of so as a marker expressing "purpose" and an overuse as part of fixed phrases. Compared to L1 English speakers, the learners also tend to overuse so in the discourse marker functions, regardless of their L1s. The study proposes pedagogical implications focusing on discourse flow and diachronic aspects of so in order to understand its multifunctionality, although the latter is primarily suggested for advanced learners.

Examining Suicide Tendency Social Media Texts by Deep Learning and Topic Modeling Techniques (딥러닝 및 토픽모델링 기법을 활용한 소셜 미디어의 자살 경향 문헌 판별 및 분석)

  • Ko, Young Soo;Lee, Ju Hee;Song, Min
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.32 no.3
    • /
    • pp.247-264
    • /
    • 2021
  • This study aims to create a deep learning-based classification model to classify suicide tendency by suicide corpus constructed for the present study. Also, to analyze suicide factors, the study classified suicide tendency corpus into detailed topics by using topic modeling, an analysis technique that automatically extracts topics. For this purpose, 2,011 documents of the suicide-related corpus collected from social media naver knowledge iN were directly annotated into suicide-tendency documents or non-suicide-tendency documents based on suicide prevention education manual issued by the Central Suicide Prevention Center, and we also conducted the deep learning model(LSTM, BERT, ELECTRA) performance evaluation based on the classification model, using annotated corpus data. In addition, one of the topic modeling techniques, LDA identified suicide factors by classifying thematic literature, and co-word analysis and visualization were conducted to analyze the factors in-depth.

ToBI and beyond: Phonetic intonation of Seoul Korean ani in Korean Intonation Corpus (KICo)

  • Ji-eun Kim
    • Phonetics and Speech Sciences
    • /
    • v.16 no.1
    • /
    • pp.1-9
    • /
    • 2024
  • This study investigated the variation in the intonation of Seoul Korean interjection ani across different meanings ("no" and "really?") and speech levels (Intimate and Polite) using data from Korean Intonation Corpus (KICo). The investigation was conducted in two stages. First, IP-final tones in the dataset were categorized according to the K-ToBI convention (Jun, 2000). While significant relationships were observed between the meaning of ani and its IP-final tones, substantial overlap between groups was notable. Second, the F0 characteristics of the final syllable of ani were analyzed to elucidate the apparent many-to-many relationships between intonation and meaning/speech level. Results indicated that these seemingly overlapping relationships could be significantly distinguished. Overall, this study advocates for a deeper analysis of phonetic intonation beyond ToBI-based categorical labels. By examining the F0 characteristics of the IP-final syllable, previously unclear connections between meaning/speech level and intonation become more comprehensible. Although ToBI remains a valuable tool and framework for studying intonation, it is imperative to explore beyond these categories to grasp the "distinctiveness" of intonation, thereby enriching our understanding of prosody.

Formulaic Language Development in Asian Learners of English: A Comparative Study of Phrase-frames in Written and Oral Production

  • Yoon Namkung;Ute Romer
    • Asia Pacific Journal of Corpus Research
    • /
    • v.4 no.2
    • /
    • pp.1-39
    • /
    • 2023
  • Recent research in usage-based Second Language Acquisition has provided new insights into second language (L2) learners' development of formulaic language (Wulff, 2019). The current study examines the use of phrase-frames, which are recurring sequences of words including one or more variable slots (e.g., it is * that), in written and oral production data from Asian learners of English across four proficiency levels (beginner, low-intermediate, high-intermediate, advanced) and native English speakers. The variability, predictability, and discourse functions of the most frequent 4-word phrase-frames from the written essay and spoken dialogue sub-corpora of the International Corpus Network of Asian Learners of English (ICNALE) were analyzed and then compared across groups and modes. The results revealed that while learners' phrase-frames in writing became more variable and unpredictable as proficiency increased, no clear developmental patterns were found in speaking, although all groups used more fixed and predictable phrase-frames than the reference group. Further, no developmental trajectories in the functions of the most frequent phrase-frames were found in both modes. Additionally, lower-level learners and the reference group used more variable phrase-frames in speaking, whereas advanced-level learners showed more variability in writing. This study contributes to a better understanding of the development of L2 phraseological competence.

AP, IP Prediction For Corpus-based Korean Text-To-Speech (코퍼스 방식 음성합성에서의 개선된 운율구 경계 예측)

  • Kwon, O-Hil;Hong, Mun-Ki;Kang, Sun-Mee;Shin, Ji-Young
    • Speech Sciences
    • /
    • v.9 no.3
    • /
    • pp.25-34
    • /
    • 2002
  • One of the most important factor in the performance of Korean text-to-speech system is the prediction of accentual and intonational phrase boundary. The previous method of prediction shows only the 75-85% which is not proper in the practical and commercial system. Therefore, more accurate prediction must be needed in the practical system. In this study, we propose the simple and more accurate method of the prediction of AP, IP.

  • PDF

The pattern of use by gender and age of the discourse markers 'a', 'eo', and 'eum' (담화표지 '아', '어', '음'의 성별과 연령별 사용 양상)

  • Song, Youngsook;Shim, Jisu;Oh, Jeahyuk
    • Phonetics and Speech Sciences
    • /
    • v.12 no.4
    • /
    • pp.37-45
    • /
    • 2020
  • This paper quantitatively calculated the speech frequency of the discourse markers 'a', 'eo', and 'eum' and the speech duration of these discourse markers using the Seoul Corpus, a spontaneous speech corpus. The sound durations were confirmed with Praat, the Seoul Corpus was analyzed with Emeditor, and the results were presented by statistical analysis with R. Based on the corpus analysis, the study investigated whether a particular factor is preferred by speakers of particular categories. The most prominent feature of the corpus is that the sound durations of female speakers were longer than those of men when using the 'eum' discourse marker in a final position. In age-related variables, teenagers uttered 'a' more than 'eo' in an initial position when compared to people in their 40s. This study is significant because it has quantitatively analyzed the discourse markers 'a', 'eo', and 'eum' by gender and age. In order to continue the discussion, more precise research should be conducted considering the context. In addition, similarities can be found in "e" and "ma" in Japanese(Watanabe & Ishi, 2000) and 'uh', 'um' in English(Gries, 2013). afterwards, a study to identify commonalities and differences can be predicted by using the cross-linguistic analysis of the discourse.

Effects of Oja-Shingiwhan in Contracted Corpus Cavernosum Smooth Muscle (五子腎氣丸이 음경해면체 평활근의 수축에 미치는 영향)

  • Park, Jeong Su;Ahn, Sang Hyun;Park, Sun Young
    • Journal of Physiology & Pathology in Korean Medicine
    • /
    • v.30 no.5
    • /
    • pp.308-313
    • /
    • 2016
  • The purpose of this study is to investigate the effects of Oja-Shingiwhan(OS) in contracted corpus cavernosum smooth muscle and its mechanism. To evaluate the relaxation of OS in contracted corpus cavernosum, OS was treated in strips which were precontracted with phenylephrine(PE). To examine its mechanism, OS was treated into corporal strips contracted by PE after pretreatment of Nω-nitro-L-arginine(L-NNA) and compared with non-pretreatment of L-NNA. In calcium chloride(Ca2+)-free krebs solution, Ca2+ 1 mM was treated into corporal strips contracted by PE after pretreatment of OS and compared with non-pretreatment of OS. action were measured by histochemical, immunohistochemical methods. OS significantly affected on the relaxation of corporal strips, and the relaxation effects were inhibited by pretreatment of L-NNA. Contractions induced by Ca2+ influx were inhibited by pretreatment of OS in Ca2+-free krebs solution. OS increased eNOS positive reaction in corpus cavernosum, but decreased PDE-5 positive reaction. These result suggest that the effect of OS in contracted corpus cavernosum smooth muscle are shown by suppressing extracellular Ca2+ influx and increase of eNOS, NO production and decrease of PDE-5.

Phospholipids from Bombycis corpus and Their Neurotrophic Effects

  • Yeon Jung;Kwon, Hak-Cheol;Cho, Se-Yeon;Cho, Ock-Ryun;Yang, Min-Cheol;Kim, Sun-Yeou;Lee, Kang-Ro
    • Proceedings of the Korean Society of Sericultural Science Conference
    • /
    • 2003.10a
    • /
    • pp.58-65
    • /
    • 2003
  • This study was carried out to investigate active constituents of Bombysis corpus on the neurite outgrowth from PCl2 cells led to isolate three phospholipids (4 6) and three aromatic amines (13) were obtained from the methanol extract of Bombycis corpus. Based on spectral data, their structures have been elucidated as nicotiamide (1), cytidine (2), adenine (3), 1-O-(9Z-octadecenoyl)-2-O-(8Z, 11Z-octadecadienoyl)-sn-glycero-3-phosphorylcholine(4), 1, 2-di-O-hexadecanoyl-sn-glycero-3-phosphorylcholine(5) and 1, 2-di-O-9Z-octadecenoyl-sn-glycero-3-phosphorylcholine(6). (omitted)

  • PDF

Semantic Prosody and Meaning Equivalence: Is Korean pin konggan Equivalent to ‘Empty Space’ or ‘Blank Space’\ulcorner (의미운률과 의미 등가성: ‘빈 공간’은 ‘empty space’인가 ‘blank space’인가\ulcorner)

  • 조의연
    • Korean Journal of English Language and Linguistics
    • /
    • v.3 no.4
    • /
    • pp.589-609
    • /
    • 2003
  • The purpose of this paper is to show that lexical equivalency in translation can be achieved when it is based on semantic prosodies of lexical items. This paper examines the semantic prosodies of two seemingly synonymous English adjectives ‘empty’ and ‘blank’ on the basis of the corpus given in Cobuild English Collocations on CD-ROM and proposes that they are different in terms of spatial dimensions. Thus when a Korean equivalent pin derived from the verb pita is translated into English, syntagmatic phraseological environments of the Korean adjective must be taken into account to attain the equivalency of the source and target languages. Relevant Korean corpus was taken from the 21st Century Sejong Plan (2002). Out of 12 examples of pin konggan, five appear to be equivalent to ‘blank’ and seven to ‘empty.’ The five to seven ratio in different usage indicates that the equivalency problem concerning the lexical item pin is not a trivial matter in translation.

  • PDF

Data Mining Research on Maehwado Painting Poetry in the Early Joseon Dynasty

  • Haeyoung Park;Younghoon An
    • Journal of Information Processing Systems
    • /
    • v.19 no.4
    • /
    • pp.474-482
    • /
    • 2023
  • Data mining is a technique for extracting valuable information from vast amounts of data by analyzing statistical and mathematical operations, rules, and relationships. In this study, we employed data mining technology to analyze the data concerning the painting poetry of Maehwado (plum blossom paintings) from the early Joseon Dynasty. The data was extracted from the Hanguk Munjip Chonggan (Korean Literary Collections in Classical Chinese) in the Hanguk Gojeon Jonghap database (Korea Classics DB). Using computer information processing techniques, we carried out web scraping and classification of the painting poetry from the Hanguk Munjip Chonggan. Subsequently, we narrowed down our focus to the painting poetry specifically related to Maehwado in the early Joseon Dynasty. Based on this, refined dataset, we conducted an in-depth analysis and interpretation of the text data at the syllable corpus level. As a result, we found a direct correlation between the corpus statistics for each syllable in Maehwado painting poetry and the symbolic meaning of plum blossoms.