• Title/Abstract/Keywords: Corpus-based Study

Search results: 204 items; processing time: 0.029 seconds

Non-Discourse Marker Uses of So in EFL Writings: Functional Variability among Asian Learners

  • Sato, Shie
    • 아시아태평양코퍼스연구 / Vol. 1, No. 2 / pp.27-39 / 2020
  • This paper examines the frequency and distribution of the so-called "non-discourse marker functions" of so in essays written by 200 L1 English speakers and 1,300 EFL learners in China, Japan, Korea, and Taiwan. Based on data drawn from the International Corpus Network of Asian Learners of English, this study compares EFL learners' and L1 English speakers' uses of so, identifying four grammatical uses: as (1) an adverb, (2) part of a fixed phrase, (3) a pro-form, and (4) a conjunction phrase specifying purpose. This study aims to show the wide variability among EFL learners with different L1s, identifying usage tendencies both common among and specific to the sub-groups of EFL learners. The findings suggest that the learners demonstrate patterns distinctively different from those of L1 English speakers, indicating an underuse of so as a marker expressing "purpose" and an overuse as part of fixed phrases. Compared to L1 English speakers, the learners also tend to overuse so in the discourse marker functions, regardless of their L1s. The study proposes pedagogical implications focusing on discourse flow and diachronic aspects of so in order to understand its multifunctionality, although the latter is primarily suggested for advanced learners.

딥러닝 및 토픽모델링 기법을 활용한 소셜 미디어의 자살 경향 문헌 판별 및 분석 (Examining Suicide Tendency Social Media Texts by Deep Learning and Topic Modeling Techniques)

  • 고영수;이주희;송민
    • 한국비블리아학회지 / Vol. 32, No. 3 / pp.247-264 / 2021
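  • Suicide is the fourth leading cause of death worldwide and a difficult problem that causes great social and economic losses. To help prevent suicide, this study sought to build a corpus of suicide-related texts from social media and use it to create a deep learning model that automatically classifies texts showing suicidal tendencies. In addition, to analyze the factors behind suicide, it sought to divide the suicide-related corpus into detailed topics using topic modeling, an analysis technique that extracts topics automatically. To this end, 2,011 suicide-related documents were collected from Naver Knowledge iN (지식iN), a social media service, and annotated as showing or not showing suicidal tendencies according to a suicide prevention education manual. Deep learning models (LSTM, BERT, ELECTRA) were then trained on this data to build automatic classifiers. Furthermore, the documents were classified by topic using LDA, a topic modeling technique, to identify suicide factors, and co-occurrence word analysis and network visualization were carried out for each topic to analyze these factors in depth.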

ToBI and beyond: Phonetic intonation of Seoul Korean ani in Korean Intonation Corpus (KICo)

  • Ji-eun Kim
    • 말소리와 음성과학 / Vol. 16, No. 1 / pp.1-9 / 2024
  • This study investigated the variation in the intonation of Seoul Korean interjection ani across different meanings ("no" and "really?") and speech levels (Intimate and Polite) using data from Korean Intonation Corpus (KICo). The investigation was conducted in two stages. First, IP-final tones in the dataset were categorized according to the K-ToBI convention (Jun, 2000). While significant relationships were observed between the meaning of ani and its IP-final tones, substantial overlap between groups was notable. Second, the F0 characteristics of the final syllable of ani were analyzed to elucidate the apparent many-to-many relationships between intonation and meaning/speech level. Results indicated that these seemingly overlapping relationships could be significantly distinguished. Overall, this study advocates for a deeper analysis of phonetic intonation beyond ToBI-based categorical labels. By examining the F0 characteristics of the IP-final syllable, previously unclear connections between meaning/speech level and intonation become more comprehensible. Although ToBI remains a valuable tool and framework for studying intonation, it is imperative to explore beyond these categories to grasp the "distinctiveness" of intonation, thereby enriching our understanding of prosody.

Formulaic Language Development in Asian Learners of English: A Comparative Study of Phrase-frames in Written and Oral Production

  • Yoon Namkung;Ute Romer
    • 아시아태평양코퍼스연구 / Vol. 4, No. 2 / pp.1-39 / 2023
  • Recent research in usage-based Second Language Acquisition has provided new insights into second language (L2) learners' development of formulaic language (Wulff, 2019). The current study examines the use of phrase-frames, which are recurring sequences of words including one or more variable slots (e.g., it is * that), in written and oral production data from Asian learners of English across four proficiency levels (beginner, low-intermediate, high-intermediate, advanced) and native English speakers. The variability, predictability, and discourse functions of the most frequent 4-word phrase-frames from the written essay and spoken dialogue sub-corpora of the International Corpus Network of Asian Learners of English (ICNALE) were analyzed and then compared across groups and modes. The results revealed that while learners' phrase-frames in writing became more variable and unpredictable as proficiency increased, no clear developmental patterns were found in speaking, although all groups used more fixed and predictable phrase-frames than the reference group. Further, no developmental trajectories in the functions of the most frequent phrase-frames were found in either mode. Additionally, lower-level learners and the reference group used more variable phrase-frames in speaking, whereas advanced-level learners showed more variability in writing. This study contributes to a better understanding of the development of L2 phraseological competence.

코퍼스 방식 음성합성에서의 개선된 운율구 경계 예측 (AP, IP Prediction For Corpus-based Korean Text-To-Speech)

  • 권오일;홍문기;강선미;신지영
    • 음성과학 / Vol. 9, No. 3 / pp.25-34 / 2002
  • One of the most important factors in the performance of a Korean text-to-speech system is the prediction of accentual phrase (AP) and intonational phrase (IP) boundaries. Previous prediction methods reach only 75-85%, which is insufficient for practical and commercial systems. More accurate prediction is therefore needed in practical systems. In this study, we propose a simple and more accurate method for predicting AP and IP boundaries.

담화표지 '아', '어', '음'의 성별과 연령별 사용 양상 (The pattern of use by gender and age of the discourse markers 'a', 'eo', and 'eum')

  • 송영숙;심지수;오재혁
    • 말소리와 음성과학 / Vol. 12, No. 4 / pp.37-45 / 2020
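  • This study quantitatively examined the frequency, duration, and utterance position of the discourse markers 'a', 'eo', and 'eum' in order to show differences by gender and age. To this end, the Seoul Corpus, a large-scale speech corpus, was used. Durations and actual utterances were checked with Praat (ver. 6.1.31), the corpus was analyzed with EmEditor (ver. 17.6.1), and statistical analysis was carried out in R (ver. 3.4.4). By gender, women used 'eum' as a stand-alone utterance more frequently than men, and its mean duration in utterance-final position was also longer. By age, in utterance-initial position, 'a' was characteristically frequent among teenagers and 'eo' among speakers in their forties.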

五子腎氣丸이 음경해면체 평활근의 수축에 미치는 영향 (Effects of Oja-Shingiwhan in Contracted Corpus Cavernosum Smooth Muscle)

  • 박정수;안상현;박선영
    • 동의생리병리학회지 / Vol. 30, No. 5 / pp.308-313 / 2016
  • The purpose of this study is to investigate the effects of Oja-Shingiwhan (OS) on contracted corpus cavernosum smooth muscle and the underlying mechanism. To evaluate the relaxant effect of OS on contracted corpus cavernosum, OS was applied to strips precontracted with phenylephrine (PE). To examine the mechanism, OS was applied to corporal strips contracted by PE after pretreatment with Nω-nitro-L-arginine (L-NNA), and the response was compared with that obtained without L-NNA pretreatment. In calcium chloride (Ca2+)-free Krebs solution, 1 mM Ca2+ was added to corporal strips contracted by PE after pretreatment with OS, and the response was compared with that obtained without OS pretreatment. action were measured by histochemical and immunohistochemical methods. OS significantly relaxed the corporal strips, and this relaxation was inhibited by pretreatment with L-NNA. Contractions induced by Ca2+ influx in Ca2+-free Krebs solution were inhibited by pretreatment with OS. OS increased the eNOS-positive reaction in corpus cavernosum but decreased the PDE-5-positive reaction. These results suggest that the effect of OS on contracted corpus cavernosum smooth muscle is mediated by suppression of extracellular Ca2+ influx, increased eNOS and NO production, and decreased PDE-5.

Phospholipids from Bombycis corpus and Their Neurotrophic Effects

  • Yeon Jung;Kwon, Hak-Cheol;Cho, Se-Yeon;Cho, Ock-Ryun;Yang, Min-Cheol;Kim, Sun-Yeou;Lee, Kang-Ro
    • 한국잠사학회: Conference Proceedings / 한국잠사학회 2003 International Symposium of Silkworm/Insect Biotechnology and Annual Meeting of Korea Society of Sericultural Science / pp.58-65 / 2003
  • An investigation of the active constituents of Bombycis corpus affecting neurite outgrowth from PC12 cells led to the isolation of three phospholipids (4-6) and three aromatic amines (1-3) from the methanol extract of Bombycis corpus. Based on spectral data, their structures have been elucidated as nicotinamide (1), cytidine (2), adenine (3), 1-O-(9Z-octadecenoyl)-2-O-(8Z,11Z-octadecadienoyl)-sn-glycero-3-phosphorylcholine (4), 1,2-di-O-hexadecanoyl-sn-glycero-3-phosphorylcholine (5) and 1,2-di-O-9Z-octadecenoyl-sn-glycero-3-phosphorylcholine (6). (omitted)

의미운률과 의미 등가성: ‘빈 공간’은 ‘empty space’인가 ‘blank space’인가? (Semantic Prosody and Meaning Equivalence: Is Korean pin konggan Equivalent to ‘Empty Space’ or ‘Blank Space’?)

  • 조의연
    • 한국영어학회지:영어학 / Vol. 3, No. 4 / pp.589-609 / 2003
  • The purpose of this paper is to show that lexical equivalency in translation can be achieved when it is based on the semantic prosodies of lexical items. This paper examines the semantic prosodies of two seemingly synonymous English adjectives, ‘empty’ and ‘blank’, on the basis of the corpus given in Cobuild English Collocations on CD-ROM and proposes that they differ in terms of spatial dimensions. Thus, when the Korean equivalent pin, derived from the verb pita, is translated into English, the syntagmatic phraseological environments of the Korean adjective must be taken into account to attain equivalency between the source and target languages. The relevant Korean corpus data were taken from the 21st Century Sejong Plan (2002). Out of 12 examples of pin konggan, five appear to be equivalent to ‘blank’ and seven to ‘empty.’ This five-to-seven split in usage indicates that the equivalency problem concerning the lexical item pin is not a trivial matter in translation.

Data Mining Research on Maehwado Painting Poetry in the Early Joseon Dynasty

  • Haeyoung Park;Younghoon An
    • Journal of Information Processing Systems / Vol. 19, No. 4 / pp.474-482 / 2023
  • Data mining is a technique for extracting valuable information from vast amounts of data by analyzing statistical and mathematical operations, rules, and relationships. In this study, we employed data mining technology to analyze data on the painting poetry of Maehwado (plum blossom paintings) from the early Joseon Dynasty. The data was extracted from the Hanguk Munjip Chonggan (Korean Literary Collections in Classical Chinese) in the Hanguk Gojeon Jonghap database (Korea Classics DB). Using computer information processing techniques, we carried out web scraping and classification of the painting poetry from the Hanguk Munjip Chonggan. Subsequently, we narrowed our focus to the painting poetry specifically related to Maehwado in the early Joseon Dynasty. Based on this refined dataset, we conducted an in-depth analysis and interpretation of the text data at the syllable corpus level. As a result, we found a direct correlation between the corpus statistics for each syllable in Maehwado painting poetry and the symbolic meaning of plum blossoms.