• Title/Summary/Keyword: 코퍼스

Search Result 487, Processing Time 0.028 seconds

A Cognitive Study on Jeong in Korean and Amae in Japanese (한국의 '정(情)'과 일본의 '아마에(甘え)'에 대한 인지적 고찰)

  • Kim, Myung-Hee
    • Cross-Cultural Studies
    • /
    • v.27
    • /
    • pp.471-496
    • /
    • 2012
  • Since Takeo Toi's discussion in The Anatomy of Dependence (1973), the concept of amae has generated considerable interest in the fields of psychology, anthropology, communication and other fields. However, there have been few agreements on the definition and functions of amae (Behrens, 2004). Furthermore, it has rarely drawn interest from linguists in general and has rarely been discussed in the context of social interaction. This study aims to examine the concept of amae in Japanese and the corresponding phenomena in Korean, and attempts to explore the similarities and differences between them. The prototype of the amae relationship is the mother-infant relationship (Doi,1973). Even an adult can assume the role of a baby, showing his or her need for dependence on others and expecting to be accepted. It turns out that amae-like phenomena frequently occur in everyday life in Korea as well (Lee, 1982). There is no single term for the concept, but it can be translated in many different ways in Korean, for example, aykyo, ayang, ungsek, erikwang, etc. It can have either positive or negative connotations depending on the situation. It seems that the psychological system that causes dependent behaviors such as ungsek in Korea is cheong, one of the key terms characterizing the Korean culture. Cheong, like amae, starts in the mother-infant relationship extending to familial and to other relationships. A corpus-based analysis shows that cheong is conceptualized as gluing people together, growing over time, and also potentially fatal because it assumes illogical, uncalculated, and personal relationships. In conclusion, unlike some Japanese scholars' claim that amae is the Japanese concept that exists only in Japan, it seems that similar phenomena do exist in Korea as well, but that Korean has no single term corresponding to amae in Japanese. It seems that cheong can be posited in Korean as the inner emotional system that makes amae-like behaviors possible, and that there is not single term, but many, designating the behaviors, possibly because the behaviors have a negative connotation and are not encouraged in Korean society.

A Genre Analysis of Newspaper Articles for Korean Language Education -Based on the linguistic analysis of newspaper articles and reading materials in Korean language textbooks- (한국어 읽기 교육을 위한 기사문 장르분석 -신문기사 및 교재 기사문의 언어학적 분석을 바탕으로-)

  • Lee, Seungyeon;Sim, Jiyeon;Shin, Jungha
    • Journal of Korean language education
    • /
    • v.28 no.3
    • /
    • pp.53-83
    • /
    • 2017
  • The goal of this study is to examine whether the genre characteristics of newspaper articles are appropriately reflected in Korean language textbooks. For the purpose of this study, two corpora were built with 17 textbook articles and 60 newspaper articles respectively. The average sentence length and frequency of vocabulary in each corpus were measured. It was found that the sentences of articles in textbooks tended to have longer sentence length and more complicated structures than the articles in newspapers. For instance, sentences in the textbook articles had more verbal endings, such as conjunctive and transforming endings. On the other hand, in case of vocabulary representing 'timeliness', there was a high frequency of adverbs and nouns which were related to year, month, and time in actual articles, while it is found to be very limited in textbooks. Also, typical translative styles such as '-ko itta', '-e ttareumyun' were more prominent in textbooks than in newspaper articles. In the case of abbreviated and omitted form of particles, this was a characteristic that appeared only in actual articles because of the constraint of space. It is significant that this paper offers suggestions for the development of reading materials for Korean language education by revealing that the genre typology of actual newspaper articles is not adequately reflected in current textbooks.

Keyword Network Analysis on Global Research Trend in Design (1999~2018) (글로벌 디자인 연구동향에 대한 키워드 네트워크 분석 연구 (1999~2018))

  • Choi, Chool-Heon;Jang, Phill-Sik
    • Journal of Convergence for Information Technology
    • /
    • v.9 no.2
    • /
    • pp.7-16
    • /
    • 2019
  • The purpose of this study is to identify the characteristics of researches that have been conducted for the last 20 years through analyzing global research trends and evolutions of design articles from 1999 to 2018 with keyword network analysis. For this purpose, we selected 3,569 articles in 22 journals related to design research retrieved from the Scopus database and constructed keyword network model through the author keyword and index keyword. The frequency of the author and index keyword, the centrality of betweenness and degree were analyzed with the keyword network. The results show that design has been applied to various fields for recent 20 years, and the research trends of design could be quantitatively characterized by keyword network analysis. The result of this study could be used to suggest future research topics in the field of design based on quantitative and empirical data.

Network Analysis between Uncertainty Words based on Word2Vec and WordNet (Word2Vec과 WordNet 기반 불확실성 단어 간의 네트워크 분석에 관한 연구)

  • Heo, Go Eun
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.53 no.3
    • /
    • pp.247-271
    • /
    • 2019
  • Uncertainty in scientific knowledge means an uncertain state where propositions are neither true or false at present. The existing studies have analyzed the propositions written in the academic literature, and have conducted the performance evaluation based on the rule based and machine learning based approaches by using the corpus. Although they recognized that the importance of word construction, there are insufficient attempts to expand the word by analyzing the meaning of uncertainty words. On the other hand, studies for analyzing the structure of networks by using bibliometrics and text mining techniques are widely used as methods for understanding intellectual structure and relationship in various disciplines. Therefore, in this study, semantic relations were analyzed by applying Word2Vec to existing uncertainty words. In addition, WordNet, which is an English vocabulary database and thesaurus, was applied to perform a network analysis based on hypernyms, hyponyms, and synonyms relations linked to uncertainty words. The semantic and lexical relationships of uncertainty words were structurally identified. As a result, we identified the possibility of automatically expanding uncertainty words.

The Stream of Uncertainty in Scientific Knowledge using Topic Modeling (토픽 모델링 기반 과학적 지식의 불확실성의 흐름에 관한 연구)

  • Heo, Go Eun
    • Journal of the Korean Society for information Management
    • /
    • v.36 no.1
    • /
    • pp.191-213
    • /
    • 2019
  • The process of obtaining scientific knowledge is conducted through research. Researchers deal with the uncertainty of science and establish certainty of scientific knowledge. In other words, in order to obtain scientific knowledge, uncertainty is an essential step that must be performed. The existing studies were predominantly performed through a hedging study of linguistic approaches and constructed corpus with uncertainty word manually in computational linguistics. They have only been able to identify characteristics of uncertainty in a particular research field based on the simple frequency. Therefore, in this study, we examine pattern of scientific knowledge based on uncertainty word according to the passage of time in biomedical literature where biomedical claims in sentences play an important role. For this purpose, biomedical propositions are analyzed based on semantic predications provided by UMLS and DMR topic modeling which is useful method to identify patterns in disciplines is applied to understand the trend of entity based topic with uncertainty. As time goes by, the development of research has been confirmed that uncertainty in scientific knowledge is moving toward a decreasing pattern.

The Relationship between English Proficiency and Syntactic Complexity for Korean College Students (한국 대학생의 에세이에 나타난 영어 능력 수준과 통사적 복잡성 간의 관계 탐색)

  • Lee, Young-Ju
    • The Journal of the Convergence on Culture Technology
    • /
    • v.7 no.3
    • /
    • pp.439-444
    • /
    • 2021
  • This study investigates the relationship between syntactic complexity and English proficiency for Korean college students, using the recently developed TAASSC(the Tool for the Automatic Analysis of Syntactic Sophistication and Complexity) program. Essays on the ICNALE(International Corpus Network of Asian Learners of English) corpus were employed and phrasal complexity indices and clausal complexity indices, respectively were used to predict English proficiency level for Korean students. Results of stepwise regression analysis showed that indices of phrasal complexity explained 8% of variance in English proficiency, while indices of clausal complexity accounted for approximately 11%. That is, indices of clausal complexity were slightly better predictors of English proficiency than indices of phrasal complexity, which contradicts Biber et at.(2011)'s claim that phrasal complexity is the hallmark of writing development.

The Relationship between Lexical Sophistication Features and English Proficiency for Korean College Students using TAALES Program (TAALES 프로그램을 활용하여 한국 대학생이 작성한 에세이에 나타난 어휘의 정교화 특성 비교)

  • Lee, Young-Ju
    • The Journal of the Convergence on Culture Technology
    • /
    • v.7 no.3
    • /
    • pp.433-438
    • /
    • 2021
  • This study investigates the relationship between lexical sophistication features and English proficiency for Korean college students. Essays from the ICNALE(International Corpus Network of Asian Learners of English) corpus were analyzed, using TAALES program. In order to examine whether or not there are statistically significant differences in lexical sophistication features across three groups, MANOVA was conducted. Results showed that the lexical sophistication features were significantly affected by English proficiency level. Essays written by Korean students with different English proficiency levels can be differentiated in terms of various lexical sophistication features including content words frequency, content words familiarity, lexical decision mean reaction time function words, hypernymy verbs, word naming response time function words, age of acquisition content words.

KorPatELECTRA : A Pre-trained Language Model for Korean Patent Literature to improve performance in the field of natural language processing(Korean Patent ELECTRA)

  • Jang, Ji-Mo;Min, Jae-Ok;Noh, Han-Sung
    • Journal of the Korea Society of Computer and Information
    • /
    • v.27 no.2
    • /
    • pp.15-23
    • /
    • 2022
  • In the field of patents, as NLP(Natural Language Processing) is a challenging task due to the linguistic specificity of patent literature, there is an urgent need to research a language model optimized for Korean patent literature. Recently, in the field of NLP, there have been continuous attempts to establish a pre-trained language model for specific domains to improve performance in various tasks of related fields. Among them, ELECTRA is a pre-trained language model by Google using a new method called RTD(Replaced Token Detection), after BERT, for increasing training efficiency. The purpose of this paper is to propose KorPatELECTRA pre-trained on a large amount of Korean patent literature data. In addition, optimal pre-training was conducted by preprocessing the training corpus according to the characteristics of the patent literature and applying patent vocabulary and tokenizer. In order to confirm the performance, KorPatELECTRA was tested for NER(Named Entity Recognition), MRC(Machine Reading Comprehension), and patent classification tasks using actual patent data, and the most excellent performance was verified in all the three tasks compared to comparative general-purpose language models.

Comparison of Classification Performance Between Adult and Elderly Using Acoustic and Linguistic Features from Spontaneous Speech (자유대화의 음향적 특징 및 언어적 특징 기반의 성인과 노인 분류 성능 비교)

  • SeungHoon Han;Byung Ok Kang;Sunghee Dong
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.12 no.8
    • /
    • pp.365-370
    • /
    • 2023
  • This paper aims to compare the performance of speech data classification into two groups, adult and elderly, based on the acoustic and linguistic characteristics that change due to aging, such as changes in respiratory patterns, phonation, pitch, frequency, and language expression ability. For acoustic features we used attributes related to the frequency, amplitude, and spectrum of speech voices. As for linguistic features, we extracted hidden state vector representations containing contextual information from the transcription of speech utterances using KoBERT, a Korean pre-trained language model that has shown excellent performance in natural language processing tasks. The classification performance of each model trained based on acoustic and linguistic features was evaluated, and the F1 scores of each model for the two classes, adult and elderly, were examined after address the class imbalance problem by down-sampling. The experimental results showed that using linguistic features provided better performance for classifying adult and elderly than using acoustic features, and even when the class proportions were equal, the classification performance for adult was higher than that for elderly.

Pronunciation of the Korean diphthong /jo/: Phonetic realizations and acoustic properties (한국어 /ㅛ/의 발음 양상 연구: 발음형 빈도와 음향적 특징을 중심으로)

  • Hyangwon Lee
    • Phonetics and Speech Sciences
    • /
    • v.15 no.1
    • /
    • pp.9-17
    • /
    • 2023
  • The purpose of this study is to determine how the Korean diphthong /jo/ shows phonetic variation in various linguistic environments. The pronunciation of /jo/ is discussed, focusing on the relationship between phonetic variation and the distribution range of vowels. The location in a word (monosyllable, word-initial, word-medial, word-final) and word class (content word, function word) were analyzed using the speech of 10 female speakers of the Seoul Corpus. As a result of determining the frequency of appearance of /jo/ in each environment, the pronunciation type and word class were affected by the location in a word. Frequent phonetic reduction was observed in the function word /jo/ in the acoustic analysis. The word class did not change the average phonetic values of /jo/, but changed the distribution of individual tokens. These results indicate that the linguistic environment affects the phonetic distribution of vowels.