• Title/Summary/Keyword: Linguistic Features

An Experimental Study on the Relation Extraction from Biomedical Abstracts using Machine Learning (기계 학습을 이용한 바이오 분야 학술 문헌에서의 관계 추출에 대한 실험적 연구)

  • Choi, Sung-Pil
    • Journal of the Korean Society for Library and Information Science / v.50 no.2 / pp.309-336 / 2016
  • This paper introduces a relation extraction system that identifies and classifies semantic relations between biomedical entities in scientific texts using machine learning methods such as Support Vector Machines (SVM). The system provides functions that extract various linguistic features from sentences containing a pair of biomedical entities and use them to train relation extraction models for maximum performance. Experiments on three globally representative biomedical collections demonstrate its superiority across biomedical domains. The intensive experimental study conducted in this paper is therefore likely to provide a meaningful foundation for research on machine-learning-based bio-text analysis.
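As a rough illustration of what "linguistic features from sentences containing a pair of biomedical entities" can look like, the sketch below builds a sparse feature dictionary from one sentence. The feature set (tokens between the entities, their distance, neighbouring tokens) and the function name `extract_pair_features` are assumptions made for this example; the abstract does not list the features the paper actually uses. Such dictionaries would typically be vectorized and passed to an SVM trainer.

```python
import re


def extract_pair_features(sentence, entity1, entity2):
    """Return a sparse feature dict for one sentence with two entity mentions.

    Hypothetical helper: assumes each entity is a single token that occurs
    in the sentence (tokens.index raises ValueError otherwise).
    """
    tokens = re.findall(r"\w+", sentence.lower())
    i = tokens.index(entity1.lower())
    j = tokens.index(entity2.lower())
    left, right = min(i, j), max(i, j)

    features = {
        "token_distance": right - left - 1,  # tokens strictly between the pair
        "e1_first": int(i < j),              # 1 if entity1 precedes entity2
    }
    # Bag of words between the two entities.
    for tok in tokens[left + 1:right]:
        features["between=" + tok] = 1
    # One token of context on each side of the pair, when available.
    if left > 0:
        features["before=" + tokens[left - 1]] = 1
    if right < len(tokens) - 1:
        features["after=" + tokens[right + 1]] = 1
    return features


feats = extract_pair_features(
    "BRCA1 interacts with BARD1 in the nucleus.", "BRCA1", "BARD1"
)
```

Here the words between the two mentions ("interacts", "with") become binary features, which is the kind of signal a classifier can use to separate interacting from non-interacting pairs.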

A Study on the Place-Names in Old Maps of Andong-Bu (안동부 고지도의 유형별 수록지명 연구)

  • Bae, Mi-Ae
    • Journal of the Korean Association of Regional Geographers / v.10 no.3 / pp.511-538 / 2004
  • The aim of this paper is to investigate the characteristics of place-names in old maps of Andong-Bu made during the Chosun Dynasty. A place-name is a linguistic expression whose function is to identify a topographic feature. The place-names in old maps of Andong-Bu normally refer to physical, administrative, military, economic, transport, and cultural features. Cultural place-names are especially meaningful in Andong-Bu, which was a center of Confucianism in the Chosun Dynasty. Generic place-names, including cultural ones such as schools, pavilions, and temples, serve as indicators of the regional characteristics of Andong, past and present. Studying place-names through old maps is regarded as a worthwhile attempt at a regional approach in geography.

A Preliminary Study on the Toponymic Data of Joseon Jiji Jaryo : Chungcheongbuk-do (필사본 "조선지지자료(朝鮮地誌資料)" 충청북도편 지명 자료의 시론적 분석)

  • Kim, Sun-Bae
    • Journal of the Korean Association of Regional Geographers / v.19 no.1 / pp.31-44 / 2013
  • The academic value of Joseon jiji jaryo (Geographical Documents of Joseon) lies in its abundant nationwide record of native Korean toponyms alongside the corresponding toponyms in Sino-Korean characters (hanzi 漢字). By analysing these documents, it is possible to examine the linguistic features of toponymic changes, to reason about the geographical sources of place names and administrative districts, and to obtain primary data. This article offers a preliminary analysis of these documents to encourage academic study, which has been insufficient since they were introduced to the academic world. Noting their toponymic value, I reexamined the bibliography and the composition of the contents, focusing on the Chungcheongbuk-do volume. Consequently, I pointed out possible regional differences in compilation time and mistakes in the contents. In addition, I proposed their toponymic value with reference to transcription changes in village toponyms and tendencies in loan transcription.

A Study on Expression Visual of Metamorphosis Transition of Image Animation (영상애니메이션 트랜지션의 메타모포시스 시각 표현에 관한 연구)

  • Joo, Hae-Jeong;Kim, Chee-Yong
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2010.05a / pp.347-350 / 2010
  • With the advent of the new media era, it is no exaggeration to say that we live in a world of moving images. As image-based mass media develop, communicating through images has come to feel like part of everyday life. In this era, sensuous and formative images may be conveyed more readily than logical and realistic thinking. When viewing a moving image, color, screen composition, and the changes produced by formative elements are recognized first, before linguistic and conceptual meaning. This effect is especially strong in video animation with a narrative structure. Scene transitions that link images are likewise expanding beyond a simple linking device to take on a natural communicative function. These transition effects provide visual pleasure beyond that of simple traditional transition methods. In terms of narrative function, work on visual representation that supports narrative flow should continue.

Developing of Text Plagiarism Detection Model using Korean Corpus Data (한글 말뭉치를 이용한 한글 표절 탐색 모델 개발)

  • Ryu, Chang-Keon;Kim, Hyong-Jun;Cho, Hwan-Gue
    • Journal of KIISE:Computing Practices and Letters / v.14 no.2 / pp.231-235 / 2008
  • Recently we have witnessed several plagiarism scandals involving academic papers and novels. Document plagiarism is becoming more frequent and more serious. Although plagiarism in English has been studied for a long time, systematic and complete studies of plagiarism in Korean documents are hard to find. Since the linguistic features of Korean are quite different from those of English, English-based methods cannot be applied directly to Korean documents. In this paper, we propose a new plagiarism detection method for Korean and thoroughly test our algorithm on a benchmark Korean text corpus. The proposed method is based on "k-mer" and "local alignment", which locate the plagiarized regions of document pairs quickly and accurately. Using a Korean corpus of more than 10 million words, we establish a probability model for the local alignment score (random similarity by chance). The experiments show that our system detects plagiarized documents quite successfully.
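The two ingredients named in the abstract, "k-mer" and "local alignment", can be sketched roughly as follows. This is a minimal illustration, not the authors' algorithm: it assumes character-level k-mers, a Smith-Waterman-style alignment, and arbitrary scoring values (`match=2`, `mismatch=-1`, `gap=-1`), none of which are given in the abstract. Because Python strings are Unicode, the same functions work on Hangul text.

```python
def kmers(text, k=3):
    """Set of all length-k character substrings of text (assumed unit: characters)."""
    return {text[i:i + k] for i in range(len(text) - k + 1)}


def shared_kmer_ratio(a, b, k=3):
    """Fraction of k-mers shared by a and b, relative to the smaller k-mer set.

    A cheap filter: only pairs with a high ratio need the costlier alignment.
    """
    ka, kb = kmers(a, k), kmers(b, k)
    if not ka or not kb:
        return 0.0
    return len(ka & kb) / min(len(ka), len(kb))


def local_alignment_score(a, b, match=2, mismatch=-1, gap=-1):
    """Best Smith-Waterman local alignment score between a and b.

    Scoring parameters are illustrative assumptions. Only two rows of the
    dynamic-programming table are kept, so memory is O(len(b)).
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for ca in a:
        cur = [0]
        for j, cb in enumerate(b, 1):
            diag = prev[j - 1] + (match if ca == cb else mismatch)
            # Scores never drop below 0, which is what makes the alignment local.
            cur.append(max(0, diag, prev[j] + gap, cur[j - 1] + gap))
            best = max(best, cur[j])
        prev = cur
    return best
```

A probability model of the kind the abstract mentions would then compare an observed score against the distribution of scores between unrelated document pairs, flagging scores unlikely to arise by chance.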

An Error Analysis on Business E-mails in English : A Case-Study (비지니스 이메일 영작문에 나타난 오류분석: 사례연구)

  • Hwang, Seon-Yoo
    • Journal of Convergence for Information Technology / v.8 no.6 / pp.273-279 / 2018
  • This study aimed to provide a comprehensive account of the sources and causes of errors in business emails that Korean college students wrote with the help of machine translation. Data were collected from 21 emails written by students taking a business English course. The findings indicated that the students made frequent errors in verb use and verb tense, as well as in definite articles, countable/uncountable nouns, time adverbs, and prepositions. These common errors suggest that the students have difficulty learning these linguistic features. Given that learners' errors can give valuable insight into teaching and learning how to write in English, pedagogical suggestions are put forward based on the results.

A Study on Phenomenon 'Play of Words' in Modern Russian Advertising Language (현대 러시아 광고언어에 있어서의 '언어유희' 현상에 대한 연구)

  • Kim, Sung Wan
    • Cross-Cultural Studies / v.42 / pp.241-260 / 2016
  • The purpose of this article is to present the types of 'Play of Words' (игра слов) found in modern Russian advertising language. The cause of this phenomenon is traced to certain characteristics of advertising. To this end, the definition and characteristics of advertising language are analyzed, as these factors reveal how language is used to maximize the effectiveness of advertising. Academic research is needed across the collaborating fields of linguistics, psychology, economics, sociology, marketing, literature, art, and music. Modern advertising combines semiotic objects consisting of display, sound, and text. While this study is not complete, the recognition of 'Play of Words' shared between the creators of advertising and consumers is undeniable. At the same time, linguists regard advertising as the main factor that destroys the literary language. It represents a distortion of the standard language norm, as opposed to the formal linguistic means used in advertising. In this research, we pay attention to the frequent use of foreign borrowings, the incorrect representation of foreign words, and slang and jargon, which appear as usage that departs from literary norms. The features revealed in this article help in understanding the purpose of advertising.

A Corpus Analysis of British-American Children's Adventure Novels: Treasure Island (영미 아동 모험 소설에 관한 코퍼스 분석 연구: 『보물섬』을 중심으로)

  • Choi, Eunsaem;Jung, Chae Kwan
    • The Journal of the Korea Contents Association / v.21 no.1 / pp.333-342 / 2021
  • In this study, we analyzed the vocabulary, lemmas, keywords, and n-grams in 『Treasure Island』 to identify linguistic features of this British-American children's adventure novel. Contrary to the popular claim that frequently used words are important and essential to a story, we found that the frequently used words in 『Treasure Island』 were mostly function words and proper nouns not directly related to its plot. We also found that a keyword list produced statistically by a corpus program was not sufficient for surmising the story of 『Treasure Island』. However, we managed to extract 30 keywords through a first, quantitative keyword analysis followed by a second, qualitative one. We also carried out a series of n-gram analyses and discovered lexical bundles that the author of 『Treasure Island』 preferred and used frequently. We hope that the results of this study will help spread this knowledge in the field of British-American children's literature and further advance corpus stylistic theory.
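The word-frequency and n-gram counts described above can be reproduced in spirit with the standard library. The sketch below is an assumption-laden toy: the tokenizer, the helper names `tokenize` and `ngram_counts`, and the sample sentence (invented here, not taken from the novel) are all illustrative, and the study itself used a dedicated corpus program.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def ngram_counts(tokens, n):
    """Count every contiguous n-token sequence (candidate lexical bundles)."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


# Toy sentence made up for this example.
tokens = tokenize("The sea and the ship and the sea")
word_freq = Counter(tokens)
bigrams = ngram_counts(tokens, 2)

top_word, top_count = word_freq.most_common(1)[0]
```

Even in this tiny sample the most frequent token is the function word "the", which mirrors the study's observation that raw frequency alone says little about plot.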

Semantic Role Labeling using Biaffine Average Attention Model (Biaffine Average Attention 모델을 이용한 의미역 결정)

  • Nam, Chung-Hyeon;Jang, Kyung-Sik
    • Journal of the Korea Institute of Information and Communication Engineering / v.26 no.5 / pp.662-667 / 2022
  • Semantic role labeling (SRL) is the task of extracting a predicate and its arguments, such as agent, patient, place, and time. Previous SRL studies proposed pipeline methods that extract linguistic features of the sentence, but in such methods the errors of each extraction step in the pipeline degrade semantic role labeling performance. Methods using end-to-end neural network models have therefore been proposed recently. In this paper, we propose a neural network model that uses the Biaffine Average Attention model for the SRL task. Unlike the LSTM models proposed in previous studies, which use surrounding information to predict a specific token, the proposed model has a structure that can attend to information from the entire sentence regardless of the distance between the predicate and its arguments. For evaluation, we compared the proposed model with two BERT-based models from existing studies using F1 scores and found that its score of 76.21% was higher than those of the comparison models.
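For readers unfamiliar with the metric, the F1 score used in the evaluation is the harmonic mean of precision and recall. The sketch below computes it over sets of labeled arguments; representing each argument as a `(predicate, argument, role)` tuple and the toy gold and predicted sets are assumptions for illustration, not the paper's evaluation script.

```python
def f1_score(gold, predicted):
    """Harmonic mean of precision and recall over two collections of labels."""
    gold, predicted = set(gold), set(predicted)
    true_positives = len(gold & predicted)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical annotations: (predicate, argument, role).
gold = {("gave", "Mary", "agent"), ("gave", "book", "patient"), ("gave", "library", "place")}
pred = {("gave", "Mary", "agent"), ("gave", "book", "patient"), ("gave", "library", "time")}

score = f1_score(gold, pred)  # 2 of 3 predictions correct, 2 of 3 gold items found
```

An argument counts as correct only when its role label also matches, so the mislabeled "library" lowers both precision and recall.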

Deep Learning-based Korean Dialect Machine Translation Research Considering Linguistics Features and Service (언어적 특성과 서비스를 고려한 딥러닝 기반 한국어 방언 기계번역 연구)

  • Lim, Sangbeom;Park, Chanjun;Yang, Yeongwook
    • Journal of the Korea Convergence Society / v.13 no.2 / pp.21-29 / 2022
  • Recognizing the importance of dialect research, preservation, and communication, this paper studies machine translation of Korean dialects for dialect users who may otherwise be marginalized. The dialect data are the AIHUB dialect data, which are distributed by highest-level administrative district. We propose a many-to-one dialect machine translation approach that makes model distribution more efficient, and we conduct modeling research that applies a Copy mechanism to improve dialect machine translation performance. This paper evaluates the one-to-one and many-to-one models using BLEU scores and analyzes the many-to-one model's performance on Korean dialects from a linguistic perspective. The proposed methodology improved the performance of one-to-one machine translation, and the many-to-one machine translation achieved significantly high performance.
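BLEU, the metric used to compare the one-to-one and many-to-one models, combines clipped n-gram precisions with a brevity penalty. The sketch below is a simplified sentence-level version with a single reference and no smoothing, written only to show the mechanics; the function names and the English toy sentences are assumptions, and published scores are normally computed at corpus level with an established toolkit.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count contiguous n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu(candidate, reference, max_n=4):
    """Simplified single-reference, unsmoothed sentence BLEU in [0, 1]."""
    precisions = []
    for n in range(1, max_n + 1):
        cand, ref = ngrams(candidate, n), ngrams(reference, n)
        total = sum(cand.values())
        # Clip each candidate n-gram count by its count in the reference.
        clipped = sum(min(count, ref[gram]) for gram, count in cand.items())
        if total == 0 or clipped == 0:
            return 0.0  # without smoothing, one empty precision zeroes the score
        precisions.append(clipped / total)
    # Brevity penalty discourages candidates shorter than the reference.
    if len(candidate) >= len(reference):
        bp = 1.0
    else:
        bp = math.exp(1 - len(reference) / len(candidate))
    return bp * math.exp(sum(math.log(p) for p in precisions) / max_n)


reference = "the cat sat on the mat".split()
candidate = "the cat sat on mat".split()
score = bleu(candidate, reference, max_n=2)
```

In this example every candidate unigram appears in the reference, three of the four bigrams match, and the candidate is one token short, so the score is the brevity penalty times the geometric mean of 1 and 0.75.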