• Title/Summary/Keyword: 어휘 분석


Development of C++ Compiler and Programming Environment (C++컴파일러 및 프로그래밍 환경 개발)

  • Jang, Cheon-Hyeon;O, Se-Man
    • The Transactions of the Korea Information Processing Society / v.4 no.3 / pp.831-845 / 1997
  • In this paper, we propose and develop a compiler and an interactive programming environment for C++, one of the most noteworthy object-oriented languages. To develop the C++ compiler, we adopted a front-end/back-end model using the EM virtual machine. In developing the front end, we formalized the C++ grammar with context-sensitive tokens, which must be handled by the lexical scanner, and designed an AST class library consisting of a hierarchy of AST node classes with well-defined interfaces among them. In developing the back end, we proposed a model for three major components: the code optimizer, the code generator, and the run-time environment. We emphasized a retargetable back end that can be systematically reconfigured to generate code for a variety of distinct target computers. We also developed a tree pattern matching algorithm and implemented a target code generator that produces SPARC code. We further proposed the theory and a model for constructing interactive programming environments. To represent language features, we adopt the AST as the internal representation and propose an incremental analysis algorithm and visual diagrams. We also studied unparsing schemes, visual diagrams, and graphical user interfaces in order to generate interactive environments automatically. The results of our research will be very useful for developing compilers and programming environments, and can also be used in compilers for parallel and distributed environments.
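One common reading of "context-sensitive tokens" in C++ is that the scanner cannot classify an identifier without knowing whether it has been declared as a type. The sketch below illustrates that general idea only; it is not the scanner described in the paper. The token pattern, the `KEYWORDS` set, and the `scan` function are hypothetical names introduced for illustration, and registering a type name immediately after `class` is a simplifying assumption.

```python
import re

# Hypothetical token pattern: identifiers, integers, or one punctuation character.
TOKEN_RE = re.compile(r"\s*(?:(?P<ident>[A-Za-z_]\w*)|(?P<number>\d+)|(?P<punct>.))")

# A tiny, illustrative keyword set (not the full C++ keyword list).
KEYWORDS = {"class", "int", "return"}


def scan(source, type_names=None):
    """Return (kind, text) tokens; declared class names are tagged TYPE_NAME."""
    type_names = set(type_names or ())
    tokens = []
    previous = None
    source = source.rstrip()
    pos = 0
    while pos < len(source):
        match = TOKEN_RE.match(source, pos)
        pos = match.end()
        kind = match.lastgroup
        text = match.group(kind)
        if kind == "ident":
            if text in KEYWORDS:
                kind = "keyword"
            elif previous == ("keyword", "class"):
                # Assumption: the name after `class` is recorded as a type.
                type_names.add(text)
                kind = "TYPE_NAME"
            elif text in type_names:
                # Context decides: the same spelling is now a type name.
                kind = "TYPE_NAME"
            else:
                kind = "IDENTIFIER"
        previous = (kind, text)
        tokens.append(previous)
    return tokens


if __name__ == "__main__":
    for token in scan("class Stack { }; Stack s;"):
        print(token)
```

In this sketch the second `Stack` is reported as `TYPE_NAME` only because the earlier declaration was seen; scanning `Stack s;` on its own yields `IDENTIFIER` for the same spelling.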


Korean Semantic Role Labeling Based on Suffix Structure Analysis and Machine Learning (접사 구조 분석과 기계 학습에 기반한 한국어 의미 역 결정)

  • Seok, Miran;Kim, Yu-Seop
    • KIPS Transactions on Software and Data Engineering / v.5 no.11 / pp.555-562 / 2016
  • Semantic role labeling (SRL) determines the semantic relation between a predicate and its arguments in a sentence. Korean SRL has been difficult because the structure of Korean differs from that of English, which makes it very hard to apply the approaches developed so far. As a result, the methods proposed so far have not achieved satisfactory performance compared with English and Chinese. To address these problems, we focus on the analysis of suffix information, such as josa (case suffix) and eomi (verbal ending). Korean, like Japanese, is an agglutinative language whose words have a well-defined suffix structure. Agglutinative languages can have free word order thanks to their developed suffix structure. Arguments with a single morpheme are then labeled with statistics. In addition, machine learning algorithms such as Support Vector Machines (SVM) and Conditional Random Fields (CRF) are used to model the SRL problem for arguments that are not labeled in the suffix analysis phase. The proposed method is intended to reduce the range of argument instances to which machine learning approaches, which can result in uncertain and inaccurate role labeling, must be applied. In experiments, we use 15,224 arguments and obtain an F1-score of approximately 83.24%, about 4.85 percentage points higher than the state-of-the-art Korean SRL research.

A Study on the Ellipsis of Case markers through the Hangul letters of Hyun-Poong Kwak's family (현풍(玄風) 곽씨(郭氏) 언간(諺簡)의 격조사(格助詞) 생략(省略)에 대한 고찰(考察))

  • Jeon, Byeong-Yong
    • (The)Study of the Eastern Classic / no.33 / pp.413-435 / 2008
  • This study analyzes the patterns and functions of case-marker ellipsis in the Hangul letters of Hyun-Poong Kwak's family, written in the early 17th century. Hangul letters are more colloquial than printed texts, and ellipsis is one of their main features. In general, ellipsis occurs when a constituent is deliberately left out of a sentence because it is repeated or can be deduced from the context or occasion. Because a Hangul letter is written for a specific person, ellipsis occurs more often than in printed texts written for unspecified readers, and the ellipsis of case markers is no exception. Ellipsis has the following functions. The first is 'brevity': ellipsis makes communication more convenient. The second is 'informality': the informality produced by ellipsis lets people express their thoughts and feelings fluently and naturally, which is why ellipsis occurs more frequently in spoken language than in written language. The third is 'quickness': the same information can be delivered more quickly by a sentence with ellipsis than by one without. In the 21st century, ellipsis is misused in netizen communication for the sake of this quickness. The last is 'the effect of a literary style': ellipsis can produce effects such as a 'rhythm effect', a 'letter style effect', and a 'translation style effect'. The analysis shows that the frequency of case-marker ellipsis follows this order: [subjective objective > Locative > Dative > Comitative > Instrumental]

A Contrastive Study on Korean and Chinese Passive Expression: Centered on Korean Act Subject Marks and Chinese Passive Marks (한·중 피동 표현 대조 연구 - 한국어 행위주 표지와 중국어 피동 표지 대비 중심으로 -)

  • Yu, Tong-Tong;Kim, In-Kyun
    • Cross-Cultural Studies / v.47 / pp.217-240 / 2017
  • This paper presents a contrastive analysis of the Korean act subject markers '-에게(한테), -로, -에' and the Chinese passive markers '被 (bèi) / 让 (ràng) / 叫 (jiào) / 给 (gěi)'. The distribution and characteristics of each form were examined, and the correspondences between the forms were analyzed by comparison. The comparison focused on three aspects: the tangible characteristics of the two languages, selectional restrictions on the 'act subject' or 'passive subject' in passive expressions, and constraints on the use of act subject (passive) markers in Korean passive expressions formed with '받다'. The Korean act subject markers '-에게(한테), -로, -에' and the Chinese passive markers '被/让/叫/给' always function adverbially in passive expressions, in combination with the act subject. Despite this common point, some differences in use between the two languages were revealed. First, we reveal that the 'act subject' and the conjoined manner follow the passive expression according to the characteristics of the two languages. In addition, the act subject markers of Korean passive expressions, '에게/한테, -에/로', serve only an investigative role and are constrained only by the [±animate] feature of the act subject. Chinese passive markers '被/让/叫/给', by contrast, are often restricted by the [±animate] feature of the passive subject and by the presence or absence of the act subject. In Korean passive expressions formed with '받다', the act subject markers '에게/한테, -에/로' are used, whereas the Chinese passive markers are restricted by the meaning of the lexical items in the sentence.

A Qualitative Study on English Speaking Tasks Experienced by Beginner Level EFL Learners (초급 수준의 영어학습자들이 경험한 그림을 활용한 영어 말하기 과업에 관한 연구)

  • Kim, Byung-Sun;Yoon, Tecnam
    • The Journal of the Korea Contents Association / v.21 no.10 / pp.603-612 / 2021
  • The purpose of this study is to have beginner-level English learners experience an English speaking task using pictures, and to analyze the meaning of that experience using a phenomenological research method. Ten freshmen majoring in Power Generation Facilities at Korean Polytechnic University in Gangwon-do were selected as research participants. Face-to-face interviews and SNS were used for data collection, and Colaizzi's research method was adopted for data analysis. The analysis yielded 9 themes, 4 theme clusters, and 2 categories. The results are as follows. First, through the picture-based speaking task, the participants found hope that they could speak English at their own level. Second, they stated that the visual medium of pictures increased their concentration and curiosity and lowered their anxiety. Third, they recognized that their self-confidence, native-like speaking, and speaking speed improved as they became familiar with speaking English. Fourth, the biggest difficulty in the speaking task was vocabulary: they felt limited in explaining the pictures and had considerable trouble translating Korean words into English words. Finally, the results of this study confirmed the effect of pictures as a medium, and necessary future studies were suggested.

Sentiment Analysis using Robust Parallel Tri-LSTM Sentence Embedding in Out-of-Vocabulary Word (Out-of-Vocabulary 단어에 강건한 병렬 Tri-LSTM 문장 임베딩을 이용한 감정분석)

  • Lee, Hyun Young;Kang, Seung Shik
    • Smart Media Journal / v.10 no.1 / pp.16-24 / 2021
  • Existing word embedding methods such as word2vec represent only the words that occur in the raw training corpus as fixed-length vectors in a continuous vector space, so in morphologically rich languages the out-of-vocabulary (OOV) problem often arises when words are mapped to fixed-length vectors. The same holds for sentence embedding: when the meaning of a sentence is represented as a fixed-length vector by composing the vectors of its words, OOV words make it difficult to represent the sentence meaningfully. In particular, because Korean is an agglutinative language whose words combine lexical and grammatical morphemes, handling OOV words is an important factor in improving performance. In this paper, we propose a parallel Tri-LSTM sentence embedding that is robust to the OOV problem by extending the use of word-level morphological information to the sentence level. In a sentiment analysis task on a Korean corpus, we found empirically that the character unit is a better embedding unit than the morpheme unit for Korean sentence embedding. We achieved 86.17% accuracy on the sentiment analysis task with the parallel bidirectional Tri-LSTM sentence encoder.
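The OOV problem described in this abstract can be illustrated with a toy comparison of embedding units. The sketch below is not the Tri-LSTM model; it only counts how many units of an unseen sentence are missing from a vocabulary built at the word level versus the character level. The function names and the two example sentences are invented for illustration.

```python
def word_units(sentence):
    """Split on whitespace: each space-delimited word is one unit."""
    return sentence.split()


def char_units(sentence):
    """Use each non-space character (a Hangul syllable block) as one unit."""
    return [ch for ch in sentence if not ch.isspace()]


def oov_rate(train_sentences, test_sentence, tokenize):
    """Fraction of units in test_sentence that never occur in the training data."""
    vocabulary = {unit for s in train_sentences for unit in tokenize(s)}
    units = tokenize(test_sentence)
    if not units:
        return 0.0
    unseen = [u for u in units if u not in vocabulary]
    return len(unseen) / len(units)


if __name__ == "__main__":
    # Invented toy data: the test sentence differs only in its suffixes.
    train = ["영화가 재미있다"]
    test = "영화는 재미있었다"
    print("word-level OOV rate:", oov_rate(train, test, word_units))
    print("char-level OOV rate:", oov_rate(train, test, char_units))
```

On this toy pair, every word of the test sentence is unseen (rate 1.0) because the suffixes changed, while only two of its eight characters are unseen (rate 0.25), which is the kind of gap that motivates sub-word units in agglutinative languages.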

Investigating the Influence of ESG Information on Funding Success in Online Crowdfunding Platform by Using Text Mining Technique and Logistic Regression

  • Kyu Sung Kim;Min Gyeong Kim;Francis Joseph Costello;Kun Chang Lee
    • Journal of the Korea Society of Computer and Information / v.28 no.7 / pp.155-164 / 2023
  • In this paper, we examine the influence of Environmental, Social, and Governance (ESG)-related content on the success of online crowdfunding proposals. Along with the increasing significance of ESG standards in business, investment proposals incorporating ESG concepts are now commonplace. Due to the ESG trend, conventional wisdom holds that the majority of proposals with ESG concepts will have a higher rate of success. We investigate by analyzing over 9000 online business presentations found in a Kickstarter dataset to determine which characteristics of these proposals led to increased investment. We first utilized lexicon-based measurement and Feature Engineering to determine the relationship between environment and society scores and financial indicators. Next, Logistic Regression is utilized to determine the effect of including environmental and social terms in a project's description on its ability to obtain funding. Contrary to popular belief, our research found that microentrepreneurs were less likely to succeed with proposals that focused on ESG issues. Our research will generate new opportunities for research in the disciplines of information science and crowdfunding by shedding new light on the environment of online micro-entrepreneurship.
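The two steps named in this abstract, lexicon-based measurement followed by logistic regression, can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' pipeline: the lexicon, the example text, and the intercept and coefficient values are all hypothetical, and no model is fitted here.

```python
import math
import re


def lexicon_score(text, lexicon):
    """Share of word tokens in the text that appear in the lexicon."""
    tokens = re.findall(r"[a-z]+", text.lower())
    if not tokens:
        return 0.0
    return sum(token in lexicon for token in tokens) / len(tokens)


def success_probability(score, intercept, coefficient):
    """Logistic model: P(funded) = 1 / (1 + exp(-(intercept + coefficient * score)))."""
    return 1.0 / (1.0 + math.exp(-(intercept + coefficient * score)))


if __name__ == "__main__":
    # Hypothetical environmental lexicon and project description.
    environment_terms = {"solar", "clean", "recycled"}
    description = "Solar power for clean water"
    score = lexicon_score(description, environment_terms)
    # Hypothetical coefficients; real values would come from fitting the model.
    print(score, success_probability(score, intercept=0.0, coefficient=-2.0))
```

In a fitted model, the sign of the coefficient on the lexicon score indicates whether such terms are associated with a higher or lower chance of funding.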

A Study of the New Chinese Words Under the Influence of Culture Content (문화 콘텐츠 영향의 신조 중국어 고찰)

  • Meng, Xiang-Shan;Lee, Kwang-Ho
    • Journal of Korea Entertainment Industry Association / v.13 no.8 / pp.131-142 / 2019
  • This paper examines and analyzes new Chinese words that have arisen from cultural content. The development of the Korean entertainment industry has created a Korean Wave around the world. Through it, many Korean words, Internet vocabulary, and cultural concepts have begun to enter China, and many of the new words that have appeared on the Chinese Internet are due to this cultural content. As the number of Korean fans and Korean learners increases, these new Internet words are widely used. New Chinese words influenced by Korean cultural content are considered an important part of the new Chinese vocabulary. To recognize and understand them accurately, six categories of new Chinese words were first analyzed: figurative meaning, substitution, loanwords, abbreviation, compounding, and derivation. This classification also applies to Chinese words influenced by cultural content. New Internet words derived from Korean culture take three forms: new words in Chinese characters, new words in alphabetic letters, and extended meanings. New words were also analyzed in terms of their acquisition of new meanings, with specific news headlines and songs cited for each category. The new Chinese words confirm the influence of cultural content. These words are expected to enrich the Chinese vocabulary and to facilitate communication, and they are often used in public media and in everyday life. We should recognize the existence of these new Chinese words and perceive them accurately.

Individualism and collectivism in ethical decision making (문화성향은 윤리적 의사결정의 과정에 영향을 주는가?)

  • Hong Im Shin
    • Korean Journal of Culture and Social Issue / v.21 no.1 / pp.67-96 / 2015
  • Do cultural differences affect moral decisions? Two studies were conducted to investigate whether attitudes of individualism vs. collectivism have an impact on ethical decision making. Study 1 (N=92) showed that utilitarianism was preferred in a situation in which an intervention resulted in the best outcome (i.e., saving more people's lives), while deontology was preferred in a situation in which the focus was on the negative consequences of the intervention (i.e., personal sacrifices). Additionally, the idiocentric and allocentric groups differed regarding moral aspects. In the idiocentric group, harm and fairness were regarded as more important than other moral aspects, while in the allocentric group, not only harm and fairness but also ingroup and authority were perceived as critical moral aspects. In Study 2 (N=30), after lexical decision tasks were conducted for culture priming, the mouse tracking method was used to explore the response dynamics of moral decision processes while participants judged the appropriateness of interventions in moral dilemmas. In the condition that focused on the small number of victims, there were more maximal deviations and more x-flips in the individualism priming group than in the collectivism priming group, which showed that the participants in the individualism condition deliberated more before choosing between utilitarianism and deontology. In addition, the participants in the individualism priming condition showed more maximal deviations in their mouse trajectories for ingroup-related interventions in moral dilemmas than those in the collectivism priming condition. These results suggest that the automatic emotional process and the controlled deliberative process in moral decision making may interact with the cultural dispositions of individuals and the focus of situations.


Aspects of Korean rhythm realization by second language learners: Focusing on Chinese learners of Korean (제 2언어 학습자의 한국어 리듬 실현양상 -중국인 한국어 학습자를 중심으로-)

  • Youngsook Yune
    • Phonetics and Speech Sciences / v.15 no.3 / pp.27-35 / 2023
  • This study investigated the effect of Chinese on the production of Korean rhythm. Korean and Chinese are typologically classified into different rhythmic categories; because of this, the phonological properties of Korean and Chinese are similar and different at the same time. As a result, Chinese can exert both positive and negative influences on the realization of Korean rhythm. To investigate the influence of L2 learners' native-language rhythm on their target language, we conducted an acoustic analysis, using acoustic rhythm metrics, of the speech of 5 native Korean speakers (KS) and 10 advanced Chinese learners of Korean (CS). The material analyzed was a short paragraph of five sentences containing a variety of syllable structures. The results showed that KS and CS rhythms are similar in %V, VarcoV, and nPVI_S. However, CS, unlike KS, showed characteristics closer to those of a stress-timed language in the values of %V and VarcoV. There was also a significant difference in nPVI_V values. These results demonstrate a negative influence of the native language on the realization of Korean rhythm. This can be attributed to the fact that, because of the neutral tone, not all vowels in a Chinese sentence are pronounced with the same emphasis. In this sense, the study allowed us to observe influences of the L1 on L2 rhythm production.
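The rhythm metrics named in this abstract have standard definitions that can be sketched directly. The code below assumes interval durations (in seconds) have already been measured from the recordings; the function names are hypothetical, the choice of the population standard deviation in `varco` is an assumption (implementations differ), and the sample durations are invented. The abstract does not define which intervals nPVI_S and nPVI_V use, so `npvi` accepts any sequence of successive interval durations.

```python
from statistics import mean, pstdev


def percent_v(vocalic, consonantal):
    """%V: share of total duration taken up by vocalic intervals, in percent."""
    total = sum(vocalic) + sum(consonantal)
    return 100.0 * sum(vocalic) / total


def varco(durations):
    """Varco: standard deviation of durations divided by their mean, times 100.

    Assumption: population standard deviation is used here.
    """
    return 100.0 * pstdev(durations) / mean(durations)


def npvi(durations):
    """Normalized Pairwise Variability Index over successive intervals.

    Requires at least two durations; otherwise statistics.mean raises an error.
    """
    pairs = zip(durations, durations[1:])
    terms = [abs(a - b) / ((a + b) / 2) for a, b in pairs]
    return 100.0 * mean(terms)


if __name__ == "__main__":
    # Invented durations in seconds, for illustration only.
    vowels = [0.08, 0.15, 0.07, 0.12]
    consonants = [0.06, 0.09, 0.05, 0.10]
    print("%V     :", percent_v(vowels, consonants))
    print("VarcoV :", varco(vowels))
    print("nPVI-V :", npvi(vowels))
```

Perfectly even durations give a Varco and an nPVI of zero; larger values indicate more durational variability, which is the pattern the abstract associates with stress-timed languages.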