• Title/Summary/Keyword: 어휘정보

Search Result 1,062, Processing Time 0.023 seconds

Sentiment Classification considering Korean Features (한국어 특성을 고려한 감성 분류)

  • Kim, Jung-Ho;Kim, Myung-Kyu;Cha, Myung-Hoon;In, Joo-Ho;Chae, Soo-Hoan
    • Science of Emotion and Sensibility
    • /
    • v.13 no.3
    • /
    • pp.449-458
    • /
    • 2010
  • As occasion demands to obtain efficient information from many documents and reviews on the Internet in many kinds of fields, automatic classification of opinion or thought is required. These automatic classification is called sentiment classification, which can be divided into three steps, such as subjective expression classification to extract subjective sentences from documents, sentiment classification to classify whether the polarity of documents is positive or negative, and strength classification to classify whether the documents have weak polarity or strong polarity. The latest studies in Opinion Mining have used N-gram words, lexical phrase pattern, and syntactic phrase pattern, etc. They have not used single word as feature for classification. Especially, patterns have been used frequently as feature because they are more flexible than N-gram words and are also more deterministic than single word. Theses studies are mainly concerned with English, other studies using patterns for Korean are still at an early stage. Although Korean has a slight difference in the meaning between predicates by the change of endings, which is 'Eomi' in Korean, of declinable words, the earlier studies about Korean opinion classification removed endings from predicates only to extract stems. Finally, this study introduces the earlier studies and methods using pattern for English, uses extracted sentimental patterns from Korean documents, and classifies polarities of these documents. In this paper, it also analyses the influence of the change of endings on performances of opinion classification.

  • PDF

The Effect of the Individual differences in Cognitive Processes on Paragraph Comprehension: Structural Equation Modeling (인지정보처리의 개인차와 문단의 이해: 구조모형 연구)

  • Lee, Yoonhyoung;Kwon, Youan
    • Korean Journal of Cognitive Science
    • /
    • v.23 no.4
    • /
    • pp.487-515
    • /
    • 2012
  • The purpose of this study was to investigate the effect of the individual differences in cognitive processes on paragraph comprehension. To do so, the lexical decision task and the pattern comparison task were used to measure the low-level cognitive processes. Digit span task was used to test the phonological loop capacity. The individual differences of the central executive processing capacity were measured by operational span task. Reading span task was used to test the working memory capacity related with the sentence processing. Reading times and accuracies of the logically valid inferences and logically void inferences were tested to measure the high-level cognitive processes. Reading times and accuracies for the target sentences with and without prior explicit causal sentence were measured to test individuals' paragraph comprehension abilities. The results showed that the speed of the low-level cognitive processes was related with the speed of the high-level cognitive processes. Also, the accuracy of the low-level cognitive processes was related with the accuracy of the high-level cognitive processes while there was no significant correlation between the speed and the accuracy in any measures of the cognitive processes. Working memory capacity was related with the accuracy of the cognitive processes while it was not significantly correlated with the speed of the cognitive processes. Most importantly, the speed of low-level cognitive processes significantly affected the speed of the paragraph comprehension while the working memory capacity and the high-level cognitive processes had influences on the accuracies of the paragraph comprehension. The speed of the paragraph comprehension had no influence on the accuracies of the paragraph comprehension.

  • PDF

Descriptive Characteristics of the Label Texts Related to Earth Science: Toward Educationally Meaningful Communication (교육적으로 유의미한 의사소통을 위한 지구과학 관련 전시 라벨의 서술 특징)

  • Kim, Chan-Jong;Park, Eun-Ji;Yoon, Sae-Yeol;Lee, Sun-Kyung
    • Journal of the Korean earth science society
    • /
    • v.33 no.1
    • /
    • pp.94-109
    • /
    • 2012
  • The purpose of this study is to analyse the descriptive characteristics of the label texts related to Earth Science at a science museum and a natural history museum in Korea. The data were collected from Korean National Science Museum and Seodaemun Natural History Museum. The analysis framework was modified according to the Systemic Functional Linguistics. As a result, characteristics of the labels are 1) mostly declarative sentences, 2) appropriate amount of scientific information, and 3) mainly 'facts'. Moreover, all of the text genre are 4) 'logical expositions'. Particularly in Korean National Science Museum, the labels present 5) more scientific words among the entire terminologies and 6) more than half subjects omitted or long nominalized. Those results may imply that the labels can lead one-way communication regarding the culture of science rather than two-way. This study presents the descriptive characteristics of the label texts to make educationally meaningful communication possible by building an open structure between visitors' own culture in everyday life and the culture of science.

Semantic Network Analysis of Presidential Debates in 2007 Election in Korea (제17대 대통령 후보 합동 토론 언어네트워크 분석 - 북한 관련 이슈를 중심으로)

  • Park, Sung-Hee
    • Korean journal of communication and information
    • /
    • v.45
    • /
    • pp.220-254
    • /
    • 2009
  • Presidential TV debates serve as an important instrument for the general viewers to evaluate the candidates’ character, to examine their policy, and finally to make an important political decisions to cast ballots. Every words candidates utter in the course of entire election campaign exert influence of a certain significance by delivering their ideas and by creating clashes with their respective opponents. This study focuses on the conceptual venue, coined as ‘stasis’ by ancient rhetoricians, in which the clashes take place, and examines the words selection made by each candidates, the manners in which they form stasis, call for evidence, educate the public, and finally create a legitimate form of political argumentation. The study applied computer based content analysis using KrKwic and UCINET software to analyze semantic networks among the candidates. The results showed three major candidates, namely Lee Myung Bak, Jung Dong Young, and Lee Hoi Chang, displayed separate patterns in their use of language, by selecting the words that are often neglected by their opponents. Apparently, the absence of stasis and the lack of speaking mutual language significantly undermined the effects of debates. Central questions regarding issues of North Korea failed to meet basic requirements, and the respondents failed to engage in effective argumentation process.

  • PDF

A Trustworthiness Improving Link Evaluation Technique for LOD considering the Syntactic Properties of RDFS, OWL, and OWL2 (RDFS, OWL, OWL2의 문법특성을 고려한 신뢰향상적 LOD 연결성 평가 기법)

  • Park, Jaeyeong;Sohn, Yonglak
    • Journal of KIISE:Databases
    • /
    • v.41 no.4
    • /
    • pp.226-241
    • /
    • 2014
  • LOD(Linked Open Data) is composed of RDF triples which are based on ontologies. They are identified, linked, and accessed under the principles of linked data. Publications of LOD data sets lead to the extension of LOD cloud and ultimately progress to the web of data. However, if ontologically the same things in different LOD data sets are identified by different URIs, it is difficult to figure out their sameness and to provide trustworthy links among them. To solve this problem, we suggest a Trustworthiness Improving Link Evaluation, TILE for short, technique. TILE evaluates links in 4 steps. Step 1 is to consider the inference property of syntactic elements in LOD data set and then generate RDF triples which have existed implicitly. In Step 2, TILE appoints predicates, compares their objects in triples, and then evaluates links between the subjects in the triples. In Step 3, TILE evaluates the predicates' syntactic property at the standpoints of subject description and vocabulary definition and compensates the evaluation results of Step 2. The syntactic elements considered by TILE contain RDFS, OWL, OWL2 which are recommended by W3C. Finally, TILE makes the publisher of LOD data set review the evaluation results and then decide whether to re-evaluate or finalize the links. This leads the publishers' responsibility to be reflected in the trustworthiness of links among the data published.

Implementation of Policy based In-depth Searching for Identical Entities and Cleansing System in LOD Cloud (LOD 클라우드에서의 연결정책 기반 동일개체 심층검색 및 정제 시스템 구현)

  • Kim, Kwangmin;Sohn, Yonglak
    • Journal of Internet Computing and Services
    • /
    • v.19 no.3
    • /
    • pp.67-77
    • /
    • 2018
  • This paper suggests that LOD establishes its own link policy and publishes it to LOD cloud to provide identity among entities in different LODs. For specifying the link policy, we proposed vocabulary set founded on RDF model as well. We implemented Policy based In-depth Searching and Cleansing(PISC for short) system that proceeds in-depth searching across LODs by referencing the link policies. PISC has been published on Github. LODs have participated voluntarily to LOD cloud so that degree of the entity identity needs to be evaluated. PISC, therefore, evaluates the identities and cleanses the searched entities to confine them to that exceed user's criterion of entity identity level. As for searching results, PISC provides entity's detailed contents which have been collected from diverse LODs and ontology customized to the content. Simulation of PISC has been performed on DBpedia's 5 LODs. We found that similarity of 0.9 of source and target RDF triples' objects provided appropriate expansion ratio and inclusion ratio of searching result. For sufficient identity of searched entities, 3 or more target LODs are required to be specified in link policy.

A Design and Implementation of WML Compiler for WAP Gateway for Wireless Internet Services (무선 인터넷 서비스를 위한 WAP 게이트웨이용 WML 컴파일러의 설계 및 구현)

  • Choi, Eun-Jeong;Han, Dong-Won;Lim, Kyung-Shik
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.7 no.2
    • /
    • pp.165-182
    • /
    • 2001
  • In this paper, we describe a design and implementation of the Wireless Markup Language(WML) compiler to deploy wireless Internet services effectively. The WML compiler translates textual WML decks into binary ones in order to reduce the traffic on wireless links that have relatively low bandwidth to wireline links and mitigate the processing overhead of WML decks on, wireless terminals that have relatively low processing power to fixed workstations. In addition, it takes over the overhead of eXtensible Markup Language(XML) well-formedness and validation processes. The WML compiler consists of the lexical analyzer and parser modules. The granunar for the WML parser module is LALR(1) context-free grammar that is designed based on XML 1.0 and WML 1.2 DTD(Document Type Definition) with the consideration of the Wireless Application Protocol Binary XML grammar. The grammar description is converted into a C program to parse that grammar by using parser generator. Even though the tags in WML will be extended or WML DTD will be upgraded, this approach has the advantage of flexibility because the program is generated by modifying just the changed parts. We have verified the functionality of the WML compiler by using a WML decompiler in the public domain and by using the Nokia WAP Toolkit as a WAP client. To measurethe compressibility gain of the WML compiler, we have tested a large number of textual WML decks and obtained a maximum 85 %. As the effect of compression is reduced when the portion of general textual strings increases relative to one of the tags and attributes in a WML deck, an extended encoding method might be needed for specific applications such as compiling of the WML decks to which the Hyper Text Markup Language document is translated dynamically.

  • PDF

A Preliminary Study on Extending OAK Metadata for Research Data (연구데이터 관리를 위한 OAK 메타데이터 확장 방안 연구)

  • Lee, Mihwa;Lee, Eun-Ju;Rho, Jee-Hyun
    • Journal of Korean Library and Information Science Society
    • /
    • v.51 no.3
    • /
    • pp.27-51
    • /
    • 2020
  • This study aims to propose an extended OAK metadata for research data that would be described in OAK, an open access repository of the National Library of Korea. As a research method, literature review, case studies, and interviews with related parties were conducted. The method of extending the existing OAK metadata for research data was derived as follows. First, in modeling for research data, the structure of the collection> item> file is maintained, the collection is placed as a higher group to which the research data can be grouped, and item was combined metadata and files or digital objects of various formats together. Second, by mapping the metadata standard and case organizations with the existing OAK metadata, elements judged to need to be extended to OAK for research data were selected and reflected in the existing OAK. Third, the controlled vocabulary and syntax are also proposed so that it can be used for search or later statistics through structured data. By expanding the OAK metadata to describe research data, research data produced in Korea can be officially stored and used, which is the basis for preventing duplication of research and sharing and recycling research results nationally.

Analysis on Research Trends and Proposal for Standardization of Construction & Architectural Terms in Korea (국내 건설·건축용어 연구의 동향 분석 및 표준화 제안)

  • Park, Eunha;Jeon, Jinwoo
    • The Journal of the Korea Contents Association
    • /
    • v.15 no.5
    • /
    • pp.620-629
    • /
    • 2015
  • As the construction industry becomes bigger and more complicated, standardization of terms should be established between academic and industrial fields in order to accumulate and share information technology. The aim of this study is to investigate and analyze the research trends and actual usage of construction and architectural terms in Korea. For this purpose, we examined research related to construction and architectural terms by searching RISS up to August 2014. We also analyzed document types and contents of research by year. As a result, 130 research studies related to construction and architectural terms were searched. Of document types, glossary ranks the highest, followed by academic journal papers, master's theses and research reports. Research related to construction and architectural terms began in 1939, and was actively studied between the mid-1980s to the mid-1990s. Within the research, list and opinion of related construction and architectural terms are found the most frequently, followed by standardization, analysis, alteration, dictionary and wordbook, and search system of terms. Despite these efforts and research, standardization of terms has not yet been consolidated between academic and industrial fields. Therefor, we suggest six proposals in order to standardize the terms. This study is an attempt to see the trends and conditions of construction and architectural terms and to provide base-line data and an insight for future research.

Debelppment of C++ Compiler and Programming Environment (C++컴파일러 및 프로그래밍 환경 개발)

  • Jang, Cheon-Hyeon;O, Se-Man
    • The Transactions of the Korea Information Processing Society
    • /
    • v.4 no.3
    • /
    • pp.831-845
    • /
    • 1997
  • In this paper,we proposed and developed a compiler and interactive programming enviroments for C++ wich is mostly worth of nitice among the object -oriented languages.To develope the compiler for C++ we took front=end/back-end model using EM virtual machine.In develpoing Front-End,we formailized C++ gram-mar with the context semsitive tokens which must be manipulated by dexical scanner and designed a AST class li-brary which is the hierarchy of AST node class and well defined interface among them,In develpoing Bacik-End,we proposed model for three major components :code oprtimizer,code generator and run-time enviroments.We emphasized the retargatable back-end which can be systrmatically reconfigured to genrate code for a variety of distinct target computers.We also developed terr pattern matching algorithm and implemented target code gen-erator which produce SPARC code.We also proposed the theroy and model for construction interative pro-gramming enviroments. To represent language features we adopt AST as internal reprsentation and propose uncremental analysis algorithm and viseal digrams.We also studied unparsing scheme, visual diagram,graphical user interface to generate interactive environments automatically Results of our resarch will be very useful for developing a complier and programming environments, and also can be used in compilers for parallel and distributed enviroments.

  • PDF