• Title/Summary/Keyword: Linguistic processing

Development of a Traceability Analysis Method Based on Case Grammar for NPP Requirement Documents Written in Korean Language

  • Yoo Yeong Jae;Seong Poong Hyun;Kim Man Cheol
    • Nuclear Engineering and Technology / v.36 no.4 / pp.295-303 / 2004
  • Software inspection is widely believed to be an effective method for software verification and validation (V&V). However, software inspection is labor-intensive and, because it uses little technology, it is viewed as unsuitable for a more technology-oriented development environment. Nevertheless, software inspection is gaining in popularity. The KAIST Nuclear I&C and Information Engineering Laboratory (NICIEL) has developed software management and inspection support tools, collectively named "SIS-RT." SIS-RT is designed to partially automate the software inspection process and supports traceability analysis across a given set of specification documents. To make SIS-RT compatible with documents written in Korean, certain natural language processing techniques have been studied [9]. Among the techniques considered, case grammar is the most suitable for analyzing the Korean language [3]. In this paper, we propose a methodology that uses a case grammar approach to analyze the traceability between documents written in Korean, followed by a discussion of some examples of such an analysis.

On a Novel Way of Processing Data that Uses Fuzzy Sets for Later Use in Rule-Based Regression and Pattern Classification

  • Mendel, Jerry M.
    • International Journal of Fuzzy Logic and Intelligent Systems / v.14 no.1 / pp.1-7 / 2014
  • This paper presents a novel method for simultaneously and automatically choosing the nonlinear structures of regressors or discriminant functions, as well as the number of terms to include in a rule-based regression model or pattern classifier. Variables are first partitioned into subsets, each of which has a linguistic term (called a causal condition) associated with it; fuzzy sets are used to model the terms. Candidate interconnections (causal combinations) of either a term or its complement are formed, where the connecting word is AND, which is modeled using the minimum operation. The data establish which of the candidate causal combinations survive. A novel theoretical result leads to an exponential speedup in establishing this.
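
The idea of forming causal combinations from a term or its complement and joining them with a minimum-based AND can be illustrated with a short Python sketch. This is not the paper's algorithm: the survival rule (a combination survives if at least one data case fires it above 0.5), the complement as `1 - membership`, and all function names are assumptions made for illustration. The second function relies on a simple observation, which may or may not be the theoretical result the abstract refers to: a membership and its complement cannot both exceed 0.5, so each case can fire at most one combination above 0.5, and the 2^k enumeration can be skipped.

```python
from itertools import product


def firing_level(memberships, pattern):
    """AND of literals via minimum; pattern[i] is True for the term, False for its complement."""
    return min(m if use_term else 1.0 - m for m, use_term in zip(memberships, pattern))


def surviving_combinations(cases, threshold=0.5):
    """Brute force over all 2^k candidate combinations (assumed survival rule, see lead-in)."""
    k = len(cases[0])
    survivors = set()
    for pattern in product([True, False], repeat=k):
        if any(firing_level(case, pattern) > threshold for case in cases):
            survivors.add(pattern)
    return survivors


def surviving_combinations_fast(cases):
    """Shortcut for threshold 0.5: each case can fire at most one combination above 0.5."""
    survivors = set()
    for case in cases:
        # A membership of exactly 0.5 makes every literal <= 0.5, so nothing fires above 0.5.
        if all(m != 0.5 for m in case):
            survivors.add(tuple(m > 0.5 for m in case))
    return survivors


# Hypothetical membership grades of three cases in three linguistic terms.
cases = [(0.9, 0.2, 0.7), (0.8, 0.1, 0.6), (0.3, 0.4, 0.5)]
print(surviving_combinations(cases))
print(surviving_combinations_fast(cases))
```

Both functions return the same set for the sample data; the brute-force version does work proportional to 2^k per case, while the shortcut does work proportional to k.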

Recent Progresses in the Linguistic Modeling of Biological Sequences Based on Formal Language Theory

  • Park, Hyun-Seok;Galbadrakh, Bulgan;Kim, Young-Mi
    • Genomics & Informatics / v.9 no.1 / pp.5-11 / 2011
  • Treating genomes as languages raises the possibility of producing concise generalizations about the information in biological sequences. Grammars used in this way would constitute a model of underlying biological processes or structures, and may, in fact, serve as an appropriate tool for theory formation. The increasing number of available biological sequences further highlights the growing need to develop grammatical systems in bioinformatics. The intent of this review is therefore to list bibliographic references on recent progress in the grammatical modeling of biological sequences. The review also briefly introduces basic formal language theory, such as the Chomsky hierarchy, for non-experts in computational linguistics, and provides pointers for starting a deeper investigation into this field.

Intelligent Multimedia Educational System on Distributed Environment (분산 환경에서의 지능형 멀티미디어 교육 시스템)

  • Lee, Se-Hun;Yun, Gyeong-Seop
    • The Transactions of the Korea Information Processing Society / v.6 no.5 / pp.1323-1331 / 1999
  • This paper proposes a multimedia educational system that can provide intelligent instruction in a distributed environment. The proposed system is designed to support individualized instruction and real-time user interaction. Because the system is based on CORBA, we add a group management module for multi-user environments, which gives it distributed computing capabilities. Using the MHEG standard, we provide multimedia courseware and real-time user interaction. To diagnose students' responses and generate evaluations, we use several linguistic variables from fuzzy theory. The system has two major advantages. First, it can generate problems dynamically and apply a dynamic instruction strategy. Second, it increases the reusability of courseware material by using a standard for multimedia representation and communication. We use CORBA and MHEG to overcome the disadvantages of HTTP on the Web, namely its passive protocol and poor interactivity.

An Intuitionistic Fuzzy Approach to Classify the User Based on an Assessment of the Learner's Knowledge Level in E-Learning Decision-Making

  • Goyal, Mukta;Yadav, Divakar;Tripathi, Alka
    • Journal of Information Processing Systems / v.13 no.1 / pp.57-67 / 2017
  • In this paper, Atanassov's intuitionistic fuzzy set theory is used to handle the uncertainty of students' knowledge of domain concepts in an E-learning system. Their knowledge of these domain concepts was collected from tests conducted during their learning phase. An Atanassov intuitionistic fuzzy user model is proposed to deal with vagueness in the description of the user's knowledge of domain concepts. The user model uses Atanassov's intuitionistic fuzzy sets for knowledge representation and linguistic rules for updating the user model. The scores obtained by each student were collected in this model, and the decision about the student's knowledge acquisition for each concept (completely learned, completely known, partially known, or completely unknown) was placed into the information table. Finally, it has been found that the proposed scheme is more appropriate than the fuzzy scheme.

Microblog Sentiment Analysis Method Based on Spectral Clustering

  • Dong, Shi;Zhang, Xingang;Li, Ya
    • Journal of Information Processing Systems / v.14 no.3 / pp.727-739 / 2018
  • This study evaluates users' viewpoints on focus incidents using microblog sentiment analysis, which has been actively researched in academia. Most existing works have adopted traditional supervised machine learning methods to analyze emotions in microblogs; however, these approaches may not be suitable for Chinese due to linguistic differences. This paper proposes a new microblog sentiment analysis method that mines associated microblog emotions based on a popular microblog through user-building combined with spectral clustering to analyze microblog content. Experimental results on a public microblog benchmark corpus show that the proposed method can improve identification accuracy and reduce manual labeling time compared to existing methods.

Named Entity and Coreference Tagging for Information Extraction (정보추출을 위한 고유명사 및 대용어 태깅)

  • Jang, Sung-Ho;Kang, Seung-Shik;Woo, Chong-Woo;Yun, Bo-Hyun
    • Annual Conference of KIPS / 2002.04b / pp.1111-1114 / 2002
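  • As the importance of information extraction continues to grow, this paper introduces Named Entity, Coreference, Information Extraction, and Information Retrieval as they are needed for information extraction, and presents definitions and methods for applying them to Korean. In addition, to make tagging of large document collections efficient, we developed the NE-CO tagging tool, which simplifies Named Entity and Coreference tagging. Using this tool, we built a pilot corpus of 300 documents in the economy, performance, and travel domains; this corpus is to be used as basic material for developing a Korean information extraction system.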

A method for Measuring Second Language Ability based on linguistic cognitive experiments (언어 인지 실험을 통한 외국어 능력 측정 방법)

  • Yang, Yeong-Wook;Lee, Sae-Byeok;Lim, Heui-Seok
    • Annual Conference of KIPS / 2012.11a / pp.362-363 / 2012
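  • Foreign language ability is one of the essential requirements of modern society. This paper proposes a method for assessing foreign language ability from a psycholinguistic perspective, rather than through conventional proficiency tests. Processing a foreign language requires a linguistic cognitive process that converts the foreign language into the native language. To measure this linguistic cognitive ability, we propose the following experiments: Reading LDT, Listening LDT, Verbal Span, Yes-No task (Semantic), and Same-Different task. These tasks measure the subject's reading, listening, memory, semantic decision, and conversion abilities, respectively.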

Image Understanding for Visual Dialog

  • Cho, Yeongsu;Kim, Incheol
    • Journal of Information Processing Systems / v.15 no.5 / pp.1171-1178 / 2019
  • This study proposes a deep neural network model based on an encoder-decoder structure for visual dialogs. Ongoing linguistic understanding of the dialog history and context is important for generating correct answers in visual dialogs, which consist of successive questions and answers about images. Nevertheless, in many cases, a visual understanding that can identify scenes or object attributes contained in images is also beneficial. Hence, at the encoding stage, the proposed model employs a separate person detector and an attribute recognizer in addition to the visual features extracted from the entire input image by a convolutional neural network. The model thereby emphasizes attributes of the people in the image, such as gender, age, and dress concept, and uses them to generate answers. The results of experiments conducted on VisDial v0.9, a large benchmark dataset, confirmed that the proposed model performed well.

A Corpus-Based Study on the Vocabulary Development of Korean Learners

  • Sinhye Nam;Chaerin Jang;Sunyoung Kim
    • Journal of Information Processing Systems / v.20 no.4 / pp.477-490 / 2024
  • This study identifies the vocabulary usage patterns of Korean heritage language learners. We analyzed the learners' interlanguage and examined their vocabulary usage patterns, especially the major content keywords used at each proficiency level. The Korean Learner's Corpus from the National Institute of Korean Language was used for the data analysis. We found that as the heritage language learners' proficiency increases, low-frequency (high-level) vocabulary appears more often among the keywords, and the semantic areas of their vocabulary expand from daily to social to specialized fields. This confirms that the vocabulary use of Korean heritage language learners develops as their proficiency increases. The study also exemplifies how corpus-based applied linguistic research and computer science can be integrated using a keyword extraction algorithm.