• 제목/요약/키워드: Semantic Data

검색결과 1,396건 처리시간 0.026초

시맨틱 텍스트 마이닝을 위한 온톨로지 활용 방안 (Using Ontologies for Semantic Text Mining)

  • 유은지;김정철;이춘열;김남규
    • 한국정보시스템학회지:정보시스템연구
    • /
    • 제21권3호
    • /
    • pp.137-161
    • /
    • 2012
  • The increasing interest in big data analysis using various data mining techniques indicates that many commercial data mining tools now need to be equipped with fundamental text analysis modules. The most essential prerequisite for accurate analysis of text documents is an understanding of the exact semantics of each term in a document. The main difficulties in understanding the exact semantics of terms are mainly attributable to homonym and synonym problems, which is a traditional problem in the natural language processing field. Some major text mining tools provide a thesaurus to solve these problems, but a thesaurus cannot be used to resolve complex synonym problems. Furthermore, the use of a thesaurus is irrelevant to the issue of homonym problems and hence cannot solve them. In this paper, we propose a semantic text mining methodology that uses ontologies to improve the quality of text mining results by resolving the semantic ambiguity caused by homonym and synonym problems. We evaluate the practical applicability of the proposed methodology by performing a classification analysis to predict customer churn using real transactional data and Q&A articles from the "S" online shopping mall in Korea. The experiments revealed that the prediction model produced by our proposed semantic text mining method outperformed the model produced by traditional text mining in terms of prediction accuracy such as the response, captured response, and lift.

Motion Ontology를 이용한 비디오내 객체 움직임의 의미표현 (Semantic Representation of Moving Objectin Video Data Using Motion Ontology)

  • 신주현;김판구
    • 한국멀티미디어학회논문지
    • /
    • 제10권1호
    • /
    • pp.117-127
    • /
    • 2007
  • 멀티미디어 데이터의 활용가치가 높아짐에 따라 멀티미디어 정보의 의미적인 인식과 검색 방법에 대한 필요성이 증대되고 있다. 본 논문에서는 비디오 내 이벤트에 대한 객체 움직임 요소간의 의미표현을 위해 모션 온톨로지(Motion Ontology)를 구축하고 적용한다. 본 연구에서 제안한 방법은 워드넷(WordNet)내 동사어휘들 중 장소 이동이나 방향등과 같이 움직임을 잘 표현하는 동사들에 대해 분류하여 계층구조로 표현하고, 또한 이를 OWL/RDF(S)로 작성한다. 이는 온톨로지(Ontology)의 IS-A관계와 동의어관계가 가진 특징을 이용한 추론을 위함이며, 온톨로지(Ontology)에 기반하여 비디오 데이터를 인덱싱함으로써, 의미적 표현을 가능하게 한다. 본 연구에서 비디오 데이터에 대하여 의미적 검색을 수행한 결과, 기존 키워드 기반 검색에 비해 정확률 측면에서 약 10% 정도 향상되었다.

  • PDF

Ontology Supported Information Systems: A Review

  • Padmavathi, T.;Krishnamurthy, M.
    • Journal of Information Science Theory and Practice
    • /
    • 제2권4호
    • /
    • pp.61-76
    • /
    • 2014
  • The exponential growth of information on the web far exceeds the capacity of present day information retrieval systems and search engines, making information integration on the web difficult. In order to overcome this, semantic web technologies were proposed by the World Wide Web Consortium (W3C) to achieve a higher degree of automation and precision in information retrieval systems. Semantic web, with its promise to deliver machine understanding to the traditional web, has attracted a significant amount of research from academia as well as from industries. Semantic web is an extension of the current web in which data can be shared and reused across the internet. RDF and ontology are two essential components of the semantic web architecture which support a common framework for data storage and representation of data semantics, respectively. Ontologies being the backbone of semantic web applications, it is more relevant to study various approaches in their application, usage, and integration into web services. In this article, an effort has been made to review the research work being undertaken in the area of design and development of ontology supported information systems. This paper also briefly explains the emerging semantic web technologies and standards.

의미 특징 행렬과 의미 가변행렬을 이용한 질의 기반의 문서 요약 (Query-Based Summarization using Semantic Feature Matrix and Semantic Variable Matrix)

  • 박선
    • 한국항행학회논문지
    • /
    • 제12권4호
    • /
    • pp.372-377
    • /
    • 2008
  • 본 논문은 의미특징행렬(semantic feature matrix)과 의미변수행령(semantic variable matrix)을 이용하는 질의 기반의 새로운 문서를 요약방법을 제안한다. 제안된 방법은 비지도 학습 방법으로 질의와 문장 간에 사전학습이 필요 없고, 의미 특징(semantic feature)과 의미변수(semantic variable)를 이용하여 질의에 적합한 하위 주제를 잘 반영하여서 정확한 문서를 요약 할 수 있다. 이것은 비음수 행렬 분해가 주제들로 구성된 문서의 내부구조를 나타내는 의미특징을 자연스럽게 추출할 수 있기 때문이다. 실험결과 제안방법이 다른 방법에 비하여 좋은 성능을 보인다.

  • PDF

인문학 분야 연구데이터를 위한 연구논문의 의미구조 기반 메타데이터 연구 (A Study on the Metadata based on the Semantic Structure of the Humanities Research Articles for Research Data)

  • 안병군;고영만
    • 한국비블리아학회지
    • /
    • 제33권1호
    • /
    • pp.345-369
    • /
    • 2022
  • 본 연구의 목적은 인문학 분야 연구데이터와 연구논문을 연계할 수 있는 '인문학 분야 연구데이터를 위한 연구논문 의미구조 기반 메타데이터'를 개발하는 것이다. 인문학 분야 연구자의 관심 정보를 파악하기 위해 키워드 유형화를 진행하고 이를 일반적인 연구논문의 의미구조를 기준으로 분석함으로써 인문학 분야의 특성을 반영하기 위해 추가될 요소를 확인하였다. 이를 기반으로 인문학 분야 연구논문의 의미구조 기반 메타데이터를 구성하였으며 해당 메타데이터가 인문학 분야 연구데이터의 메타데이터로서 적합한지를 평가하기 위하여 인문학 분야 연구자를 대상으로 설문을 진행하였다. 인문학 분야 연구논문의 의미구조를 기반으로 도출된 17개의 메타데이터 요소 중 인문학 분야 연구데이터 메타데이터로 활용하기에 적합하다고 판단된 요소는 7개이며, 그 외 10개 요소는 사용 가능한 요소로 확인되었다. 평가 결과를 바탕으로 인문학 분야 연구데이터를 위한 연구논문 의미구조기반 메타데이터를 구성하였다.

시맨틱 웹 기술혁신의 채택과 확산: 질적연구접근법 (The Adoption and Diffusion of Semantic Web Technology Innovation: Qualitative Research Approach)

  • 주재훈
    • Asia pacific journal of information systems
    • /
    • 제19권1호
    • /
    • pp.33-62
    • /
    • 2009
  • Internet computing is a disruptive IT innovation. Semantic Web can be considered as an IT innovation because the Semantic Web technology possesses the potential to reduce information overload and enable semantic integration, using capabilities such as semantics and machine-processability. How should organizations adopt the Semantic Web? What factors affect the adoption and diffusion of Semantic Web innovation? Most studies on adoption and diffusion of innovation use empirical analysis as a quantitative research methodology in the post-implementation stage. There is criticism that the positivist requiring theoretical rigor can sacrifice relevance to practice. Rapid advances in technology require studies relevant to practice. In particular, it is realistically impossible to conduct quantitative approach for factors affecting adoption of the Semantic Web because the Semantic Web is in its infancy. However, in an early stage of introduction of the Semantic Web, it is necessary to give a model and some guidelines and for adoption and diffusion of the technology innovation to practitioners and researchers. Thus, the purpose of this study is to present a model of adoption and diffusion of the Semantic Web and to offer propositions as guidelines for successful adoption through a qualitative research method including multiple case studies and in-depth interviews. The researcher conducted interviews with 15 people based on face-to face and 2 interviews by telephone and e-mail to collect data to saturate the categories. Nine interviews including 2 telephone interviews were from nine user organizations adopting the technology innovation and the others were from three supply organizations. Semi-structured interviews were used to collect data. The interviews were recorded on digital voice recorder memory and subsequently transcribed verbatim. 196 pages of transcripts were obtained from about 12 hours interviews. Triangulation of evidence was achieved by examining each organization website and various documents, such as brochures and white papers. The researcher read the transcripts several times and underlined core words, phrases, or sentences. Then, data analysis used the procedure of open coding, in which the researcher forms initial categories of information about the phenomenon being studied by segmenting information. QSR NVivo version 8.0 was used to categorize sentences including similar concepts. 47 categories derived from interview data were grouped into 21 categories from which six factors were named. Five factors affecting adoption of the Semantic Web were identified. The first factor is demand pull including requirements for improving search and integration services of the existing systems and for creating new services. Second, environmental conduciveness, reference models, uncertainty, technology maturity, potential business value, government sponsorship programs, promising prospects for technology demand, complexity and trialability affect the adoption of the Semantic Web from the perspective of technology push. Third, absorptive capacity is an important role of the adoption. Fourth, suppler's competence includes communication with and training for users, and absorptive capacity of supply organization. Fifth, over-expectance which results in the gap between user's expectation level and perceived benefits has a negative impact on the adoption of the Semantic Web. Finally, the factor including critical mass of ontology, budget. visible effects is identified as a determinant affecting routinization and infusion. The researcher suggested a model of adoption and diffusion of the Semantic Web, representing relationships between six factors and adoption/diffusion as dependent variables. Six propositions are derived from the adoption/diffusion model to offer some guidelines to practitioners and a research model to further studies. Proposition 1 : Demand pull has an influence on the adoption of the Semantic Web. Proposition 1-1 : The stronger the degree of requirements for improving existing services, the more successfully the Semantic Web is adopted. Proposition 1-2 : The stronger the degree of requirements for new services, the more successfully the Semantic Web is adopted. Proposition 2 : Technology push has an influence on the adoption of the Semantic Web. Proposition 2-1 : From the perceptive of user organizations, the technology push forces such as environmental conduciveness, reference models, potential business value, and government sponsorship programs have a positive impact on the adoption of the Semantic Web while uncertainty and lower technology maturity have a negative impact on its adoption. Proposition 2-2 : From the perceptive of suppliers, the technology push forces such as environmental conduciveness, reference models, potential business value, government sponsorship programs, and promising prospects for technology demand have a positive impact on the adoption of the Semantic Web while uncertainty, lower technology maturity, complexity and lower trialability have a negative impact on its adoption. Proposition 3 : The absorptive capacities such as organizational formal support systems, officer's or manager's competency analyzing technology characteristics, their passion or willingness, and top management support are positively associated with successful adoption of the Semantic Web innovation from the perceptive of user organizations. Proposition 4 : Supplier's competence has a positive impact on the absorptive capacities of user organizations and technology push forces. Proposition 5 : The greater the gap of expectation between users and suppliers, the later the Semantic Web is adopted. Proposition 6 : The post-adoption activities such as budget allocation, reaching critical mass, and sharing ontology to offer sustainable services are positively associated with successful routinization and infusion of the Semantic Web innovation from the perceptive of user organizations.

데이터사전을 이용한 ERP애플리케이션 개발 (ERP Application Development Using Business Data Dictionary)

  • Minsu Jang;Joo-Chan Sohn;Jong-Myoung Baik
    • 한국전자거래학회지
    • /
    • 제7권1호
    • /
    • pp.141-152
    • /
    • 2002
  • Data dictionary is a collection of meta-data, which describes data produced and consumed while performing business processes. Data dictionary is an essential element for business process standardization and automation, and has a fundamental role in ERP application management and customization. Also, data dictionary facilitates B2B processes by enabling painless integration of business processes between various enterprises. We implemented data dictionary support in SEA+, a component- based scalable ERP system developed in ETRI, and found out that it's a plausible feature of business information system. We discovered that data dictionary promotes semantic, not syntactic, data management, which can make it possible to leverage viability of the tool in the coming age of more meta-data oriented computing world. We envision that business data dictionary is a firm foundation of adapting business knowledge, applications and processes into the semantic web based enterprise infra-structure.

  • PDF

계층화된 비디오 내용 모델링 및 브라우징 (Layered Video Content Modeling and Browsing)

  • 복경수;이낙규;허정필;유재수;조기형;이병엽
    • 정보처리학회논문지D
    • /
    • 제10D권7호
    • /
    • pp.1115-1126
    • /
    • 2003
  • 본 논문에서는 비디오 데이터 대한 구조적 내용과 의미적 내용에 효과적으로 표현하기 위해 비디오 모델링을 제안한다. 또한 사용자가 비디오 데이터에 대한 내용 정보를 쉽게 파악하고 재생할 수 있도록 하는 브라우징 기법을 제안한다. 제안하는 모델링은 원시 데이터 계층, 내용 계층 그리고 키프레임 계층의 세 계층으로 구성되며 비디오 데이터에 대한 논리적인 계층 구조와 의미적 내용은 내용 계층에 표현한다. 비디오 브라우징은 비디오 재생과 내용 정보를 나타내는 두 가지 브라우저를 구현하였다. 비디오 재생 브라우저는 비디오를 재생과 현재화면 정보를 나타내는 기능을 수행한다. 내용 정보 브라우저는 원시데이터, 구조적 내용 그리고 의미적 내용에 대한 정보를 브라우징하는 기능을 제공한다.

Domain-Adaptation Technique for Semantic Role Labeling with Structural Learning

  • Lim, Soojong;Lee, Changki;Ryu, Pum-Mo;Kim, Hyunki;Park, Sang Kyu;Ra, Dongyul
    • ETRI Journal
    • /
    • 제36권3호
    • /
    • pp.429-438
    • /
    • 2014
  • Semantic role labeling (SRL) is a task in natural-language processing with the aim of detecting predicates in the text, choosing their correct senses, identifying their associated arguments, and predicting the semantic roles of the arguments. Developing a high-performance SRL system for a domain requires manually annotated training data of large size in the same domain. However, such SRL training data of sufficient size is available only for a few domains. Constructing SRL training data for a new domain is very expensive. Therefore, domain adaptation in SRL can be regarded as an important problem. In this paper, we show that domain adaptation for SRL systems can achieve state-of-the-art performance when based on structural learning and exploiting a prior model approach. We provide experimental results with three different target domains showing that our method is effective even if training data of small size is available for the target domains. According to experimentations, our proposed method outperforms those of other research works by about 2% to 5% in F-score.

시맨틱 헬스케어를 위한 상호정보교환 프로세스 (Towards Semantic Healthcare with Interoperable Processes)

  • 와자하트 알리 칸;마크불 후세인;아사드 마수드 카탁;이승룡;구교호
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2011년도 춘계학술발표대회
    • /
    • pp.414-415
    • /
    • 2011
  • Due to heterogeneity in Data and Processes, healthcare systems are facing the challenge of interoperability. This heterogeneity results in different healthcare workflows of each individual organization. The compatibility of these heterogeneous workflows is possible when standards are followed. HL7 is one of the standards that is used for communicating medical data between healthcare systems. Its newer version V3 is providing semantic interoperability which is lacking in V2. The interoperability in HL7 V3 is only limited to data level and process level interoperability needs to be catered. The process level interoperability is achieved only when heterogeneous workflows are aligned. These workflows are very complex in nature due to continuous change in medical data resulting in problems related to maintenance and degree of automation. Semantic technologies plays important role in resolving the above mentioned problems. This research work is based on the integration of semantic technology in HL7 V3 standard to achieve semantic process interoperability. Web Service Modeling Framework (WSMF) is used for incorporating semantics in HL7 V3 processes and achieves seamless communication. Interaction Ontology represents the process artifacts of HL7 V3 and helps in achieving automation.