Search | Korea Science

Study on the Improvement of Extraction Performance for Domain Knowledge based Wrapper Generation (도메인 지식 기반 랩퍼 생성의 추출 성능 향상에 관한 연구)

Jeong Chang-Hoo;Choi Yun-Soo;Seo Jeong-Hyeon;Yoon Hwa-Mook
- Journal of Internet Computing and Services / v.7 no.4 / pp.67-77 / 2006
Wrappers play an important role in extracting specified information from various sources. The wrapper rules by which information is extracted are often created from domain-specific knowledge. Domain-specific knowledge helps in recognizing the meaning of text that represents various entities and values and in detecting their formats. However, such domain knowledge becomes powerless when value-representing data are not labeled with appropriate textual descriptions, or when there is nothing but a hyperlink where certain text labels or values are expected. To alleviate these problems, we propose a probabilistic method for recognizing the entity type, i.e. generating wrapper rules, when there is no label associated with value-representing text. In addition, we have devised a method for using the information reachable by following hyperlinks when textual data are not immediately available on the target web page. Our experimental work shows that the proposed methods help increase the precision of the resulting wrapper, particularly in extracting the title information, the most important entity on a web page. The proposed methods can be useful in building a more efficient and correct information extraction system for various sources of information without user intervention.

A Study on the Performance and Acceptance of Knowledge Management System By Considering Knowledge Circulation Process and Knowledge Schema (지식순환과정과 지식스키마를 고려한 지식경영시스템 성과 및 수용에 관한 연구)

Lee, Kun-Chang;Roh, Jeong-Ran
- Journal of the Korean Society for Library and Information Science / v.36 no.3 / pp.259-274 / 2002
Recently, a great many corporations have eagerly adopted knowledge management systems to enhance their competitiveness. However, since a knowledge management system is not just a simple information system but an entity that creates intangible assets called "knowledge", we need to develop a new approach to investigating the performance and acceptance of knowledge management systems from a perspective that allows for knowledge-sensitive constructs. In this regard, we develop new constructs such as knowledge schema and several knowledge circulation-related activities. As a research model, we adopt the well-known technology acceptance model (TAM) by Davis (1989) and extend it to incorporate knowledge schema. Using statistically valid and usable questionnaire survey data collected from 886 respondents in a large corporation that routinely uses a knowledge management system, we empirically obtained a robust result showing that knowledge schema and knowledge circulation activities are valid determinants of the performance and acceptance of knowledge management systems.
https://doi.org/10.4275/KSLIS.2002.36.3.259

Systematic Review of Sustainable Knowledge Transfer Process in Government-Industry-Academia Consortium

Faisal, Rouhi;Chong, Aik Lee;Yee, Angelina Seow Voon
- Asian Journal of Innovation and Policy / v.6 no.3 / pp.295-312 / 2017
The purpose of this case study is to understand the sustainability practices of the knowledge transfer process at the Malaysian government-industry-academia consortium. At this stage in the research, the R&D consortium is defined as an entity established by two or more organizations that pool resources and share decision making for cooperative research and development activities. In an attempt to understand the formation, outcomes and sustainability of the sustainable knowledge transfer process, this paper conducted a systematic literature review based on the systematic review protocol of Gough, Oliver and Thomas. The review enriched the data and gave a better understanding of the sustainable knowledge transfer process. The systematic review identified six factors, covering internal and external perspectives. However, key sustainability factors not only directly influence the knowledge transfer process (KTP) and the consortium, but are also mediated by other organisational variables.
https://doi.org/10.7545/ajip.2017.6.3.295 KSCI

An Evaluation of Applying Knowledge Base to Academic Information Service

Lee, Seok-Hyoung;Kim, Hwan-Min;Choe, Ho-Seop
- International Journal of Knowledge Content Development & Technology / v.3 no.1 / pp.81-95 / 2013
The core requirements for the intellectualization of information are to create the most efficient knowledge base for the relevant field through a series of precise text handling processes, and to plan how to apply that knowledge base to information and knowledge management and services. These processes include automatic extraction of information from documents containing knowledge from various fields, recognition of entity names, detection of core topics, analysis of the relations between the extracted information and topics, and automatic inference of new knowledge. In this paper, the knowledge base, which is a necessary core resource and comprehensive technology for the intellectualization of science and technology information, is described, and the usability of academic information services that use it is evaluated. The knowledge base proposed in this article is an amalgamation of information expression and knowledge storage, composed of identifying code systems ranging from terms to documents, and built by integrating terminologies, word intelligent networks, topic networks, classification systems, and authority data.
https://doi.org/10.5865/IJKCT.2013.3.1.081 KSCI KPUBS

A Study on the Descriptive Features and Origin of the Heart Diagram in the Donguibogam(東醫寶鑑) (『동의보감』 심장도(心臟圖)의 묘사 특징과 그 기원에 대한 연구)

Jo, Hak-jun
- Journal of Korean Medical classics / v.36 no.1 / pp.17-32 / 2023
Objectives: This paper investigates the background, meaning and origin of the descriptions of the Heart, such as 'seven orifices', 'sanmao', and 'saw-toothed four layered lines', that are unique to the diagram in the Donguibogam. Methods: First, the Heart diagram of the Donguibogam was compared with other Zangfu diagrams of the past. Materials related to unique features in the descriptions of the Heart in the Donguibogam were collected, against which descriptive features were analyzed. Results: Of the many unique features, the descriptive basis of the 'seven orifices' could be found in the Qixingban[七星板] as a physical entity reflecting basic anatomical knowledge. The 'sanmao', which is compared to the Santaixing[三台星], could be understood as a non-physical entity whose descriptive basis could be found in the Xinxuetu of the Xinching. It could be assumed that the 'saw-toothed four layered lines' are likened to the multi-layered petals or calyx of a lotus flower bud to describe the Pericardium, or to the multiple walls of a mountain fortress surrounding a palace to describe the Danzhong, which is the chest cavity. These features could be understood as the result of spiritualist influence. Conclusions: It could be concluded that Heo Jun, in his attempt to describe the Heart in more detail than previous diagrams of the Zangfu, referenced popular texts and images based on the anatomical knowledge of previous texts and added varied descriptions, resulting in a new diagram with a completely different origin.
https://doi.org/10.14369/jkmc.2023.36.1.017

Using Syntax and Shallow Semantic Analysis for Vietnamese Question Generation

Phuoc Tran;Duy Khanh Nguyen;Tram Tran;Bay Vo
- KSII Transactions on Internet and Information Systems (TIIS) / v.17 no.10 / pp.2718-2731 / 2023
This paper presents a method of using syntax and shallow semantic analysis for Vietnamese question generation (QG). Specifically, our proposed technique concentrates on investigating both the syntactic and shallow semantic structure of each sentence. The main goal of our method is to generate questions from a single sentence. These generated questions are known as factoid questions, which require short, fact-based answers. In general, syntax-based analysis is one of the most popular approaches within the QG field, but it requires expert linguistic knowledge as well as a deep understanding of syntax rules in the Vietnamese language. It is thus considered a high-cost and inefficient solution, because producing qualified syntax rules requires significant human effort. To deal with this problem, we collected the syntax rules of Vietnamese from a Vietnamese language textbook. Moreover, we also used different natural language processing (NLP) techniques to analyze Vietnamese shallow syntax and semantics for the QG task. These techniques include sentence segmentation, word segmentation, part-of-speech tagging, chunking, dependency parsing, and named entity recognition. We used human evaluation to assess the credibility of our model: we manually wrote questions from the corpus and then compared them with the automatically generated questions. The empirical evidence shows that our proposed technique performs well, producing questions that are very similar to those created by humans.
https://doi.org/10.3837/tiis.2023.10.007

Analysis Framework of Public Library as Knowledge Center (지식센터로서의 공공도서관 분석 프레임워크)

Namn, Su Hyeon
- Journal of Digital Convergence / v.11 no.1 / pp.181-190 / 2013
Recent advances in information and communication technologies have made the identity of public libraries ambiguous. Few studies have examined local community public libraries from a knowledge creation perspective. In this article we extend the applicability of concepts such as knowledge management and the knowledge city to local public libraries, whose major role needs to change from traditional book lending to that of an entity creating knowledge and social capital. Based on these concepts, we propose a framework for analyzing the public library as a center of knowledge creation. Using the framework, we analyze the Ridgewood Public Library in New Jersey to test the validity of the framework.
https://doi.org/10.14400/JDPM.2013.11.1.181

Design and Implementation of ebXML BD Authoring Tool(XDocBuilder) (ebXML 비즈니스 문서 저작도구(XDocBuilder) 설계 및 구현)

Park, Cheon-Shu;Kang, Sang-Seung;Han, Woo-Yong;Sohn, Joo-Chan
- Proceedings of the Korea Information Processing Society Conference / 2003.05b / pp.1293-1296 / 2003
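The ebXML Core Component is a general building block that is reusable across industries and in diverse environments and is not affected by context; it is the most basic element for composing business documents. Such core components become BIEs (Business Information Entities) through a business context and are expressed in forms such as XML Schema and DTD through syntax binding. A tool is therefore needed that makes it easy to author (create, validate, and edit) various kinds of XML Schema, DTD, and XML-related documents, including the business documents used in the ebXML environment. In this paper, we designed and implemented XDocBuilder, an ebXML CC-based BD (Business Document) authoring tool that meets these requirements. It consists of a general-purpose Schema/DTD-based XML document editor, a Schema document editor, and a DTD document editor.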

Korean Named Entity Recognition using Cotraining-based Learning (Cotraining 학습을 이용한 한국어 개체명 인식)

Lee, Hyun-Sook;Chung, Eui-Sok;Hwang, Yi-Gyu;Yun, Bo-Hyun
- Proceedings of the Korea Information Processing Society Conference / 2002.11a / pp.597-600 / 2002
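In this paper, we propose a named entity recognition model, which plays an important role in natural language processing applications such as information extraction, information retrieval, and document summarization. Among existing studies on Korean named entity recognition, rule-based approaches depend heavily on manually created rules and lexicons, while statistical approaches require large amounts of training data tagged with named entities, so both are limited in their portability to new domains. To overcome this, we perform co-training-based learning on training data that is not tagged with named entities, and thereby automatically extend the rules and dictionaries used for named entity recognition. In experiments, the model achieved a precision of 87.6% on documents in the economics domain.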

Clausal Segmentation for Event Sentences Using Named Entity Co-occurrence Information (개체명 공기 정보를 이용한 이벤트 문장의 단문 구조 분석)

Lim, Soo-Jong;Kim, Tae-Hyun;Hwang, Yi-Gyu;Yun, Bo-Hyun
- Proceedings of the Korea Information Processing Society Conference / 2002.11a / pp.593-596 / 2002
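Information extraction refers to selecting desired information from a set of documents written in natural language and producing it as a structured representation. When information extraction is performed sentence by sentence, a sentence that holds the information to be extracted is defined as an event sentence. To determine the structure of an event sentence and ultimately extract useful information from it, the event sentence is segmented into simple clauses and the structure of those clauses is analyzed. In this study, the clausal structure analysis uses the general characteristics of Korean sentences and the particle information associated with predicates, and falls back on co-occurrence information for sentences that cannot be analyzed with these. To exploit the fact that event sentences contain many named entities, the co-occurrence information is built as noun (named entity)-particle-predicate co-occurrences extended with named entities. Compared with general co-occurrence information, the named-entity-extended co-occurrence information improves performance on event sentences by about 2% in F-measure.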

Search Result 161, Processing Time 0.024 seconds
