Integrated Search | Korea Science

An Integrated Repository System with the Change Detection Functionality for XML Documents

박성진
- Journal of the Korea Academia-Industrial cooperation Society / Vol. 10, No. 10 / pp.2696-2707 / 2009
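Although many DBMS vendors are extending their existing products to support XML, there is a separate need for a lightweight XML repository system that is independent of DBMS type and platform. This paper describes the design and implementation of an integrated XML repository system that supports the following functions. The implemented system generates the schema structure needed to store XML documents from an XML DTD, stores the documents in database tables, lets users freely generate XML documents through XMLQL (XML Query Language), and synchronizes duplicated XML documents. Because the same data can be duplicated across various XML documents in an XML repository, an efficient change detection technique is required to keep the duplicated documents consistent. The paper proposes a message-digest-based change detection technique to maintain consistency between client XML documents and the XML data in the repository.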
https://doi.org/10.5762/KAIS.2009.10.10.2696
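
The abstract does not say how the message digests are computed or compared, so the following is only a minimal sketch of the general idea, under stated assumptions: each XML element gets a SHA-256 digest over its tag, attributes, text, and child digests, and two versions are compared top-down, descending only into subtrees whose digests differ. The function names `element_digest` and `changed_paths` and the positional matching of children are illustrative assumptions, not the paper's method.

```python
import hashlib
import xml.etree.ElementTree as ET


def element_digest(elem):
    """Digest of an element: tag, sorted attributes, stripped text, child digests."""
    h = hashlib.sha256()
    h.update(elem.tag.encode("utf-8") + b"\0")
    for key in sorted(elem.attrib):
        h.update(f"{key}={elem.attrib[key]}".encode("utf-8") + b"\0")
    h.update((elem.text or "").strip().encode("utf-8") + b"\0")
    for child in elem:
        h.update(element_digest(child).encode("utf-8"))
    return h.hexdigest()


def changed_paths(old, new, path=""):
    """Return paths of the smallest subtrees whose digests differ.

    Assumption: children are matched by position; a real system would
    need a more robust matching of inserted or reordered children.
    """
    here = f"{path}/{new.tag}"
    if element_digest(old) == element_digest(new):
        return []  # identical subtree, nothing to synchronize
    old_kids, new_kids = list(old), list(new)
    if old.tag != new.tag or len(old_kids) != len(new_kids):
        return [here]
    found = []
    for i, (o, n) in enumerate(zip(old_kids, new_kids)):
        found += changed_paths(o, n, f"{here}[{i}]")
    return found or [here]  # no child differs: the change is in this element


client = ET.fromstring("<order><item>pen</item><qty>2</qty></order>")
repository = ET.fromstring("<order><item>pen</item><qty>3</qty></order>")
print(changed_paths(client, repository))  # ['/order[1]/qty']
```

Because unchanged subtrees produce equal digests, the comparison can skip them entirely, which is what makes a digest-based check cheaper than a full text comparison when most of a document is unchanged.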

Retrieval Scheme of XML Documents Using Link Queries

문찬호;강현철
- The KIPS Transactions: Part D / Vol. 8D, No. 4 / pp.313-326 / 2001
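XML, proposed as the next-generation standard for describing Web documents, is widely used in many Web-based applications, and XML documents on the Web are connected to one another through hyperlinks. Most XML research to date has targeted XML storage systems for the efficient storage, management, and retrieval of XML documents, while research on query languages that support XML links, or on XML retrieval systems that exploit links, remains insufficient. This paper presents an extension of an XML query language for expressing XML link queries, together with a link query processing technique. A link query retrieves the contents of one XML document (the query document) and of the XML documents referenced by links within it (the referenced documents). At present, retrieving referenced documents requires repeatedly creating a query against each referenced document by hand, processing it, and returning its result. The goal of the link query processing in this paper is to let a single query input also yield the retrieval results for the referenced document(s) without additional manual work. A performance comparison between the existing manual approach and the proposed link query processing showed that query processing time decreased relative to the manual approach as the number of links to referenced documents grew, and also as more of the referenced documents resided at the site storing the query document.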

Building Topic Hierarchy of e-Documents using Text Mining Technology

Kim, Han-Joon
- Society for e-Business Studies: Conference Proceedings / 2004 e-Biz World Conference / pp.294-301 / 2004
Text-mining approach to e-document organization based on topic hierarchy (machine learning and information theory-based):
- 'Category (topic) discovery' problem → document bundle-based user-constraint document clustering
- 'Automatic categorization' problem → accelerated EM with CU-based active learning
- 'Hierarchy construction' problem → unsupervised learning of category subsumption relation

An Algorithm for Detecting Leak of Defaced Confidential Information Based on SVDD

길지호;남기효;강형석;김성인
- Journal of the Korea Institute of Information Security & Cryptology / Vol. 20, No. 1 / pp.105-111 / 2010
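This paper presents an algorithm that accurately detects attempts to leak protected confidential documents after they have been altered in various ways. Insiders alter confidential documents in various ways before attempting to leak them, but existing research on leak detection relies on similarity measures, which do not accurately reflect the many forms of alteration applied to confidential information and therefore suffer from low detection accuracy. To address this, the study presents v-SVDD, a new confidential-document leak detection algorithm that uses SVDD (Support Vector Data Description). Compared with the results of existing studies, the proposed algorithm shows superior accuracy in detecting leaks of altered documents.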
https://doi.org/10.13089/JKIISC.2010.20.1.105

Clustering Techniques for XML Data Using Data Mining

Kim, Chun-Sik
- Society for e-Business Studies: Conference Proceedings / e-Biz World Conference 2005 / pp.189-194 / 2005
Many studies have been conducted to classify documents and to extract useful information from them. However, most search engines use a keyword-based method, which does not search and classify documents effectively. This paper identifies the structures of XML documents, exploiting the fact that XML documents are structured, using the set-theoretic approach suggested by Broder, and tests the clustering of XML documents by applying a k-nearest neighbor algorithm. In addition, this study investigates the effectiveness of the clustering technique for large-scale data, compared to the existing bitmap method, by applying a test that reveals the difference between clause-based documents instead of using a vector representation, in order to measure the similarity between the existing methods.
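
The set-theoretic measure usually attributed to Broder is resemblance, the size of the intersection of two sets divided by the size of their union. The abstract does not say which sets are built from an XML document, so the sketch below assumes one plausible choice: the set of root-to-element tag paths. The names `tag_paths` and `resemblance` are illustrative, and the k-nearest neighbor step is not shown.

```python
import xml.etree.ElementTree as ET


def tag_paths(xml_text):
    """Set of root-to-element tag paths (an assumed structural summary)."""
    paths = set()

    def walk(elem, prefix):
        path = f"{prefix}/{elem.tag}"
        paths.add(path)
        for child in elem:
            walk(child, path)

    walk(ET.fromstring(xml_text), "")
    return paths


def resemblance(a, b):
    """Broder-style resemblance: |A ∩ B| / |A ∪ B|."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


doc1 = tag_paths("<book><title/><author/></book>")
doc2 = tag_paths("<book><title/><price/></book>")
print(resemblance(doc1, doc2))  # 0.5 (2 shared paths out of 4 distinct)
```

A structural similarity of this kind could then serve as the distance function for a nearest-neighbor clustering step.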

Storing and Retrieval of Multiversion XML Documents in Relational Databases

Jin Min
- Journal of Korea Multimedia Society / Vol. 9, No. 6 / pp.700-708 / 2006
In this paper, we propose a method of managing versions of XML documents by using relational databases. Data structures based on relational tables are developed for accommodating versions of XML documents. The structure information, the contents, and the changes of the versions are stored in relational tables. Thus, SQL can be exploited in queries such as horizontal queries, vertical queries, and delta queries without parsing the documents. The structure information and contents of all versions are not represented explicitly in the tables; only those of certain versions, called snapshot versions, are. Other versions are represented indirectly as sequences of operations that are stored in the corresponding tables. The experiment shows the space performance.
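
The snapshot-plus-operations idea can be sketched as follows. This is an illustration only: the paper stores everything in relational tables and queries it with SQL, whereas this sketch uses in-memory dictionaries, and the `reconstruct` function, the path-to-value document model, and the operation names are all assumptions.

```python
def reconstruct(snapshots, deltas, version):
    """Rebuild a version from the nearest earlier snapshot plus stored operations.

    snapshots: {version: {path: value}} for explicitly stored versions
    deltas:    {version: [(op, path, value), ...]} for all other versions,
               where op is "insert", "update", or "delete" (assumed names)
    """
    base = max(v for v in snapshots if v <= version)
    doc = dict(snapshots[base])  # copy so the stored snapshot is not mutated
    for v in range(base + 1, version + 1):
        for op, path, value in deltas[v]:
            if op == "delete":
                doc.pop(path, None)
            else:  # "insert" and "update" both set the value at the path
                doc[path] = value
    return doc


snapshots = {1: {"/a/b": "x"}}
deltas = {
    2: [("update", "/a/b", "y")],
    3: [("insert", "/a/c", "z"), ("delete", "/a/b", None)],
}
print(reconstruct(snapshots, deltas, 3))  # {'/a/c': 'z'}
```

Storing only some versions in full trades reconstruction time for space, which is consistent with the abstract's focus on space performance.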

Development of Common Document Structure based on XML for Representing Mechanical Part Assembly Information

정태형;박승현;윤성원
- Korean Society of Machine Tool Engineers: Conference Proceedings / Proceedings of the Korean Society of Machine Tool Engineers 2002 Autumn Conference / pp.359-364 / 2002
In engineering design environments, it is hard to link design data and systems because their types are disparate. The importance of metadata has therefore increased. Some research has been conducted to develop metadata, but the resulting metadata cannot interact with other metadata and is difficult to extend. The purpose of this paper is to develop a common metadata structure that represents the general information of a mechanical part assembly using XML, and to use it as a base document for integrating design data and systems. It is composed of part and assembly documents. A part document represents the information of a part independently of part type. An assembly document represents the location of the part documents that compose an assembly. Common documents can serve as a broker between design data and systems and improve the interpretability and reusability of documents. We applied the developed common document structure to a 2-stage spur gear drive.

Single Pass Algorithm for Text Clustering by Encoding Documents into Tables

Jo, Tae-Ho
- Journal of Korea Multimedia Society / Vol. 11, No. 12 / pp.1749-1757 / 2008
This research proposes a modified version of the single pass algorithm specialized for text clustering. Encoding documents into numerical vectors in order to use the traditional single pass algorithm causes two main problems: huge dimensionality and sparse distribution. To address these two problems, this research modifies the single pass algorithm into a version in which documents are encoded not into numerical vectors but into other forms. In the proposed version, documents are mapped into tables, and an operation on two tables is defined so that the single pass algorithm can be applied. The goal of this research is to improve the performance of the single pass algorithm for text clustering by modifying it into this specialized version.
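
A rough sketch of single pass clustering over table-encoded documents is shown below. The abstract does not define the table format or the operation on two tables, so this sketch assumes a table is a word-to-count mapping and that similarity is the ratio of shared counts to combined counts. The names `to_table`, `table_similarity`, and `single_pass` and the threshold value are illustrative assumptions.

```python
from collections import Counter


def to_table(text):
    """Encode a document as a table of word -> count (assumed encoding)."""
    return Counter(text.lower().split())


def table_similarity(t1, t2):
    """Assumed operation on two tables: shared counts over combined counts."""
    shared = sum((t1 & t2).values())  # per-word minimum
    total = sum((t1 | t2).values())   # per-word maximum
    return shared / total if total else 0.0


def single_pass(docs, threshold=0.3):
    """Scan documents once; join the most similar cluster or start a new one."""
    clusters = []  # each cluster: {"table": Counter, "members": [doc indices]}
    for i, doc in enumerate(docs):
        table = to_table(doc)
        best, best_sim = None, 0.0
        for cluster in clusters:
            sim = table_similarity(table, cluster["table"])
            if sim > best_sim:
                best, best_sim = cluster, sim
        if best is not None and best_sim >= threshold:
            best["table"] += table  # merge the document into the cluster table
            best["members"].append(i)
        else:
            clusters.append({"table": table, "members": [i]})
    return [c["members"] for c in clusters]


docs = ["xml query language", "xml query processing", "gear drive design"]
print(single_pass(docs))  # [[0, 1], [2]]
```

Tables hold only the words that actually occur in a document, so they avoid the fixed, mostly empty dimensions of a vector representation.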

The Development of the Drawing Information Management System Based on Group Technology

H.S. Moon;Kim, S.H.
- Journal of the Korean Society for Precision Engineering / Vol. 14, No. 1 / pp.58-68 / 1997
In order to provide economical, high-quality products to customers in a timely manner, companies have made considerable efforts to shorten the time spent on engineering design and information management. As part of this effort, we have developed the Drawing Information Management System (DIMS) based on GT (Group Technology), which can decrease design processing time through speedy and rational management of design processes. The characteristics of DIMS are as follows. First, the concept of Concurrent Engineering was applied to DIMS. Through a LAN, reviewers are able to attach comments to electronic documents using annotation functions called Mark-up. The reviewer annotations are collected and combined with the original document to revise the documents. Second, we have developed a Classification and Coding (C&C) system suitable for electronic component parts based on GT. The C&C system groups both parts and drawings with similar characteristics into families and helps users search existing documents or create new drawings promptly. Finally, DIMS provides the Engineering BOM (Bill of Material) using the concept of Family BOM based on model options.

Document Clustering Methods using Hierarchy of Document Contents

황명권;배용근;김판구
- Journal of the Korea Institute of Information and Communication Engineering / Vol. 10, No. 12 / pp.2335-2342 / 2006
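With the explosive growth of the Web, a vast amount of information has accumulated on it, and text documents in particular are the format most easily and most widely used by people. Much research has been done on the efficient retrieval of text documents, and methods based on probability, statistical techniques, vector similarity, and Bayesian automatic document classification have been proposed. However, these existing methods cannot accurately reflect the characteristics of documents and do not support semantic retrieval. To improve on the existing methods that classify documents in advance, this paper proposes a new document classification measure for finding similar documents semantically and presents a way to apply it. The method represents the content of a document as a semantic hierarchy, assigns weights to important domains, and computes similarity using the domain weights between documents and the degree of concept agreement within domains.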

Search results: 1,076 items (processing time: 0.028 s)
