Search | Korea Science

Design of Data Structures and Algorithms for Efficient Retrieval of Structured Documents (구조적 문서의 효율적인 검색을 위한 자료 구조와 알고리즘 설계)

김영자;정채영;김현주;배종민
- Proceedings of the Korean Information Science Society Conference / 1999.10a / pp.60-62 / 1999
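Retrieval systems for structured documents created with markup languages such as SGML or XML support retrieval of arbitrary parts of a document. Processing the various types of user queries that are based on document structure increases the amount of memory needed for the index. To reduce that memory, it must be possible to compute the ID of the target node from the ID of an indexed node. In that case, however, each node's ID is fixed, so a document update forces many parts to be modified and the update overhead becomes large. This paper proposes a whole-document instance tree structure and builds node IDs on top of it, presenting an index structure and query processing algorithms that minimize the ripple effect of updates such as insertions and deletions while preserving indexing and retrieval efficiency.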
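The idea of computing one node's ID from another's can be illustrated with a well-known numbering over a virtual complete k-ary tree. This is only a sketch of that general technique, not the whole-document instance tree proposed in the paper; the function names and the fan-out parameter `k` are assumptions made for illustration.

```python
from typing import List, Optional


def parent_id(node_id: int, k: int) -> Optional[int]:
    """ID of the parent in a virtual complete k-ary tree whose root has ID 1."""
    if node_id == 1:
        return None  # the root has no parent
    return (node_id - 2) // k + 1


def child_id(node_id: int, j: int, k: int) -> int:
    """ID of the j-th child (1-based, j <= k) of node_id."""
    if not 1 <= j <= k:
        raise ValueError("j must be between 1 and k")
    return k * (node_id - 1) + j + 1


def ancestors(node_id: int, k: int) -> List[int]:
    """All ancestor IDs, nearest first, computed from the ID alone."""
    result = []
    current = parent_id(node_id, k)
    while current is not None:
        result.append(current)
        current = parent_id(current, k)
    return result
```

Because IDs are derived arithmetically, only some nodes need to be indexed. The cost is the one the abstract describes: IDs are tied to positions, so inserting a sibling shifts the IDs of the nodes after it, and exceeding the fan-out `k` forces the whole tree to be renumbered.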

A Prime Numbering Scheme with Sibling-Order Value for Efficient Labeling in Dynamic XML Documents (동적 XML 문서에서 효과적인 레이블링을 위해 형제순서 값을 갖는 프라임 넘버링 기법)

Lee, Kang-Woo;Lee, Joon-Dong
- Journal of the Korea Society of Computer and Information / v.12 no.5 / pp.65-72 / 2007
Labeling schemes that do not account for frequent updates in dynamic XML documents need a relabeling process to reflect the changed label information whenever the tree of the XML document is updated. This is a considerable expense for dynamic XML documents, which can be updated frequently. To solve this problem, a prime number labeling scheme that needs no relabeling process can be used. However, the prime number labeling scheme does not take into account the need to update the sibling order of nodes in the XML document tree. That update is costly because most of the XML document tree has to be searched again and rewritten. In this paper, we propose a prime number labeling scheme with a sibling-order value that maintains the sibling order without searching again or rewriting the XML document tree.
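A minimal sketch of basic prime number labeling may help show why insertions need no relabeling. It assumes the commonly described form of the scheme, in which each node's label is its parent's label multiplied by a fresh prime and the root is labeled 1; the class and function names are hypothetical, and the sibling-order value proposed in the paper is not modeled here.

```python
from itertools import count
from typing import Iterator


def primes() -> Iterator[int]:
    """Yield 2, 3, 5, ... by trial division (adequate for a small demo)."""
    found = []
    for n in count(2):
        if all(n % p for p in found):
            found.append(n)
            yield n


class PrimeLabeler:
    """Hands out labels of the form parent_label * unused_prime."""

    def __init__(self) -> None:
        self._primes = primes()
        self.root = 1  # assumed root label; it divides every other label

    def add_child(self, parent_label: int) -> int:
        # A new node never changes the labels of existing nodes.
        return parent_label * next(self._primes)


def is_ancestor(ancestor_label: int, descendant_label: int) -> bool:
    """Ancestor test by divisibility of labels."""
    return (ancestor_label != descendant_label
            and descendant_label % ancestor_label == 0)
```

The labels answer ancestor and descendant questions, but nothing in them records which sibling comes first, and that is the gap the sibling-order value is meant to close.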

A Study on Ranking Decision of Retrieved Documents Using User Profile (사용자 프로파일을 이용한 문서 검색순위 결정에 관한 연구)

Kim, Hyeong-Gyun;Kim, Yong-Ho;Lee, Sang-Beom
- Proceedings of the Korean Institute of Information and Communication Sciences Conference / v.9 no.2 / pp.993-996 / 2005
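In this paper, we determine term weights by analyzing the relationships among retrieved documents in the same field, centered on a single tendency that those documents share. To express the user's interests and preferences properly, we build and use a user profile rather than a query. The user profile consists of a term list and a preference vector for each field of interest, and it is trained in a user-centered way with two methods: 'update by user access' and 'update using the user profile'. 'Update by user access' applies when the user has knowledge of the subject field; in experiments it considerably reduced the number of updates needed before the user profile properly represented the user's preferences. 'Update using the user profile' is performed at the start of updating and makes the differences between preference values clearer.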

An Efficient Updates Processing Using Labeling Scheme In Dynamic Ordered XML Trees (동적 순서 XML 트리에서 레이블링 기법을 이용한 효율적인 수정처리)

Lee, Kang-Woo
- Journal of the Korea Institute of Information and Communication Engineering / v.12 no.12 / pp.2219-2225 / 2008
Labeling schemes that do not account for frequent updates in dynamic XML documents need a relabeling process to reflect the changed label information whenever the tree of the XML document is updated. This is a considerable expense for dynamic XML documents, which can be updated frequently. To solve this problem, a prime number labeling scheme that needs no relabeling process can be used. However, the prime number labeling scheme does not take into account the need to update the sibling order of nodes in the XML tree of the document. That update is costly because most of the XML tree has to be relabeled and recalculated. In this paper, we propose a prime number labeling scheme with a sibling-order value that maintains the sibling order without relabeling or recalculating the XML tree of the document.
https://doi.org/10.6109/jkiice.2008.12.12.2219

An Index Method for Storing and Extracting XML Documents (XML 문서의 저장과 추출을 위한 색인 기법)

Kim Woosaeng;Song Jungsuk
- Journal of Korea Multimedia Society / v.8 no.2 / pp.154-163 / 2005
Because most index techniques studied so far for XML documents use an absolute coordinate system, update operations impose a heavy burden. To express the structural relationships among elements, attributes, and text, the coordinate structure must be reconstructed. Because the reconstruction cascades through the entire XML document rather than being limited to the node being changed, frequent update operations can cause serious performance problems. In this paper, we propose an index technique based on an extensible index that does not cause serious performance degradation. It limits the number of nodes that participate in the reconstruction process and improves overall performance. The extensible index also evaluates containment relationship queries with a simple SQL expression.
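The update burden of absolute coordinates can be sketched as follows. The example assumes a common (start, end) region numbering assigned by depth-first traversal; the tree representation, the function names, and the requirement that element names be unique are illustrative assumptions, and the paper's extensible index is not reproduced here.

```python
from typing import Dict, List, Optional, Tuple

# A node is (name, [children]); names are assumed unique for this demo.
Node = Tuple[str, List["Node"]]
Regions = Dict[str, Tuple[int, int]]


def assign_regions(node: Node, regions: Optional[Regions] = None,
                   next_pos: int = 1) -> Tuple[Regions, int]:
    """Give every node absolute (start, end) positions, depth first."""
    if regions is None:
        regions = {}
    name, children = node
    start = next_pos
    next_pos += 1
    for child in children:
        regions, next_pos = assign_regions(child, regions, next_pos)
    regions[name] = (start, next_pos)
    return regions, next_pos + 1


def contains(regions: Regions, ancestor: str, descendant: str) -> bool:
    """Containment holds when the descendant's region is strictly nested."""
    a_start, a_end = regions[ancestor]
    d_start, d_end = regions[descendant]
    return a_start < d_start and d_end < a_end


before: Node = ("book", [("title", []), ("chapter", [("para", [])])])
after: Node = ("book", [("author", []), ("title", []),
                        ("chapter", [("para", [])])])
old, _ = assign_regions(before)
new, _ = assign_regions(after)
# Existing nodes whose coordinates shifted after one insertion.
shifted = sorted(name for name in old if old[name] != new[name])
```

In this small tree, inserting a single `author` element changes the coordinates of every node that already existed, which is the cascading reconstruction the abstract describes.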

An Efficient Validation Method for XML Data Updates (XML 데이타 갱신에 대한 효율적인 유효 검증 기법)

Lee, Ji-Hyun;Park, Myung-Jae;Chung, Chin-Wan
- Journal of KIISE:Databases / v.34 no.2 / pp.153-165 / 2007
XML is widely used in various applications as the standard for representing and exchanging data on the Web. XML Schema is the standard schema language for XML, and an XML document generated based on an XML Schema is called a 'valid XML document'. XML Schema validity should still be guaranteed after the XML document is updated. In this paper, we design an efficient method that verifies XML Schema validity before an update and thereby eliminates unnecessary validations. We also propose a validation algorithm for each update operation. In addition, we propose a mapping between XML data and XML Schema and a storage method for XML Schema so that the schema information needed for validation can be extracted efficiently. Finally, we compare validation performance across the storage methods.

An Indexing Scheme for Incremental Updating in Structured Document (구조화 문서에서 점진적 갱신을 위한 인텍싱 기법)

김성완;김선경;이재호;임해철
- Proceedings of the Korean Information Science Society Conference / 2002.04b / pp.190-192 / 2002
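Efficient processing of structured documents requires an indexing scheme that supports fast, direct access to arbitrary elements of a document. Previous studies have proposed techniques that adapt the inverted lists or signature files used in traditional information retrieval. However, those index structures are suited to static environments, and a dynamic change to a document imposes the burden of rebuilding the entire index. In this paper, we design an inverted-list-based index structure that supports incremental updates for dynamic changes to documents, such as structural changes and content changes.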

XML Numbering Schemes for Efficient Update of XML Documents (XML 문서의 효율적인 변경을 위한 XML 번호체계)

Kim Young-Hyun;Kang Hyunchul
- Proceedings of the Korea Information Processing Society Conference / 2004.11a / pp.55-58 / 2004
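As the importance of XML has grown, techniques for managing XML data efficiently have been actively studied. Standardization of XML query languages has also progressed, and XQuery has emerged as the leading standard. However, for XQuery and similar languages to become complete XML query languages they must provide update operations, yet standardization of XML update languages and research on XML update processing techniques remain limited. This paper addresses techniques for updating XML documents in order to support XML-database-based web applications such as e-Business efficiently. We present the RN (RmdEid, NextChildEid) numbering scheme and the R (RmdEid) numbering scheme for efficient updates of XML documents. Which XML numbering scheme is used to store XML documents has an important effect on update operations. These numbering schemes are distinguished by differences in their table schemas in the database. For each numbering scheme, we implemented an XML database system and report the results of evaluating its performance and space overhead.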

An Efficient Detection of Conflicting Updating in valid XMLs (유효한 XML 환경에서의 효율적인 갱신 충돌 탐지 기법)

Byun, Chang-Woo;Yun, Il-Kook;Park, Seog
- Proceedings of the Korean Information Science Society Conference / 2007.06c / pp.1-6 / 2007
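With the emergence of XML-native database systems and their support for update operations, the need for validity checking of update operations and for efficient detection of conflicts between update operations has grown. This need is especially acute for valid XML documents, which, unlike well-formed XML documents, must comply with schema constraints. In particular, efficient detection of update conflicts is essential for query optimization in the query compiler and for achieving high concurrency in transaction management. This paper defines conflicts between read-update and update-update operations and proposes a technique for detecting conflicts efficiently in a valid XML environment.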

Study on Update Processing for Secure XML documents including Invisible Ancestor (보이지 않는 조상을 포함하는 안전한 XML 문서의 갱신 질의 처리에 대한 연구)

Byun Chang-woo;Park Seog
- Proceedings of the Korean Information Science Society Conference / 2005.07b / pp.52-54 / 2005
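As XML has come to be used as the database of web information systems, fine-grained access control models have been actively studied from the read-mode perspective in order to process shared data efficiently. In query processing research, standardization of update queries for XML databases is under way. This paper defines the invisible ancestor problem that arises when update query operations are added as operation modes of a fine-grained access control model. As considerations for solving it, we define the properties to be taken into account when updating XML documents that contain invisible ancestor nodes as update confidentiality, update integrity, and update consistency, and we summarize the situations in which these properties may be violated during update operations.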
