Search | Korea Science

Mining of Frequent Structures over Streaming XML Data (스트리밍 XML 데이터의 빈발 구조 마이닝)

Hwang, Jeong-Hee
- The KIPS Transactions:PartD
- /
- v.15D no.1
- /
- pp.23-30
- /
- 2008
The basic research of context aware in ubiquitous environment is an internet technique and XML. The XML data of continuous stream type are popular in network application through the internet. And also there are researches related to query processing for streaming XML data. As a basic research to efficiently query, we propose not only a labeled ordered tree model representing the XML but also a mining method to extract frequent structures from streaming XML data. That is, XML data to continuously be input are modeled by a stream tree which is called by XFP_tree and we exactly extract the frequent structures from the XFP_tree of current window to mine recent data. The proposed method can be applied to the basis of the query processing and index method for XML stream data.
https://doi.org/10.3745/KIPSTD.2008.15-D.1.23 인용 PDF KSCI

XML Query Optimization based on Signature (시그니처를 이용한 XML 질의 최적화)

Park, Sang-Won;Kim, Hyeong-Ju
- Journal of KIISE:Databases
- /
- v.28 no.1
- /
- pp.104-114
- /
- 2001
XML은 웹에서 데이터를 주고받는 표준으로 새로이 등장하고 있다. 이러한 데이터를 잘 처리하기 위하여 데이터베이스의 도움은 필연적이다. XML을 처리하는 데이터베이스에서 데이터는 트리 형태로 저장되며 질의어는 정규 경로식(regular path expression)의 특징을 가지고 있다. 이때 질의 처리는 트리의 각 노드 탐색을 통하여 수행된다. 본 논문에서는 시그니처를 이용한 저장 방법 및 질의 처리를 통하여 질의 수행 시 각 노드의 탐색 횟수를 줄여 질의 수행을 빠르게 할 수 있게 한다. 뿐만 아니라 반 구조적 데이터에 대한 인덱스도 데이터베이스 내에서 반구조적 데이터로 표현된다. 그러므로 본 논문에서 제안한 방법을 인덱스 노드에도 적용할 수 있다. 이와 같이 본 논문에서 제안한 방법은 데이터 객체와 인덱스 객체의 탐색을 줄임으로서 XML 질의를 빠르게 처리할 수 있게 한다.
PDF

The Efficient XML Bottom- Up Query Processing (효율적인 XML 상향식 질의 처리)

Seo, Dong-Min;Yoo, Jae-Soo
- Proceedings of the Korea Information Processing Society Conference
- /
- 2006.11a
- /
- pp.345-348
- /
- 2006
경로 질의는 XML에서 가장 일반적으로 사용되는 질의이며, 기존에 경로 질의를 효율적으로 처리하기 위한 다양한 색인 기법들이 연구되었다. 최근에는 suffix 트리를 사용하는 구조 조인 기법들이 제안되어 경로 질의 성능을 향상시키고 있다. ViST는 가장 대표적인 구조 조인 기법으로 XML 문서에 대한 검색 시간을 줄이기 위해 suffix 트리와 $B^{+)$ 트리를 사용한다. 하지만, ViST는 suffix 트리에 최적화 되지 못한 번호 부여 기법을 사용함으로써, 질의 처리 시 불필요한 조인과 디스크 접근이 야기된다. 이와 같은 문제들은 경로 질의 성능을 급격히 감소시킨다. 따라서 본 논문에서는 ViST의 문제들을 해결하기 위한 새로운 색인 구조를 제안하다. 제안하는 색인 구조는 suffix 트리를 사용하는 구조 조인 기법의 성능을 향상시키기 위해서 최적화된 번호 부여 기법과 상향식 질외 처리 기법을 사용한다. 본 논문의 성능 평가에서는 제안하는 색인 구조를 ViST와 비교하여, 제안하는 색인 구조가 와일드-카드('$^{\ast}$와 '//')를 포함하는 다양한 단일 경로 질의와 분기 질의에 대해 향상된 성능을 나타냄을 보인다.
PDF

Design and Implementation of a XML Compression Algorithm Supporting Query Processing for Compressed Documents (압축된 문서에 대한 질의 처리를 지원하는 XML 압축 알고리즘의 설계 및 구현)

이석재;강영준;유재수;조기형
- The Journal of the Korea Contents Association
- /
- v.4 no.1
- /
- pp.90-99
- /
- 2004
With the spread of internet, the digitalization and the knowledge informatization are in progress rapidly. Specially, numerous users make the various works and use the services on the web. For the most part, these works make use of the XML The XML shines the reusing of the documents because it is separated from contents and sues. Also, it can re-define the logic structure of the document for requirement of the developer. However, the XML document’s size is much larger than common text document because it handles the document type and adds numerous tags for representing structure of the document. To utilize the limited storage devices of Palmtop, PDA and so u, it is necessary to compress and handle the documents efficiently. Recently, the compression techniques for efficiently handling and compressing the XML documents are under way to solve this problem. But most of the existing researches don't support the query processing for the compressed XML documents. In this paper, we design and implement the XML compression algorithm that compresses the XML document and Processes the query of compressed XML document faster and more efficiently than previous techniques.
PDF

A Tree-structured XPath Query Reduction Scheme for Enhancing XML Query Processing Performance (XML 질의의 수행성능 향상을 위한 트리 구조 XPath 질의의 축약 기법에 관한 연구)

Lee, Min-Soo;Kim, Yun-Mi;Song, Soo-Kyung
- The KIPS Transactions:PartD
- /
- v.14D no.6
- /
- pp.585-596
- /
- 2007
XML data generally consists of a hierarchical tree-structure which is reflected in mechanisms to store and retrieve XML data. Therefore, when storing XML data in the database, the hierarchical relationships among the XML elements are taken into consideration during the restructuring and storing of the XML data. Also, in order to support the search queries from the user, a mechanism is needed to compute the hierarchical relationship between the element structures specified by the query. The structural join operation is one solution to this problem, and is an efficient computation method for hierarchical relationships in an in database based on the node numbering scheme. However, in order to process a tree structured XML query which contains a complex nested hierarchical relationship it still needs to carry out multiple structural joins and results in another problem of having a high query execution cost. Therefore, in this paper we provide a preprocessing mechanism for effectively reducing the cost of multiple nested structural joins by applying the concept of equivalence classes and suggest a query path reduction algorithm to shorten the path query which consists of a regular expression. The mechanism is especially devised to reduce path queries containing branch nodes. The experimental results show that the proposed algorithm can reduce the time requited for processing the path queries to 1/3 of the original execution time.
https://doi.org/10.3745/KIPSTD.2007.14-D.6.585 인용 PDF KSCI

A Study of Join and Communication Cost for processing Integrated XQuery queries over Distributed Environment (분산 환경에서 통합 XQuery 질의 처리를 위한 조인과 통신비용에 대한 연구)

Choi, Seong-Il;Park, Jong-Hyun;Kang, Ji-Hoon
- Proceedings of the Korean Information Science Society Conference
- /
- 2007.06c
- /
- pp.59-63
- /
- 2007
XML은 웹 상에서 정보교환의 표준이며, 이종의 데이터를 보유하고 있는 지역 시스템들은 XML View를 이용하여 정보를 공개한다. 사용자는 XML을 위한 표준 질의어인 XQuery를 사용하여 분산된 XML View들을 대상으로 통합 XQuery질의를 생성할 수 있다. 이렇게 생성된 통합 XQuery질의는 자연스럽게 다른 지역시스템들 사이의 조인을 포함 할 수 있다. 조인은 비용이 많이 드는 연산자이므로 조인을 효율적으로 처리하는 것은 전체 질의의 성능에 큰 영향을 준다. 그러므로 조인을 효율적으로 처리하기 위한 비용을 결정하는 연구가 중요하다고 할 수 있다. SQL에서는 이와 같은 연구들이 많이 존재하며, 분산 환경에서의 조인을 효율적으로 처리하기 위해 크게 두 가지 비용을 고려한다. 그 중 하나는 지역시스템 내에서 조인을 처리하는 조인비용이며, 나머지 하나는 조인을 수행하기 위하여 다른 지역시스템으로 데이터를 전송하는 통신비용이다. 이 두 비용은 분산 조인을 처리하기 위한 중요한 요소이므로[6,7,8] 이를 측정하는 것은 통합 질의 처리를 위해서 필요한 연구라 할 수 있다. 그러나 테이블 구조의 데이터를 검색하기 위한 SQL 의 방법들을 구조적인 XML 데이터를 검색하기 위한 XQuery질의를 위해서 그대로 사용하는 것은 어려운 일이다. 본 논문에서는 분산 환경에서 통합 XQuery질의의 조인을 효율적으로 처리하기 위해 조인비용과 통신비용을 측정하는 방법을 제안한다. 본 논문의 기여는 앞서 기술한 문제점을 해결하여, XQuery 질의의 조인을 효율적으로 처리하기 위한 비용을 측정할 수 있도록 한다.
PDF

An Efficient Index Structure for Bottom-Up Query Processing of XML Documents (XML 문서의 상향식 질의처리를 지원하는 효율적인 색인구조)

Seo Dong-Min;Kim Eun-Jae;Seong Dong-Ook;Yoo Jae-Soo;Cho Ki-Hyung
- Journal of Internet Computing and Services
- /
- v.7 no.4
- /
- pp.101-113
- /
- 2006
A path query is used in XML. Several index structures have been studied for processing the path query efficiently. In recent. the index schemes using suffix tree with structure join method were proposed. ViST is the most representative method among such methods. ViST processes the query using suffix tree and uses B+-tree to reduce the search time of the documents. However, it significantly degrades the search performance when processing the path query. The reason is that it regards the element that is not ancestor-descendant relation in the document as a descendent. In this paper, we propose an efficient index structure to solve the problem of ViST. The query processing method suitable to the index structure is also proposed. It is shown through various experiments that the proposed index structure outperforms the existing index structure in terms of the query processing time.
PDF

Normalization of XQuery Queries for Efficient XML Query Processing (효율적인 XML질의 처리를 위한 XQuery 질의의 정규화)

김서영;이기훈;황규영
- Journal of KIISE:Computing Practices and Letters
- /
- v.10 no.5
- /
- pp.419-433
- /
- 2004
As XML becomes a standard for data representation, integration, and exchange on the Web, several XML query languages have been proposed. World Wide Web Consortium(W3C) has proposed XQuery as a standard for the XML query language. Like SQL, XQuery allows nested queries. Thus, normalization rules have been proposed to transform nested XQuery queries to semantically equivalent ones that could be executed more efficiently. However, previous normalization rules are applicable only to restricted forms of nested XQuery queries. Specifically, they can not handle FLWR expressions having nested expressions in the where clause. In this paper, we propose normalization rules for XQuery queries by extending those for SQL queries. Our proposed rules can handle FLWR expressions haying nested expressions in every clause. The major contributions of this paper are as follows. First, we classily nesting types of XQuery queries according to the existence of correlation and aggregation. We then propose normalization rules for each nesting type. Second, we propose detailed algorithms that apply the normalization rules to nested XQuery queries.
PDF KSCI

Genealogy-based Indexing Technique for XML Documents (XML문서를 위한 족보 기반 인덱싱 기법)

이월영;용환승
- Journal of KIISE:Databases
- /
- v.31 no.1
- /
- pp.72-81
- /
- 2004
Theses days, a number of data over the Internet are represented using XML because of a virtue of XML. In proportion to the increase of XML data, query processing techniques are required that support quickly and efficiently the diverse queries to search the useful information on XML documents. But, up to now, the researches handling queries for XML data are methodologies focusing on how to process regular path expressions. Therefore, we have developed a new genealogy-based indexing technique to solve various queries such as not only regular path expression but also simple path expression, path expression referencing other elements, and so on. Also, we have applied this technique on object-relational model and evaluated the performance for many documents and various query types. The result shows improved performance in comparison with the other storage techniques.
PDF KSCI

Efficient Structural Join Technique using the Level Information of Indexed XML Documents (색인된 XML 문서에서 레벨 정보를 이용한 효과적인 구조 조인 기법)

Lee Yunho;Choi Ilhwan;Kim Jongik;Kim Hyoung-Joo
- Journal of KIISE:Databases
- /
- v.32 no.6
- /
- pp.641-649
- /
- 2005
As XML is widely used with the development of internet, many researches on the XML storage and query processing have been done Several index techniques have been proposed to efficiently process XML path queries. Recently, structural join has received murk attention as a method to protest the path query. Structural join technique process a path query by identifying the containment relationship of elements. Especially, it has an advantage that we can get the result set by simply comparing related elements only instead of scanning whole document. However during the comparison process, unnecessary elements that are not included in the result set can be scanned. So we propose a new technique, the level structural join. In this technique, we use both the relationship and the level distribution of elements in the path query. Using this technique, we tao improve the performance of query processing only by comparing elements with specific level in the target inverted level.
PDF KSCI

Search Result 395, Processing Time 0.02 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)