A Data Mining Approach for Selecting Bitmap Join Indices

Bellatreche, Ladjel;Missaoui, Rokia;Necir, Hamid;Drias, Habiba
- Journal of Computing Science and Engineering, v.1 no.2, pp.177-194, 2007
Index selection is one of the most important decisions in the physical design of relational data warehouses. Indices significantly reduce the cost of processing complex OLAP queries, but they consume storage and incur maintenance overhead. Two main types of indices are available: mono-attribute indices (e.g., B-tree, bitmap, hash) and multi-attribute indices (join indices, bitmap join indices). Bitmap join indices are well suited to optimizing star join queries, which are characterized by joins between a large fact table and multiple dimension tables and by selections on the dimension tables. They require less storage owing to their binary representation. However, selecting these indices is difficult because of the exponential number of candidate attributes to be indexed. Most approaches to index selection follow two main steps: (1) pruning the search space (i.e., reducing the number of candidate attributes) and (2) selecting indices using the pruned search space. In this paper, we first propose a data mining driven approach to prune the search space of the bitmap join index selection problem. As opposed to an existing technique that uses only the frequency of attributes in queries as a pruning metric, our technique uses not only frequencies but also other parameters, such as the size of the dimension tables involved in the indexing process, the size of each dimension tuple, and the page size on disk. We then define a greedy algorithm to select bitmap join indices that minimize processing cost and satisfy the storage constraint. Finally, to evaluate the efficiency of our approach, we compare it with some existing techniques.
https://doi.org/10.5626/JCSE.2007.1.2.177
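
As a rough illustration of the structure this abstract discusses, the sketch below builds a bitmap join index over a toy star schema: for each value of a dimension attribute, it keeps one bitmap marking the fact rows that join to it. The tables, column names, and the helpers `build_bitmap_join_index` and `matching_rows` are hypothetical and are not taken from the paper; the pruning and greedy selection steps are not shown.

```python
from collections import defaultdict


def build_bitmap_join_index(fact_rows, dim_rows, fk, dim_key, attr):
    """Map each value of a dimension attribute to a bitmap over fact rows.

    Bitmaps are stored as Python integers: bit i is set when fact row i
    joins to a dimension row whose `attr` equals the indexed value.
    """
    # Look up the indexed attribute value for every dimension key.
    attr_by_key = {d[dim_key]: d[attr] for d in dim_rows}
    bitmaps = defaultdict(int)
    for pos, row in enumerate(fact_rows):
        # Set bit `pos` in the bitmap of the value this fact row joins to.
        bitmaps[attr_by_key[row[fk]]] |= 1 << pos
    return dict(bitmaps)


def matching_rows(bitmap, n_rows):
    """Return the positions of the fact rows whose bit is set."""
    return [i for i in range(n_rows) if (bitmap >> i) & 1]


# Toy data (assumed for illustration only).
customers = [{"cid": 1, "city": "Seoul"}, {"cid": 2, "city": "Busan"}]
sales = [
    {"cid": 1, "amount": 10},
    {"cid": 2, "amount": 20},
    {"cid": 1, "amount": 5},
]

index = build_bitmap_join_index(sales, customers, "cid", "cid", "city")
seoul_rows = matching_rows(index["Seoul"], len(sales))
```

A selection such as `city = 'Seoul'` is then answered from the bitmap alone, without joining the fact table to the dimension table at query time.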

A Group Update Technique based on a Buffer Node to Store a Vehicle Location Information (차량 위치 정보 저장을 위한 버퍼 노드 기반 그룹 갱신 기법)

Jung, Young-Jin;Ryu, Keun-Ho
- Journal of KIISE:Databases, v.33 no.1, pp.1-11, 2006
Advances in wireless telecommunication and GPS, the spread of networks, and the miniaturization of cellular phones have made it possible to track moving vehicles and to actively develop location based services. Providing these services requires an index technique that can store and search very large volumes of moving object data rapidly. However, existing indices incur a high insertion cost because they store every position record directly in the index. To solve this problem, we propose a buffer node operation and design a GU-tree (Group Update tree). The proposed buffer node method reduces the insertion cost effectively because it stores moving object location data as a group, using the buffer node as the unit of a non-leaf node. We then confirm, through experiments comparing it with some existing indices, that the buffer node operation reduces the insertion cost and increases search performance for time slice queries. The proposed buffer node operation would be useful in environments where locations are updated frequently, such as transportation vehicle management and tour-guide systems.
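
The general idea of grouping updates in a buffer before touching the index can be sketched as follows. This is only an assumption-laden illustration: the class `BufferedLocationIndex`, its `capacity` parameter, and the dictionary standing in for the tree are hypothetical, and the actual GU-tree node layout is not described in the abstract.

```python
class BufferedLocationIndex:
    """Toy index that applies location updates in groups, not one by one."""

    def __init__(self, capacity):
        self.capacity = capacity  # updates held before a group insertion
        self.buffer = []          # pending (vehicle_id, t, x, y) records
        self.index = {}           # stand-in for the tree: id -> positions
        self.index_writes = 0     # number of group insertions performed

    def update(self, vehicle_id, t, x, y):
        """Record a new position; touch the index only when the buffer fills."""
        self.buffer.append((vehicle_id, t, x, y))
        if len(self.buffer) >= self.capacity:
            self.flush()

    def flush(self):
        """Move all buffered records into the index as one group."""
        if not self.buffer:
            return
        for vehicle_id, t, x, y in self.buffer:
            self.index.setdefault(vehicle_id, []).append((t, x, y))
        self.buffer.clear()
        self.index_writes += 1


# Ten updates with a buffer of four cause two group insertions,
# leaving two records pending in the buffer.
idx = BufferedLocationIndex(capacity=4)
for t in range(10):
    idx.update(vehicle_id=t % 3, t=t, x=float(t), y=0.0)
```

In this sketch a query would also have to consult the pending buffer; that part is omitted for brevity.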

A Flash Memory B+-Tree for Efficient Range Searches (효율적 범위 검색을 위한 플래시 메모리 기반 B+-트리)

Lim, Sung-Chae;Park, Chang-Sup
- The Journal of the Korea Contents Association, v.13 no.9, pp.28-38, 2013
During the past decades, the B+-tree has been the most widely used index file structure for disk-resident databases. For the disk based B+-tree, a node update can be performed cheaply just by modifying its associated disk page in place. However, when the B+-tree is stored on flash memory, the traditional B+-tree algorithms become impractical because of the prohibitive cost of in-place updates on flash memory. For this reason, earlier schemes for flash memory B+-trees usually save B+-tree changes from real-time updates into extra temporary storage. Although that approach easily prevents frequent in-place updates in the B+-tree, it can suffer from wasted storage space and prolonged search times. In particular, it does not allow range searches to be processed at the leaf node level. To resolve such problems, we devise a new scheme in which the leaf nodes and their parent node are stored together in a single flash block, called the p-node block.
https://doi.org/10.5392/JKCA.2013.13.09.028

Index-based Boundary Matching Supporting Partial Denoising for Large Image Databases

Kim, Bum-Soo
- Journal of the Korea Society of Computer and Information, v.24 no.10, pp.91-99, 2019
In this paper, we propose index-based partial denoising boundary matching for faster matching in very large image databases. Attempts have recently been made to convert boundary images to time-series with the objective of solving the partial denoising problem in boundary matching. We deal with the disk I/O overhead of boundary matching that supports partial denoising in a large image database. Although the solution superficially appears trivial, since it only applies indexing techniques to boundary matching, it is not: multiple indexes are required, one for every possible denoising parameter. Our solution is an efficient index-based approach to partial denoising using the R*-tree in boundary matching. Experimental results show that our index-based matching methods improve search performance by orders of magnitude.
https://doi.org/10.9708/jksci.2019.24.10.091

Hippocratic XML Databases: A Model and Access Control Mechanism (히포크라테스 XML 데이터베이스: 모델 및 액세스 통제 방법)

Lee Jae-Gil;Han Wook-Shin;Whang Kyu-Young
- Journal of KIISE:Databases, v.31 no.6, pp.684-698, 2004
The Hippocratic database model recently proposed by Agrawal et al. incorporates privacy protection capabilities into relational databases. Since the Hippocratic database is based on the relational database, it needs extensions to be adapted for XML databases. In this paper, we propose the Hippocratic XML database model, an extension of the Hippocratic database model for XML databases, and present an efficient access control mechanism under this model. In contrast to relational data, XML data have tree-like hierarchies. Thus, in order to manage these hierarchies, we extend and formally define concepts presented in the Hippocratic database model, such as privacy preferences, privacy policies, privacy authorizations, and usage purposes of data records. Next, we present a new mechanism, which we call the authorization index, that is used in the access control mechanism. This authorization index, which is implemented using a multi-dimensional index, allows us to efficiently search authorizations implied by the authorization granted on the nearest ancestor using the nearest neighbor search technique. Using synthetic and real data, we have performed extensive experiments comparing query processing time with those of existing access control mechanisms. The results show that the proposed access control mechanism improves the wall clock time by up to 13.6 times over the top-down access control strategy and by up to 20.3 times over the bottom-up access control strategy. The major contributions of our paper are 1) extending the Hippocratic database model into the Hippocratic XML database model and 2) proposing an efficient access control mechanism that uses the authorization index and nearest neighbor search technique under this model.
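
The rule that an element inherits the authorization granted on its nearest ancestor can be illustrated with a simple walk up the element path. This sketch deliberately swaps in a plain dictionary lookup for the paper's multi-dimensional authorization index and nearest neighbor search; the function `effective_authorization`, the path format, and the example policy are all hypothetical.

```python
def effective_authorization(path, explicit):
    """Return the authorization granted on the nearest ancestor-or-self.

    `path` is a slash-separated element path such as "/a/b/c".
    `explicit` maps paths to explicitly granted authorizations.
    Returns None when no ancestor carries an authorization.
    """
    parts = path.strip("/").split("/")
    while parts:
        key = "/" + "/".join(parts)
        if key in explicit:
            return explicit[key]
        parts.pop()  # move one level up toward the root
    return None


# Example policy (assumed for illustration only).
explicit = {
    "/hospital": "deny",
    "/hospital/patient/name": "allow",
}

on_first_name = effective_authorization("/hospital/patient/name/first", explicit)
on_record = effective_authorization("/hospital/patient/record", explicit)
on_unknown = effective_authorization("/clinic", explicit)
```

The walk costs one lookup per level of the path; the abstract's authorization index is aimed at answering the same question more efficiently.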

Design and Implementation of a CIR-Tree Manager for Efficient Image Retrieval on MiDAS-III (MiDAS-III에서 효율적인 이미지 검색을 위한 CIR-트리 관리기의 설계 및 구현)

Song, Seok-Il;Lee, Hee-Jong;Lee, Seok-Hee;Yoo, Jae-Soo;Cho, Ki-Hyung;Yoo, Kwan-Hee
- Journal of KIISE:Computing Practices and Letters, v.7 no.5, pp.392-401, 2001
Nowadays, the amount of image data is increasing explosively. To manage large amounts of image data efficiently, high-dimensional index structures are necessary. However, to our knowledge, no existing DBMS supports high-dimensional index structures as access methods. In this paper, we design and implement the CIR-Tree as an access method for retrieving image data effectively on MiDAS-III, the storage subsystem of BADA-III. In our performance evaluation, the implemented CIR-Tree manager shows much better retrieval performance than sequential search.

Indexing Techniques for Nested Attributes of OODB Using a Multidimensional Index Structure (다차원 파일구조를 이용한 객체지향 데이터베이스의 중첩속성 색인기법)

Lee, Jong-Hak
- The Transactions of the Korea Information Processing Society, v.7 no.8, pp.2298-2309, 2000
This paper proposes multidimensional nested attribute indexing techniques (MD-NAI) for object-oriented databases using a multidimensional index structure. Since most conventional indexing techniques for object-oriented databases use a one-dimensional index structure such as the B-tree, they often do not handle complex queries involving both nested attributes and class hierarchies. We extend a tunable two-dimensional class hierarchy indexing technique (2D-CHI) to nested attributes. The 2D-CHI is an indexing scheme that deals with the problem of clustering objects in a two-dimensional domain space that consists of a key attribute domain and a class identifier domain for a simple attribute in a class hierarchy. In our extended scheme, we construct indexes using multidimensional file organizations that include one class identifier domain per class hierarchy on a path expression that defines the indexed nested attribute. This scheme efficiently supports queries that involve search conditions on the nested attribute represented by an extended path expression. An extended path expression is one in which a class hierarchy can be substituted by an individual class or a subclass hierarchy in the class hierarchy.

Efficient Indexing for Large DNA Sequence Databases (대용량 DNA 시퀀스 데이타베이스를 위한 효율적인 인덱싱)

Won Jung-Im;Yoon Jee-Hee;Park Sang-Hyun;Kim Sang-Wook
- Journal of KIISE:Databases, v.31 no.6, pp.650-663, 2004
In molecular biology, DNA sequence searching is one of the most crucial operations. Since DNA databases contain a huge volume of sequences, a fast indexing mechanism is essential for efficient processing of DNA sequence searches. In this paper, we first identify the problems of the suffix tree with respect to storage overhead, search performance, and integration with DBMSs. Then, we propose a new index structure that solves those problems. The proposed index consists of two parts: the primary part represents the trie as bit strings without any pointers, and the secondary part speeds up access to the leaf nodes of the trie that need to be accessed for post-processing. We also suggest an efficient algorithm based on that index for DNA sequence searching. To verify the superiority of the proposed approach, we conducted a performance evaluation via a series of experiments. The results revealed that the proposed approach, which requires less storage space, achieves a 13- to 29-fold performance improvement over the suffix tree.

Design and Implementation of the Spatial Data Cache Based on Agents for Providing Mobile Map Services (모바일 지도 서비스를 위한 에이전트 기반의 공간 데이터 캐쉬의 설계 및 구현)

Lim, Duk-Sung;Lee, Jai-Ho;Hong, Bong-Hee
- The KIPS Transactions:PartD, v.10D no.2, pp.175-186, 2003
Mobile clients such as PDAs need a cache and a spatial index to search and access map data efficiently. When a server transmits spatial objects to a mobile client with low storage capacity, some of them can be duplicated in the client's cache. Moreover, the cost of storing added data in the cache and reconfiguring the spatial index is very high on a mobile client with low computing power. A scheme is therefore needed for processing duplicated objects and distributing the tasks of such a client. In this paper, we classify the methods for storing duplicated objects and present a scheme for both caching objects and reconfiguring a spatial index of cached objects using the clipping technique. We propose an agent-based caching system in order to distribute the overhead of a mobile client as well as to provide map services efficiently. We design and implement the system and evaluate its performance.
https://doi.org/10.3745/KIPSTD.2003.10D.2.175

Performance Evaluation of a Spatial Index Structure Supporting the Circular Property in Spatial Database Systems (공간 데이타베이스 시스템에서 순환 속성을 지원하는 공간색인구조의 성능평가)

김홍기;선휘준
- Journal of Korea Multimedia Society, v.4 no.3, pp.197-204, 2001
To increase the performance of spatial database systems, a spatial indexing method is needed that manages spatial objects efficiently in both dynamic and static environments. To increase retrieval performance, the indexing method must take spatial locality into account, and spatial locality is related to the location property of objects. Previous spatial indexing methods did not consider the circular location property of objects. In this paper, we introduce the CR-Tree, a spatial index structure for clustering spatially adjacent objects in which the search space is constructed from circular and linear domains. We show through simulation that a spatial index structure that considers the circular location property of objects increases both the hit ratio and bucket utilization.
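
One way to see why a circular domain matters for clustering: on a domain that wraps around, two values near opposite ends are actually adjacent. The sketch below assumes an angular domain with a period of 360 and uses a hypothetical helper `circular_distance`; it is not the CR-Tree itself, whose node structure the abstract does not describe.

```python
def circular_distance(a, b, period=360.0):
    """Shortest distance between two positions on a wrap-around domain."""
    d = abs(a - b) % period
    return min(d, period - d)


def linear_distance(a, b):
    """Distance on an ordinary, non-wrapping domain."""
    return abs(a - b)


# Positions 350 and 10 are far apart on a linear domain
# but adjacent on a circular one.
linear_gap = linear_distance(350.0, 10.0)
circular_gap = circular_distance(350.0, 10.0)
```

An index that measures adjacency with the linear distance would place these two objects in different clusters, while one that respects the circular property can keep them together.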
