• Title/Summary/Keyword: tree indexing

T*-tree: An Efficient Indexing Technique for Main Memory Database (T*-트리:주기억 데이터베이스에서의 효율적인 색인기법)

  • 최공림;김기룡;김경창
    • The Journal of Korean Institute of Communications and Information Sciences / v.21 no.10 / pp.2597-2604 / 1996
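  • This paper proposes a new index structure, the T*-tree, for efficient data processing in main memory database systems. Unlike conventional disk-based indexing techniques, the T*-tree targets systems in which all data reside in main memory, where fast data access and efficient use of memory space matter most. It inherits the strengths of the T-tree index structure, which is widely used in main memory database systems, while remedying its weaknesses. The proposed T*-tree performs about the same as the T-tree in searching for data items and in storage utilization, but it performs better on range queries and on searching for and deleting data items because it shortens the traversal path between nodes at internal nodes. Combining the T-tree with a threaded binary tree also shortens the traversal path somewhat, but moving from an internal node to a successor node at a higher level still relies on conventional in-order tree traversal, whereas the T*-tree uses successor pointers and can reach that node directly. The paper describes the structure of the proposed T*-tree and its search, insertion, and deletion algorithms, then compares its performance with that of the existing T-tree and presents the results. The performance analysis shows that the T*-tree performs nearly the same as the existing T-tree for data search, slightly better when the index structure is modified by insertion or deletion, and much better for range queries and sequential queries.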
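
The successor-pointer idea in the abstract above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Node` layout, the `build`, `link_successors`, and `range_query` names, and the static bulk-loading from pre-sorted chunks are all assumptions made for illustration, and insertion, deletion, and rebalancing are omitted. Each node holds several sorted keys, as in a T-tree, plus a pointer to the node that follows it in in-order sequence, so a range query descends once and then follows pointers instead of climbing back up the tree.

```python
from bisect import bisect_left
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Node:
    keys: List[int]                      # sorted keys stored in this node
    left: Optional["Node"] = None
    right: Optional["Node"] = None
    successor: Optional["Node"] = None   # next node in in-order sequence


def build(chunks: List[List[int]]) -> Optional[Node]:
    """Build a balanced tree from key chunks that are already in key order."""
    if not chunks:
        return None
    mid = len(chunks) // 2
    node = Node(chunks[mid])
    node.left = build(chunks[:mid])
    node.right = build(chunks[mid + 1:])
    return node


def link_successors(root: Optional[Node]) -> None:
    """Set every successor pointer with one iterative in-order pass."""
    prev: Optional[Node] = None
    stack: List[Node] = []
    node = root
    while stack or node:
        while node:
            stack.append(node)
            node = node.left
        node = stack.pop()
        if prev is not None:
            prev.successor = node
        prev = node
        node = node.right


def range_query(root: Optional[Node], low: int, high: int) -> List[int]:
    """Return all keys k with low <= k <= high, in ascending order."""
    # Descend once to the first node whose largest key is >= low.
    node, start = root, None
    while node:
        if node.keys[-1] < low:
            node = node.right
        else:
            start = node
            node = node.left
    # Then follow successor pointers; no further tree traversal is needed.
    result: List[int] = []
    node = start
    while node and node.keys[0] <= high:
        first = bisect_left(node.keys, low)
        result.extend(k for k in node.keys[first:] if k <= high)
        node = node.successor
    return result


chunks = [[1, 3, 5], [7, 9, 11], [13, 15, 17], [19, 21, 23], [25, 27, 29]]
root = build(chunks)
link_successors(root)
print(range_query(root, 8, 20))  # [9, 11, 13, 15, 17, 19]
```

The scan from node `[7, 9, 11]` to its in-order successor, the root `[13, 15, 17]`, is the step the abstract singles out: the successor sits at a higher level, and the pointer reaches it without an in-order climb.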

Indexing Scheme for Case-Based Designs using Memory-Based Learning (기억기반학습을 이용한 사례기반설계시 참조사례의 인덱싱)

  • Gang, Jae-Ho;Ryu, Gwang-Ryeol;Lee, Dong-Gon
    • Journal of KIISE:Computing Practices and Letters / v.5 no.1 / pp.79-87 / 1999
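  • Case-based reasoning (CBR) is a problem-solving approach that, given a new problem, efficiently derives a suitable solution by adapting the solutions of similar past problems. Case-based design applies CBR to design: it consults and reuses past cases that were designed under similar requirements, and it is used in many fields, including conceptual ship design. To produce high-quality designs efficiently with case-based design, one must select cases that match the design requirements of the target, and must clearly recognize the differences between the selected case and the current design conditions so that the case can be adapted to the current situation. This paper presents an indexing technique that selects appropriate cases for a new design by using memory-based learning to learn the selection tendencies found in records of past case selections. We applied the technique to the problem of selecting previously built ships as references in conceptual ship design, a typical example of case-based design. The experiments confirmed that the proposed memory-based learning method outperforms methods that suggest reference cases using a decision tree or simple heuristics.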

An Efficient Compression Method for Multi-dimensional Index Structures (다차원 색인 구조를 위한 효율적인 압축 방법)

  • 조형주;정진완
    • Journal of KIISE:Databases / v.30 no.5 / pp.429-437 / 2003
  • Over the last decades, improvements in CPU speed have exceeded those in memory and disk speeds by orders of magnitude, which has made it practical to use compression techniques to reduce both the database size and the query cost. Although compression techniques are employed in various areas of database research, there is little work on compressing multi-dimensional index structures. In this paper, we propose an efficient compression method called the hybrid encoding method (HEM) that is tailored to multi-dimensional index structures. HEM compression significantly reduces the query cost and the size of multi-dimensional index structures. Through mathematical analyses and extensive experiments, we show that HEM compression outperforms an existing method in terms of index size and query cost.

Content-based Image Retrieval System (내용기반 영상검색 시스템)

  • Yoo, Hun-Woo;Jang, Dong-Sik;Jung, She-Hwan;Park, Jin-Hyung;Song, Kwang-Seop
    • Journal of Korean Institute of Industrial Engineers / v.26 no.4 / pp.363-375 / 2000
  • In this paper we propose a content-based image retrieval method that can search large image databases efficiently by color, texture, and shape content. Quantized RGB histograms and the dominant triple (hue, saturation, and value), extracted from the quantized HSV joint histogram of a local image region, represent global and local color information in the image. Entropy and the maximum entry of co-occurrence matrices represent texture information, and an edge angle histogram represents shape information. A relevance feedback approach that couples the proposed features is used to obtain better retrieval accuracy. Simulation results show that the method achieves a 77.5 percent precision rate without relevance feedback and a higher precision rate with relevance feedback over all queries. We also present a new indexing method that supports fast retrieval in large image databases. Tree structures constructed by the k-means algorithm, together with the triangle inequality, eliminate candidate images from the similarity calculation between the query image and each database image. We find that the proposed method removes, on average, up to 92.9 percent of the images from direct comparison.
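
The triangle-inequality pruning mentioned in the abstract above can be sketched as follows. This is a simplified illustration, not the paper's method: it uses a single flat level of clusters with centers supplied by hand rather than a tree built by k-means, and `build_index`, `nearest`, and the sample vectors are hypothetical. For a query q, a cluster center c, and a stored item x, the triangle inequality gives |d(q, c) - d(x, c)| <= d(q, x), so an item whose lower bound already reaches the best distance found so far can be skipped without computing d(q, x).

```python
import math
from typing import List, Sequence, Tuple

Vector = Sequence[float]
Cluster = List[Tuple[float, Vector]]   # (distance to own center, item)


def build_index(points: List[Vector], centers: List[Vector]) -> List[Cluster]:
    """Assign each point to its nearest center and store that distance."""
    clusters: List[Cluster] = [[] for _ in centers]
    for p in points:
        dists = [math.dist(p, c) for c in centers]
        j = dists.index(min(dists))
        clusters[j].append((dists[j], p))
    return clusters


def nearest(query: Vector, centers: List[Vector], clusters: List[Cluster]):
    """Return (best item, its distance, number of item distances computed)."""
    best, best_d, computed = None, math.inf, 0
    for center, members in zip(centers, clusters):
        d_qc = math.dist(query, center)          # one distance per cluster
        for d_xc, x in members:
            # Lower bound on d(query, x) from the triangle inequality.
            if abs(d_qc - d_xc) >= best_d:
                continue                         # x cannot beat the current best
            computed += 1
            d = math.dist(query, x)
            if d < best_d:
                best, best_d = x, d
    return best, best_d, computed


points = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0),
          (10.0, 10.0), (11.0, 10.0), (10.0, 11.0)]
centers = [(0.3, 0.3), (10.3, 10.3)]             # assumed, not learned by k-means
clusters = build_index(points, centers)
best, best_d, computed = nearest((0.9, 0.1), centers, clusters)
print(best, computed)
```

In this toy run the whole second cluster is skipped, so only the three items of the first cluster are compared directly with the query.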

A Practical Approximate Sub-Sequence Search Method for DNA Sequence Databases (DNA 시퀀스 데이타베이스를 위한 실용적인 유사 서브 시퀀스 검색 기법)

  • Won, Jung-Im;Hong, Sang-Kyoon;Yoon, Jee-Hee;Park, Sang-Hyun;Kim, Sang-Wook
    • Journal of KIISE:Databases / v.34 no.2 / pp.119-132 / 2007
  • In molecular biology, approximate subsequence search is one of the most important operations. In this paper, we propose an accurate and efficient method for approximate subsequence search in large DNA databases. The proposed method adopts a binary trie as its primary structure and stores all the window subsequences extracted from a DNA sequence. For approximate subsequence search, it traverses the binary trie in a breadth-first fashion and retrieves all the matched subsequences from the traversed path within the trie by a dynamic programming technique. However, the proposed method stores only window subsequences of a predetermined length, and thus suffers from long post-processing time for long query sequences. To overcome this problem, we divide a query sequence into shorter pieces, search for those subsequences, and then merge their results. To verify the superiority of the proposed method, we evaluated its performance in a series of experiments. The results reveal that the proposed method, which requires less storage space, achieves a 4- to 17-fold improvement in performance over the suffix tree based method. Even when the query sequence is long, our method is more than an order of magnitude faster than the suffix tree based method and the Smith-Waterman algorithm.
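
A binary trie over DNA windows, as described in the abstract above, might look like the Python sketch below. Several simplifications are assumed and should not be read as the paper's design: the 2-bit code per base, the `TrieNode`, `build_trie`, and `search` names, a depth-first walk instead of the breadth-first traversal, and a substitution-only (Hamming) mismatch count swapped in for the dynamic programming step. The query must have exactly the window length.

```python
from typing import List, Optional

# Assumed 2-bit encoding so that every trie node has at most two children.
CODE = {"A": "00", "C": "01", "G": "10", "T": "11"}


class TrieNode:
    __slots__ = ("child", "positions")

    def __init__(self) -> None:
        self.child: List[Optional["TrieNode"]] = [None, None]
        self.positions: List[int] = []   # window start offsets, set at leaves


def build_trie(sequence: str, window: int) -> TrieNode:
    """Insert every length-`window` subsequence of `sequence` into the trie."""
    root = TrieNode()
    for start in range(len(sequence) - window + 1):
        node = root
        for base in sequence[start:start + window]:
            for bit in CODE[base]:
                i = int(bit)
                if node.child[i] is None:
                    node.child[i] = TrieNode()
                node = node.child[i]
        node.positions.append(start)
    return root


def search(root: TrieNode, query: str, max_mismatches: int) -> List[int]:
    """Start offsets of windows that differ from `query` in at most
    `max_mismatches` positions (substitutions only)."""
    hits: List[int] = []

    def walk(node: TrieNode, depth: int, mismatches: int) -> None:
        if depth == len(query):
            hits.extend(node.positions)
            return
        for base, bits in CODE.items():
            cost = mismatches + (base != query[depth])
            if cost > max_mismatches:
                continue                  # prune this branch early
            nxt = node.child[int(bits[0])]
            if nxt is not None:
                nxt = nxt.child[int(bits[1])]
            if nxt is not None:
                walk(nxt, depth + 1, cost)

    walk(root, 0, 0)
    return sorted(hits)


trie = build_trie("ACGTACGAAC", window=4)
print(search(trie, "ACGT", 0))  # [0]
print(search(trie, "ACGT", 1))  # [0, 4]
```

Because windows that share a prefix share a trie path, a branch that exceeds the mismatch budget is abandoned once for all windows below it.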

Implementation of an Efficient Microbial Medical Image Retrieval System Applying Knowledge Databases (지식 데이타베이스를 적용한 효율적인 세균 의료영상 검색 시스템의 구현)

  • Shin Yong Won;Koo Bong Oh
    • Journal of the Korea Society of Computer and Information / v.10 no.1 s.33 / pp.93-100 / 2005
  • This study designs and implements an efficient microbial medical image retrieval system, based on knowledge and image content, that supports more accurate decisions on colonies as well as efficient education for new technicians. To this end, we first address overall inference that sets up a flexible search path using a rule base, in order to reduce the time required for microbial identification by searching for the fastest path through the identification phase based on heuristic knowledge. Next, we propose a color feature extraction method that extracts color feature vectors of visual content from a microbial image, especially a bacteria image, using the HSV color model. In addition, for better retrieval performance on large microbial databases, we present an integrated indexing technique that combines a B+-tree for indexing simple attributes, an inverted file structure for lists of medical text keywords, and a scan-based filtering method for high-dimensional color feature vectors. Finally, the implemented system shows that complex microbial images can be managed and retrieved effectively using knowledge and visual content. We expect the proposed system to rapidly decrease the learning time of entry-level technicians by organizing knowledge of clinical fields well.

An Index Structure for Substructure Searching In Chemical Databases (화학 데이타베이스에서 부분구조 검색을 위한 인덱스 구조)

  • Lee Hwangu;Cha Jaehyuk
    • Journal of KIISE:Databases / v.31 no.6 / pp.641-649 / 2004
  • The relationship between chemical structures and biological activities is actively researched in medicinal chemistry. In structure-based drug design, medicinal chemists search for existing drugs whose chemical structure is similar to that of a target drug in order to develop a new drug. An automatic system is therefore needed that selects drug files containing a set of chemical moieties matching a user-defined query moiety. Substructure searching is the process of identifying a set of chemical moieties that match a specific query moiety. Testing for substructure searching was developed in the late 1950s. In graph-theoretic terms, this problem corresponds to determining which graphs in a set contain a subgraph isomorphic to a specified query moiety. Testing for subgraph isomorphism has been proved, in the general case, to be an NP-complete problem, and computational approaches have been developed to overcome this difficulty. In the 1990s, a US patent was granted on an atom-centered indexing scheme used by the RS3 system; its virtue is that the generated indexes can be searched by direct text comparison. This system is used commercially (http://www.acelrys.com/rs3). We identify the RS3 system's drawback and present a new indexing scheme. The RS3 system treats substructure searching as substring matching by expressing chemical structures as predefined strings. However, it has insufficient recall and precision because it cannot index structures uniquely for the same atom and the same bond. To resolve this problem, we build a minimum-cost spanning tree for each center atom and describe a structure with paths per level. Expressing a 2D chemical structure as a single 1D string has limits, so we break the 2D chemical structure into 1D structure fragments. The new index technique presented in this paper improves recall and precision.

Efficient Content-Based Image Retrieval Method using Shape and Color feature (형태와 칼러성분을 이용한 효율적인 내용 기반의 이미지 검색 방법)

  • Youm, Sung-Ju;Kim, Woo-Saeng
    • The Transactions of the Korea Information Processing Society / v.3 no.4 / pp.733-744 / 1996
  • Content-based image retrieval (CBIR) is an image retrieval methodology that uses characteristic values of image data, generated automatically by the system, without any caption or text information. In this paper, we propose a content-based image retrieval method that uses the shape and color features of image data as characteristic values. To this end, we present image processing techniques for feature extraction and indexing techniques based on a trie and an R-tree for fast image retrieval. In our approach, query results are more reliable because both shape and color features are considered. We also show an image database implemented according to our approach, sample retrieval results selected by our system from 200 sample images, and an analysis of the results that considers the effect of the shape and color characteristic values.

An Efficient Spatiotemporal Index Unifying Temporal and Spatial Dimensions (시간과 공간의 단일화를 통한 효율적 시공간 색인)

  • Sin, Ye-Ho;Kim, Dong-Ho;Ryu, Geun-Ho
    • Journal of KIISE:Software and Applications / v.26 no.9 / pp.1039-1051 / 1999
  • Spatiotemporal databases support efficient spatial management and history management for objects in the real world. Such a database is a very complex system because it must manage two heterogeneous data spaces, the temporal dimension and the spatial dimension. Research on efficient access methods is therefore essential, and it can be achieved by developing efficient indexing techniques. However, access methods for spatiotemporal data have hardly been studied, and the few existing works do not support the notion of history for spatial objects; they support only relative time among multimedia objects. In this paper, we therefore propose a unified spatiotemporal index model that merges time and space into a single index domain in order to index spatiotemporal data, which represent the history of spatial data, efficiently. We design and implement an index that extends the conventional R-tree on this basis, and we evaluate its performance for various types of spatiotemporal operations.
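
One way to picture the unification of time and space described above is to treat time as a third axis of the bounding box that an R-tree style index stores. The Python sketch below rests on that assumption only: the `Box` class, the `enclose` and `query` helpers, and the sample data are hypothetical, and a linear scan stands in for the R-tree itself.

```python
from dataclasses import dataclass
from typing import List, Tuple

Point3 = Tuple[float, float, float]      # (x, y, t)


@dataclass(frozen=True)
class Box:
    """Axis-aligned box over (x, y, t); time is simply a third dimension."""
    lo: Point3
    hi: Point3

    def intersects(self, other: "Box") -> bool:
        # Two closed boxes overlap when their intervals overlap on every axis.
        return all(a_lo <= b_hi and b_lo <= a_hi
                   for a_lo, a_hi, b_lo, b_hi
                   in zip(self.lo, self.hi, other.lo, other.hi))


def enclose(boxes: List[Box]) -> Box:
    """Smallest box covering all inputs, as an R-tree node would store."""
    lo = tuple(min(b.lo[i] for b in boxes) for i in range(3))
    hi = tuple(max(b.hi[i] for b in boxes) for i in range(3))
    return Box(lo, hi)


def query(entries: List[Tuple[str, Box]], window: Box) -> List[str]:
    """Objects whose extent overlaps the region during the time interval."""
    return [name for name, box in entries if box.intersects(window)]


# Each version of an object's geometry is valid over its own time interval.
history = [
    ("parcel-1 v1", Box((0, 0, 1990), (10, 10, 1995))),
    ("parcel-1 v2", Box((0, 0, 1995), (20, 10, 1999))),
    ("road-7",      Box((50, 50, 1992), (60, 60, 1999))),
]
window = Box((15, 0, 1996), (18, 5, 1997))   # region x 15-18, y 0-5, years 1996-1997
print(query(history, window))                # ['parcel-1 v2']
```

With time folded into the box, a history query such as "what occupied this region during 1996 and 1997" becomes an ordinary box-intersection test, which is the kind of test an R-tree already prunes on.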

Design and Implementation of e-Logistics System supporting Efficient Moving Objects Trajectory Management (효율적인 차량 궤적 관리를 지원하는 물류관리시스템의 설계 및 구현)

  • Lee, Eung-Jae;Nam, Kwang-Woo;Ryu, Keun-Ho
    • Journal of the Korean Association of Geographic Information Studies / v.9 no.2 / pp.30-41 / 2006
  • This paper proposes an e-logistics system that supports efficient management of vehicle trajectories. Recent advances in wireless communications have given rise to a number of location-based services, including logistics vehicle tracking, locating cellular phone users, and location-based commerce. Logistics systems typically track vehicles so that the logistics center knows the whereabouts of the vehicles and consignments. Moreover, storing and managing the location trajectories of continuously moving vehicles and consignments is necessary to support efficient logistics planning and consignment. The proposed system manages spatial objects in GIS as well as logistics information in the mobile environment. To manage and retrieve transport trajectories efficiently, we extend a previous moving object indexing method, the TB-Tree, to use a multi-version framework, and we evaluate its data update performance. The proposed method can be applied to develop mobile content services based on the continuously changing locations of moving objects in the mobile environment.
