• Title/Summary/Keyword: Index Database

Search Result 866, Processing Time 0.031 seconds

Towards a Indexing Structure for Querying Product Information in Relational Databases (관계형 데이터베이스 상품 정보 질의 처리를 위한 인덱싱)

  • Lee, Hyu-Nja;Shim, Jun-Ho
    • The Journal of Society for e-Business Studies
    • /
    • v.13 no.4
    • /
    • pp.209-222
    • /
    • 2008
  • The product information can be practically stored and managed, and also queried when we use a relational database. We need to develop a special indexing scheme in order to process the queries to ask for the semantic relationships of the product information. Such semantic relationships include ISA or taxonomy relationships that have the transitive properties. In this paper, we propose an index scheme to effectively process those queries with the transitive property. The proposed index scheme is based on a numbering scheme that has relatively low update cost.

  • PDF

Design and Implementation of a Main-Memory Database System for Real-time Mobile GIS Application (실시간 모바일 GIS 응용 구축을 위한 주기억장치 데이터베이스 시스템 설계 및 구현)

  • Kang, Eun-Ho;Yun, Suk-Woo;Kim, Kyung-Chang
    • The KIPS Transactions:PartD
    • /
    • v.11D no.1
    • /
    • pp.11-22
    • /
    • 2004
  • As random access memory chip gets cheaper, it becomes affordable to realize main memory-based database systems. Consequently, reducing cache misses emerges as the most important issue in current main memory databases, in which CPU speeds have been increasing at 60% per year, compared to the memory speeds at 10% per you. In this paper, we design and implement a main-memory database system for real-time mobile GIS. Our system is composed of 5 modules: the interface manager provides the interface for PDA users; the memory data manager controls spatial and non-spatial data in main-memory using virtual memory techniques; the query manager processes spatial and non-spatial query : the index manager manages the MR-tree index for spatial data and the T-tree index for non-spatial index : the GIS server interface provides the interface with disk-based GIS. The MR-tree proposed propagates node splits upward only if one of the internal nodes on the insertion path has empty space. Thus, the internal nodes of the MR-tree are almost 100% full. Our experimental study shows that the two-dimensional MR-tree performs search up to 2.4 times faster than the ordinary R-tree. To use virtual memory techniques, the memory data manager uses page tables for spatial data, non- spatial data, T-tree and MR-tree. And, it uses indirect addressing techniques for fast reloading from disk.

Methods for improving Database Performance through SQL Analysis in the Course Registration System (수강신청 시스템에서의 SQL 분석을 통한 데이터베이스 성능 향상 방안)

  • Kim, Hee Wan
    • The Journal of the Convergence on Culture Technology
    • /
    • v.6 no.4
    • /
    • pp.693-701
    • /
    • 2020
  • In this paper, SQL statements are analyzed to improve database performance in the current course registration system. The performance of the current database was measured through the execution plan of the SQL statements used in the transactions related to the course registration. Through the SQL analysis, the complemented SQL statements confirmed the improved performance. Overall, the performance of the course registration database system was improved through the analysis of the execution plan, and some improvement methods of the course registration SQL were shown as test results. The improved method is to reorganize the tables and index tables related to the course registration through database tuning, and utilize the SQL function to implement an optimized system that has evolved into a course database system with improved performance. The enrollment system re-adjusted by the proposed method showed excellent results in terms of performance compared to the previous enrollment system, and the integrated performance test result reduced the response time by 1.8 to 18 times.

Optimal Construction of Multiple Indexes for Time-Series Subsequence Matching (시계열 서브시퀀스 매칭을 위한 최적의 다중 인덱스 구성 방안)

  • Lim, Seung-Hwan;Kim, Sang-Wook;Park, Hee-Jin
    • Journal of KIISE:Databases
    • /
    • v.33 no.2
    • /
    • pp.201-213
    • /
    • 2006
  • A time-series database is a set of time-series data sequences, each of which is a list of changing values of the object in a given period of time. Subsequence matching is an operation that searches for such data subsequences whose changing patterns are similar to a query sequence from a time-series database. This paper addresses a performance issue of time-series subsequence matching. First, we quantitatively examine the performance degradation caused by the window size effect, and then show that the performance of subsequence matching with a single index is not satisfactory in real applications. We argue that index interpolation is fairly useful to resolve this problem. The index interpolation performs subsequence matching by selecting the most appropriate one from multiple indexes built on windows of their inherent sizes. For index interpolation, we first decide the sites of windows for multiple indexes to be built. In this paper, we solve the problem of selecting optimal window sizes in the perspective of physical database design. For this, given a set of query sequences to be peformed in a target time-series database and a set of window sizes for building multiple indexes, we devise a formula that estimates the cost of all the subsequence matchings. Based on this formula, we propose an algorithm that determines the optimal window sizes for maximizing the performance of entire subsequence matchings. We formally Prove the optimality as well as the effectiveness of the algorithm. Finally, we perform a series of extensive experiments with a real-life stock data set and a large volume of a synthetic data set. The results reveal that the proposed approach improves the previous one by 1.5 to 7.8 times.

A Study on Adding Index Terms for improving the retrieval efficiency of the STI database (과학기술문헌 데이터베이스의 검색효율 향상을 위한 색인 보완 방안)

  • Kim, Byung-kyu;Kim, Tae-jung;Kang, Mu-yeong;You, Beom-jong
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2011.05a
    • /
    • pp.293-294
    • /
    • 2011
  • KISTI collects the scientific and technical articles published in Korea and builds the Korean STI database for scientists. The number of papers exceeds one million. To improve the search efficiency of the database additional processing is required. Abstracting, classification, indexing and extracting is a traditional processing method adding value to information. Indexing and classification are useful tool to assist efficient retrieval. In this paper, authors propose a method to improve information retrieval efficiency by assigning classification code and index terms to records of Korean STI database.

  • PDF

Parallel R-tree Using Multiple Disks (복수의 Disk를 사용하는 병렬형 R-tree)

  • 방갑산;김일민
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 1998.10b
    • /
    • pp.114-116
    • /
    • 1998
  • 1차원 이상의 공간 데이터의 효율적인 처리는 현대의 멀티미디어 데이터베이스에 있어서 대단히 중요한 역할을 하고 있다. 공간데이터를 관리하는 공간 index structure는 대부분 serial processing을 위한 구조를 가지고 있다. 많은 application에서 방대한 양의 공간 데이터는 보조기억장치(예: disk)에 저장이 되어 사용이 되고 공간 index structure의 query반응시간을 현저하게 줄일 수 있다. 또한 여러개의 disk를 사용하는 병렬처리는 방대한 양의 공간 데이터를 저장하는데 적당하다. 본 논문에서는 PML-tree라는 병렬형 공간 index structure를 제안한다. PML-tree는 MXR-tree에 비해 높은 공간활용도와 빠른 처리시간을 보임으로써 공간 database를 위한 효율적인 index structure로 사용이 될 것으로 기대된다.

Design of Memory-Resident GIS Database Systems

  • Lee, J. H.;Nam, K.W.;Lee, S.H.;Park, J.H.
    • Proceedings of the KSRS Conference
    • /
    • 2003.11a
    • /
    • pp.499-501
    • /
    • 2003
  • As semiconductor memory becomes cheaper, the memory capacity of computer system is increasing. Therefore computer system has sufficient memory for a plentiful spatial data. With emerging spatial application required high performance, this paper presents a GIS database system in main memory. Memory residence can provide both functionality and performance for a database management system. This paper describes design of DBMS for storing, querying, managing and analyzing for spatial and non-spatial data in main-memory. This memory resident GIS DBMS supports SQL for spatial query, spatial data model, spatial index and interface for GIS tool or applications.

  • PDF

Correlations between Spatial Distribution of Alien Plants and Land Cover - Focused on National Ecosystem Survey - (외래식물의 공간분포와 토지피복간의 상관성 연구 - 전국자연환경조사 자료를 중심으로 -)

  • Jung, Tae-Jun;Shin, Hyun-Chul;Shin, Young-Kyu;Kim, Myung-Jin
    • Journal of Environmental Impact Assessment
    • /
    • v.22 no.5
    • /
    • pp.455-466
    • /
    • 2013
  • The aim of this study is to seek application plans of National Ecosystem Survey database based on comparison and examination of feasible analysis methods for distribution characteristics of alien plants. In order to set up a correlation analysis method between alien plants and environmental factors, we had reviewed the 3rd National Ecosystem Survey guide book and consequently, two kinds of analysis method were tested. One was 1/25,000 scale map boundary based analysis and the other was representative mountain area based analysis. In this study we restricted the research area to select reliable surveyed database from whole "2011 National Ecosystem Survey flora database" according to two major criteria. First, an area defined by 1/25,000 scale map boundary and representative mountain area where the number of surveyed flora records should be within top 20%. Second, land cover map should also be built up inside that area. As a result, 25 map boundaries and 25 representative mountain areas were extracted to be analyzed. To limit a boundary for every representative mountain area we had analyzed distribution of environmental factors around that area by manual inspection with SPOT-5 remote sensed satellite image then designated 3km buffer zone from each alien plant location in that area. After then, naturalized index (NI) and urbanized index (UI) was calculated and correlations analysis was carried out. With the result of correlation analysis by map boundary only agricultural land area showed significant value of r (0.4~0.6, correlated) and the rest of factors did not. In the case of representative mountain area, the result showed that agricultural land, road and forest area showed significant value of r (0.6~0.8, highly correlated) which was corresponding to existing researches. Therefore, representative mountain area based method is preferable when using the alien plants database of National Ecosystem Survey for species distribution analysis. And also, considering the way of database utilization is strongly suggested at the first stage of survey planning for promoting active use of national ecosystem survey database.

MPI: A Practical Index Scheme for XML Data in Object Databases

  • Song Ha-Joo
    • Journal of Korea Multimedia Society
    • /
    • v.8 no.6
    • /
    • pp.729-734
    • /
    • 2005
  • In order to access XML data stored in object databases, an efficient index scheme is inevitable. There have been several index schemes that can be used to efficiently retrieve XML data stored In object databases, but they are all the single path indexes that support indexing along a single schema path. Henee, if a query contains an extended path which is denoted by wild character ('*'), a query processor has to examine multiple index objects, resulting in poor performance and inconsistent index management. In this paper, we propose MPI (Multi-Path Index) scheme as a new index scheme that provides the functionality of multiple path indexes more efficiently, while it uses only one index structure. The proposed scheme is easy to manage since it considers the extended path as a logically single schema path. It is also practical since it can be implemented by little modification of the B -tree index structure.

  • PDF

Asymmetric Index Management Scheme for High-capacity Compressed Databases (대용량 압축 데이터베이스를 위한 비대칭 색인 관리 기법)

  • Byun, Si-Woo;Jang, Seok-Woo
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.17 no.7
    • /
    • pp.293-300
    • /
    • 2016
  • Traditional databases exploit a record-based model, where the attributes of a record are placed contiguously in a slow hard disk to achieve high performance. On the other hand, for read-intensive data analysis systems, the column-based compressed database has become a proper model because of its superior read performance. Currently, flash memory SSD is largely recognized as the preferred storage media for high-speed analysis systems. This paper introduces a compressed column-storage model and proposes a new index and its data management scheme for a high-capacity data warehouse system. The proposed index management scheme is based on the asymmetric index duplication and achieves superior search performance using the master index and compact index, particularly for large read-mostly databases. In addition, the data management scheme contributes to the read performance and high reliability by compressing the related columns and replicating them in two mirrored SSD. Based on the results of the performance evaluation under the high workload conditions, the data management scheme outperforms the traditional scheme in terms of the search throughput and response time.