• Title/Abstract/Keywords: Index Database

868 search results, processing time 0.026 seconds

The Extraction of Effective Index Database from Voice Database and Information Retrieval

  • 박미성
    • 한국도서관정보학회지 / Vol. 35, No. 3 / pp. 271-291 / 2004
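  • Information providers such as digital libraries face demand for services over unstructured multimedia data such as images, voice, and video. This study therefore proposes a word-phrase generator, a syllable restorer, a morphological analyzer, and a corrector for voice processing. Using the proposed voice processing techniques, the voice database is converted into a text database, and an index database is then extracted from the text database. An information retrieval model is also proposed to show that the extracted index database can be used for content-based retrieval of both text and voice.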

A Column-Aware Index Management Using Flash Memory for Read-Intensive Databases

  • Byun, Si-Woo;Jang, Seok-Woo
    • Journal of Information Processing Systems / Vol. 11, No. 3 / pp. 389-405 / 2015
  • Most traditional database systems exploit a record-oriented model in which the attributes of a record are placed contiguously on a hard disk to achieve high-performance writes. However, for read-mostly data warehouse systems, the column-oriented database has become a suitable model because of its superior read performance. Today, flash memory is widely recognized as the preferred storage medium for high-speed database systems. In this paper, we introduce a column-oriented database model based on flash memory and then propose a new column-aware flash indexing scheme for high-speed column-oriented data warehouse systems. Our index management scheme, which uses an enhanced $B^+$-Tree, achieves superior search performance by indexing an embedded segment and packing unused space in internal and leaf nodes. Based on the performance results of two test databases, we conclude that the column-aware flash index management outperforms the traditional scheme with respect to mixed operation throughput and response time.

Abstracted Partitioned-Layer Index: A Top-k Query Processing Method Reducing the Number of Random Accesses of the Partitioned-Layer Index

  • 허준석
    • 한국멀티미디어학회논문지 / Vol. 13, No. 9 / pp. 1299-1313 / 2010
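  • A top-k query retrieves the k objects that a user most wants from a database. The Partitioned-Layer Index (PL-index for short) is a representative method for processing top-k queries efficiently. The PL-index partitions the database into several smaller databases and builds a list of sublayers (a sublayer list) for each partition. The i-th sublayer of a partition consists of the objects that can appear in the top-i result for that partition. The query result is then obtained by merging the sublayer lists according to the given query. The PL-index has the advantage of reading very few objects from the database during query processing, but merging the sublayer lists incurs many random accesses, which degrades query performance in disk-based database environments. This paper proposes the Abstracted Partitioned-Layer Index (APL-index for short), which greatly improves the query performance of the PL-index in disk-based environments by reducing the number of random accesses. First, each sublayer of the PL-index is abstracted into a virtual (point) object, which transforms the sublayer lists into lists of such point objects (that is, the APL-index). Next, query processing is performed virtually on the APL-index to predict which sublayers will be accessed during actual query processing. The predicted sublayers are then read in at once for each sublayer list, reducing the random accesses incurred by the PL-index. Experiments on synthetic and real data show that the proposed APL-index can greatly reduce the number of random accesses of the PL-index.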
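The partition-then-merge idea behind this abstract can be illustrated with a minimal Python sketch. It is not the PL-index or APL-index itself: it builds no sublayers and models no disk access. It assumes a linear scoring function where lower scores are preferred, and the names `score`, `top_k`, and `partitioned_top_k` are hypothetical.

```python
import heapq

def score(obj, weights):
    # Linear scoring function; "lower is better" is an assumption of this sketch.
    return sum(w * x for w, x in zip(weights, obj))

def top_k(objects, weights, k):
    # The k best objects, ordered from best to worst.
    return heapq.nsmallest(k, objects, key=lambda o: score(o, weights))

def partitioned_top_k(partitions, weights, k):
    # Each partition contributes at most k candidates; the global top-k
    # is always contained in the union of the per-partition top-k results.
    candidates = []
    for part in partitions:
        candidates.extend(top_k(part, weights, k))
    return top_k(candidates, weights, k)

partitions = [
    [(1, 9), (4, 4), (8, 2)],
    [(2, 3), (7, 7), (5, 1)],
]
weights = (0.5, 0.5)
result = partitioned_top_k(partitions, weights, k=2)
print(result)
```

Merging per-partition candidates gives the same answer as scanning the whole database, while each partition only has to surface its own k best objects.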

Bit-map Indexes and Their Selection Problem for Efficient Processing of Star Joins in Object Databases

  • 조완섭;정태성;이현철;장혜경;안명상
    • Journal of Information Technology Applications and Management / Vol. 10, No. 2 / pp. 19-31 / 2003
  • We propose an indexing technique and an index selection algorithm for optimal OLAP query processing in object database systems. Although there are many research results on OLAP query processing in relational database systems, little research has been done on object database systems. Since OLAP queries represent complex business logic over a huge data warehouse, object database systems supporting OLAP queries should have higher performance. The proposed bitmap index structure extends conventional bitmap indexes to object databases and provides higher performance with lower space overhead. We also propose a linear-time solution to the index selection problem that will be used in the OLAP query optimization process.

The T-tree index recovery for distributed main-memory database systems in ATM switching systems

  • 이승선;조완섭;윤용익
    • 한국통신학회논문지 / Vol. 22, No. 9 / pp. 1867-1879 / 1997
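  • DREAM-S is a distributed main-memory database system that supports the real-time processing requirements of application programs on switch operation data in switching systems for ATM networks. DREAM-S has a client-server architecture in which disks are attached only to the server processor, and it provides a T-tree index structure to quickly retrieve the desired data from large volumes of data. This paper proposes a recovery technique for the T-tree index structure in DREAM-S. A main-memory database outperforms a disk-resident database, but on a system failure (an error such as a power outage) all data stored in main memory (relations and index structures) can be destroyed. A recovery technique that quickly restores the damaged main-memory database to a normal database state after a failure is therefore essential. The proposed technique keeps the T-tree index structures only in the main memory of each processor, which reduces the disk I/O overhead on the server processor that can become a bottleneck for the performance of the ATM switching system. In addition, after a system failure the server and all client systems recover their own T-tree(s) in parallel, so recovery remains fast even when there are many clients.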

Hyper-TH: An Index Mechanism for Real-Time Main Memory Database Systems

  • 민영수;신재룡;이병엽;유재수
    • 정보기술과데이타베이스저널 / Vol. 8, No. 2 / pp. 103-114 / 2001
  • In this paper, we propose an efficient index mechanism for real-time main memory database systems. Existing tree-based main memory index structures can effectively support range searches. However, they do not guarantee real-time behavior, because the difference between the access time of a node and the average access time can be high. Hash-based index structures always have a uniform random access time for simple searches and are very fast. However, they do not support range searches. To solve these problems, we propose a new index mechanism called Hyper Tree-Hash (Hyper-TH) that combines ECBH (Extendible Chained Bucket Hashing) and the T*-tree. ECBH can be dynamically extended and has a very fast access time. The T*-tree effectively supports range searches. We show through our experiments that the proposed mechanism outperforms other existing index structures.

Comorbidity Adjustment in Health Insurance Claim Database

  • 김경훈
    • 보건행정학회지 / Vol. 26, No. 1 / pp. 71-78 / 2016
  • The value of using health insurance claim databases is continuously rising in healthcare research. In studies where comorbidities act as a confounder, comorbidity adjustment is important. Yet researchers are faced with a myriad of options without sufficient information on how to adjust for comorbidity appropriately. The purpose of this study is to assist in selecting an appropriate index, look-back period, and data range for comorbidity adjustment. No consensus has been formed regarding the appropriate index, look-back period, and data range in comorbidity adjustment. This study recommends that the Charlson comorbidity index be selected when predicting an outcome such as mortality, and that Elixhauser's comorbidity measures be selected when analyzing the relations between various comorbidities and outcomes. A longer look-back period and the inclusion of all diagnoses from both inpatient and outpatient data led to an increased prevalence of comorbidities, but contributed little to model performance. A limited data range, such as the inclusion of primary diagnoses only, may compensate for limitations of the health insurance claim database, but could miss important comorbidities. This study suggests that all diagnoses from both inpatient and outpatient data, excluding rule-out diagnoses, be observed for a look-back period of at least 1 year prior to the index date. The comorbidity index, look-back period, and data range must be considered for comorbidity adjustment. To provide better guidance to researchers, follow-up studies should be conducted using these three factors for specific diseases and surgeries.

A Study on Database Indexing Techniques and Their Limitations

  • Nasridinov, Aziz;Park, Young-Ho
    • 한국정보처리학회:학술대회논문집 / 한국정보처리학회 2013 Spring Conference (춘계학술발표대회) / pp. 898-899 / 2013
  • There are numerous approaches to improving the performance of database systems. One of the most efficient is considered to be implementing an effective database indexing technique. The technique must balance the memory, storage resources, and processor of the database server according to the type, structure, physical organization, and cardinality of the data, the type of queries, and the number of competing transactions. The challenge is to find an appropriate index type that suits these requirements. In this paper, we provide an overview of the most used database indexing techniques, such as the B-tree index and the bitmap index, present an analysis of these techniques, and discuss their limitations.
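As a rough illustration of the bitmap index mentioned in this abstract, the following Python sketch keeps one bitmap per distinct column value and answers multi-column predicates with bitwise operations. It is a generic textbook-style sketch, not the structure analyzed in the paper; the class name `BitmapIndex`, its methods, and the sample columns are hypothetical.

```python
class BitmapIndex:
    """One integer bitmap per distinct value; bit i is set when row i holds the value."""

    def __init__(self, values):
        self.n = len(values)
        self.bitmaps = {}
        for row_id, value in enumerate(values):
            self.bitmaps[value] = self.bitmaps.get(value, 0) | (1 << row_id)

    def eq(self, value):
        # Bitmap of the rows whose column equals `value` (0 if the value never occurs).
        return self.bitmaps.get(value, 0)

    def rows(self, bitmap):
        # Decode a bitmap back into a sorted list of row ids.
        return [i for i in range(self.n) if (bitmap >> i) & 1]

colors = ["red", "blue", "red", "green", "blue"]
sizes = ["S", "M", "M", "S", "M"]
color_idx = BitmapIndex(colors)
size_idx = BitmapIndex(sizes)

# color = 'blue' AND size = 'M'
blue_and_m = color_idx.rows(color_idx.eq("blue") & size_idx.eq("M"))
# (color = 'red' OR color = 'green') AND size = 'S'
warm_and_s = color_idx.rows((color_idx.eq("red") | color_idx.eq("green")) & size_idx.eq("S"))
print(blue_and_m, warm_and_s)
```

Each distinct value needs its own bitmap, so this layout tends to suit low-cardinality columns; that trade-off is one reason the choice of index type depends on the cardinality of the data.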

Phantom Protection Method for Multi-dimensional Index Structures

  • Lee, Seok-Jae;Song, Seok-Il;Yoo, Jae-Soo
    • International Journal of Contents / Vol. 3, No. 2 / pp. 6-17 / 2007
  • Emerging modern database applications require multi-dimensional index structures to provide high performance for data retrieval. For a multi-dimensional index structure to be integrated into a commercial database system, efficient techniques that provide transactional access to data through this index structure are necessary. The techniques must support all degrees of isolation offered by the database system. In particular, degree 3 isolation, called "no phantom read," protects search ranges from concurrent insertions and the rollbacks of deletions. In this paper, we propose a new phantom protection method for multi-dimensional index structures that uses a multi-level grid technique. The proposed mechanism is independent of the type of multi-dimensional index structure, i.e., it can be applied to all types of index structures, such as tree-based, file-based, and hash-based index structures. In addition, it has a low development cost and achieves high concurrency with a low lock overhead. Various experiments show that the proposed method outperforms existing phantom protection methods for multi-dimensional index structures.
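The general idea of protecting a search range with grid-cell locks can be sketched in Python as follows. This is a simplified single-level grid with shared locks only, not the multi-level grid method proposed in the paper; the class `GridLockTable`, its methods, and the cell size are assumptions made for illustration.

```python
from collections import defaultdict

class GridLockTable:
    """Hypothetical single-level grid: range searches lock the cells they overlap."""

    def __init__(self, cell_size):
        self.cell_size = cell_size
        self.shared = defaultdict(set)  # cell -> ids of transactions holding a shared lock

    def _cell(self, x, y):
        return (int(x // self.cell_size), int(y // self.cell_size))

    def lock_range(self, txn, x_lo, y_lo, x_hi, y_hi):
        # A range search takes a shared lock on every cell its rectangle overlaps.
        cx_lo, cy_lo = self._cell(x_lo, y_lo)
        cx_hi, cy_hi = self._cell(x_hi, y_hi)
        for cx in range(cx_lo, cx_hi + 1):
            for cy in range(cy_lo, cy_hi + 1):
                self.shared[(cx, cy)].add(txn)

    def can_insert(self, txn, x, y):
        # An insert is blocked if another transaction has searched this cell.
        holders = self.shared.get(self._cell(x, y), set())
        return not (holders - {txn})

    def release(self, txn):
        # Called at commit or abort: drop all locks held by the transaction.
        for holders in self.shared.values():
            holders.discard(txn)

table = GridLockTable(cell_size=10)
table.lock_range("T1", 5, 5, 25, 15)            # T1 searches a rectangle
blocked = not table.can_insert("T2", 12, 8)     # inside T1's range: would be a phantom
allowed = table.can_insert("T2", 35, 35)        # far from T1's range
table.release("T1")
after_release = table.can_insert("T2", 12, 8)
print(blocked, allowed, after_release)
```

Because the locks live in a separate grid rather than in index nodes, a scheme of this kind does not depend on the index structure underneath. The cost is coarse granularity: an insert that falls in a locked cell but outside the actual search rectangle is also blocked.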

The Development of Quantitative Audit System for Safety Management Systems based on Accident Database

  • 안성준;이창준
    • 한국안전학회지 / Vol. 28, No. 1 / pp. 40-46 / 2013
  • In the chemical process industries, accidents have a high potential to produce catastrophic results. Safety management for accident prevention therefore plays a crucial role in guaranteeing process safety. For these reasons, many systematic methods for safety management systems have been widely employed in the field of chemical processes. PSM (process safety management) is one of the most representative methods. The audit system, which is one of the components of the PSM system, evaluates the performance of the PSM system. However, most existing safety audit systems are not systematic and are performed based on the knowledge and experience of various specialists. Moreover, the safety audit is performed only on each independent technical component. As a result, the outcome of a safety audit is not a quantitative index but only a series of commentaries, and it is very difficult to compare it with other plants or industries. In this study, a novel systematic method and an index-based accident database for the quantitative assessment of safety management system audits are proposed. First, the elements of the safety audit are changed from technical methods to categories of the accident database. The F-N curve of each category of the accident database is employed to derive the index for quantitative assessment. The Accidental Factor Risk Index (AFRI) is suggested for evaluating the effect of each element in the accident database and the safety audit system. The safety audit can be modified according to the proposed index.