• 제목/요약/키워드: summary database

검색결과 82건 처리시간 0.026초

메타 데이타베이스와 관리기의 설계 및 구현-통계 데이타베이스를 중심으로 (The Design and Implementation of Meta database and manager)

  • 안성욱
    • 자연과학논문집
    • /
    • 제8권1호
    • /
    • pp.109-114
    • /
    • 1995
  • 통계 데이타베이스의 효율적 이용을 위해 통계 분석을 위한 요약 정보를 미리 계산하여 저장함으로써 사용자에게 빠른 응답시간내에 통계 정보를 제공하려는 요약 데이타베이스와 이의 효율적인 관리와 사용의 편리를 위한 메타 데이타베이스가 생성되고 관리되어야만 한다. 요약 데이타베이스를 효율적으로 이용한 통계 분석 작업의 환경과 사용자의 편이성을 지원하기 위하여 계층 구조 형태인 데이타 사전/디렉토리의 형태로 독립적으로 운영되는 메타 데이타베이스와 관리기의 설계 및 구현 작업과 이를 이용한 운영 방법 등이 제시되었다.

  • PDF

Development of Practical Data Mining Methods for Database Summarization

  • Lee, Do-Heon
    • 정보기술과데이타베이스저널
    • /
    • 제4권2호
    • /
    • pp.33-45
    • /
    • 1998
  • Database summarization is the procedure to obtain generalized and representative descriptions expressing the content of a large amount of database at a glance. We present a top-down summary refinement procedure to discover database summaries. The procedure exploits attribute concept hierarchies that represent ISA relationships among domain concepts. It begins with the most generalized summary and proceeds to find more specialized ones by stepwise refinements. This top-down paradigm reveals at least two important advantages compared to the previous bottom-up methods. Firstly, it provides a natural way of reflecting the user's own discovery preference interactively. Secondly, it does not produce too large intermediate result that makes it hard for the bottom-up approach to be applied in practical environment. The proposed procedure can also be easily extended for distributed databases. Information content measure of a database summary is derived in order to identify more informative summaries among the discovered results.

A Pattern Summary System Using BLAST for Sequence Analysis

  • Choi, Han-Suk;Kim, Dong-Wook;Ryu, Tae-W.
    • Genomics & Informatics
    • /
    • 제4권4호
    • /
    • pp.173-181
    • /
    • 2006
  • Pattern finding is one of the important tasks in a protein or DNA sequence analysis. Alignment is the widely used technique for finding patterns in sequence analysis. BLAST (Basic Local Alignment Search Tool) is one of the most popularly used tools in bio-informatics to explore available DNA or protein sequence databases. BLAST may generate a huge output for a large sequence data that contains various sequence patterns. However, BLAST does not provide a tool to summarize and analyze the patterns or matched alignments in the BLAST output file. BLAST lacks of general and robust parsing tools to extract the essential information out from its output. This paper presents a pattern summary system which is a powerful and comprehensive tool for discovering pattern structures in huge amount of sequence data in the BLAST. The pattern summary system can identify clusters of patterns, extract the cluster pattern sequences from the subject database of BLAST, and display the clusters graphically to show the distribution of clusters in the subject database.

통계 데이타베이스의 효율적 관리를 위한 관계형데이타베이스 관리 시스템에의 전위시스템 설계 (The Design of Front-end System to RDBMS for Effective Management of Statistical Database)

  • 안성옥;김용호
    • 자연과학논문집
    • /
    • 제5권2호
    • /
    • pp.25-32
    • /
    • 1992
  • 통계 데이타 베이스는 데이타가 단순한 통계치일 뿐만 아니라, 일반적인 통계처리에서 필요한 통계 분석을 위해 주로 사용되는 대량의 데이타 베이스를 말한다. 통계 데이타 베이스를 관리하기 위해 기존의 범용 데이타 베이스 관리 시스템을 그대로 이용하기에는 데이타 저장과 액세스의 비효율성, 사용의 편이성의 부족과 질의어 등의 부족으로 인해 사용자의 요구를 충족시키지 못해, 새로운 관리 방법의 필요성이 요구되어 왔다. 독자적 개발에 의한 새로운 소프트웨어로써 통계 데이타 베이스를 관리할 때의 실제 이용하기 어려운 현실적 제고를 고려하여, 이 논문에서는 관계형 데이타 베이스 시스템에의 전위 시스템인 SM-F 시스템을 설계하여, 이를 이용하여 통계 데이타 베이스를 관리하는 방법을 제시하였다. 이 시스템은 통계 데이타 베이스의 효율을 고려한 시멘틱 모델인 GROS 모델을 사용하며 통계분석을 지원하고 통계 요약 정보를 제공하기 위해, 메타 데이타 베이스와 요약 데이타 베이스를 저장하고 운영한다.

  • PDF

새로운 약물전달체계 회사 데이터베이스의 구축 (Newly Established Drug Delivery Systems Company Database)

  • 한인구;정혜선
    • Journal of Pharmaceutical Investigation
    • /
    • 제38권6호
    • /
    • pp.429-432
    • /
    • 2008
  • Drug delivery systems (DDS) have entered mainstream in the pharmaceutical industry in the recent years. Major pharmaceutical companies as well as small or medium-sized biotechnology companies are developing various DDS-based products. We have established Drug Delivery System Company Database, which is an online searchable database of companies that develop DDS-based products and technologies or supply formulations and/or materials. Company summary, products and key technologies are listed in the database. DDS technology fields also include administration routes and indications of drugs. DDS terminologies, Statistical analysis, Useful Links, Glossary and Comments pages are also provided.

실시간 전력 검침 정보의 시계열정보 통계처리 성능 및 데이터 품질 향상 방안 설계 (A Study on Improvement Method for Statistical Process and Quality of Electric Demand Load Profile)

  • 고종민;양일권;정남준;진성일
    • 전기학회논문지
    • /
    • 제57권11호
    • /
    • pp.2080-2085
    • /
    • 2008
  • KEPCO's AMR (Automatic Meter Reading) is a system that performs the real-time inspection and management of the 15-minute load profile of electric power consumption through a wired and/or wireless network such as CDMA. It has been utilized widely for real-time collection and data analysis. So far, KEPCO has focused on establishing wireless networks using CDMA and collecting data in real time but failed to consider sufficiently performances that can improve the quality of the original data required in terms of data utilization as well as establish the summary information. In this paper, we are going to show the functions that improve data quality by recording the final renewal time of any erroneous data and maintaining such data lists to use them in the rebuilding of summary information. The goals are to reduce any load applied mainly on the DBMS (Database Management System) of AMR, to enable the real-time performance of establishment in the summary information, and to obtain high-quality inspection data. The performance evaluation result has revealed a 10-fold improvement compared to the traditional disk-based DBMS system when the summary information is established.

위암환자를 위한 간호 데이터베이스 개발 (Development of the Nursing Database for Gastric Cancer Patients)

  • 정귀임;이병숙
    • 간호행정학회지
    • /
    • 제7권3호
    • /
    • pp.571-588
    • /
    • 2001
  • Purpose : This study was to develop the nursing database for gastric cancer patients for clinical application. Method : Nursing data that development of this data base is comprehensive connected with gastric cancer patient nursing process frame to foundation as classification. Result : Each stage was processed based on the System Development Life Cycle. At the Strategy Planning stage, gastric cancer patient nursing process were analyzed. At the system Analysis Stage, database flowchart was drawn up based on frame of nursing process was drawn up. At the system Design Stage, a system was developed based on the flowchart and named the Nursing Database. The Nursing Database consisted of the patient's Basic Information, Patient's Nursing History, Discharge summary, Nursing Assessment, Nursing Diagnosis, Nursing Intervention/activity, Nursing Evaluation, Statics, Code Registration. Each element in flowchart was coded and made into a database. Nursing Assessment classified according to Gorden's Health Pattern Typology, and nursing diagnosis draws the standard 27 name of Hanguls and connected with nursing assessment. Nursing intervention and nursing activity draw 192 of thing that present in NIC, connected this with nursing assessment. Nursing evaluation is linked with nursing assessment, diagnosis and intervention by achievement availability of nursing goals. Conclusion : The biggest advantage of this database nursing process that can manage nursing information exactly and rapidly to foundation be.

  • PDF

공간 선택률 추정을 위한 압축 히스토그램 기법 (A Compressed Histogram Technique for Spatial Selectivity Estimation)

  • 정재두;지정희;류근호
    • 한국공간정보시스템학회:학술대회논문집
    • /
    • 한국공간정보시스템학회 2004년도 국내 LBS 기술개발 및 표준화 동향세미나
    • /
    • pp.69-74
    • /
    • 2004
  • Selectivity estimation for spatial query is very important process in finding the most efficient execution plan. Many works have been performed to estimate accurately selectivity. Although they deal with some problems such as false-count, multi-count, they require a large amount of memory to retain accurate selectivity, so they can not get good results in little memory environments such as mobile-based small database. In order to solve this problem, we propose a new technique called MW histogram which is able to compress summary data and get reasonable results. It also has a flexible structure to react dynamic update. The experimental results showed that the MW histogram has lower relative error than MinSkew histogram and gets a good selectivity in little memory.

  • PDF

Selectivity Estimation for Spatial Databases

  • Chi, Jeong-Hee;Lee, Jin-Yul;Ryu, Keun-Ho
    • 대한원격탐사학회:학술대회논문집
    • /
    • 대한원격탐사학회 2003년도 Proceedings of ACRS 2003 ISRS
    • /
    • pp.766-768
    • /
    • 2003
  • Selectivity estimation for spatial query is curial in Spatial Database Management Systems(SDBMS). Many works have been performed to estimate accurate selectivity. Although they deal with some problems such as false-count, multi-count arising from properties of spatial dataset, they can not get such effects in little memory space.* Therefore, we need to compress spatial dataset into little memory. In this paper, we propose a new technique called MW Histogram which is able to compress summary data and get reasonable results. Our method is based on two techniques:(a)MinSkew partitioning algorithm which deal with skewed spatial datasets. efficiently (b) Wavelet transformation which compression effect is proven. We evaluate our method via real datasets. The experimental result shows that the MW Histogram has the ability of providing estimates with low relative error and retaining the similar estimates even if memory space is small.

  • PDF

인터넷 질의 처리를 위한 웨이블릿 변환에 기반한 통합 요약정보의 관리

  • 조문증;황규영;김상욱;심규석
    • 한국정보과학회논문지:데이타베이스
    • /
    • 제28권4호
    • /
    • pp.702-714
    • /
    • 2001
  • 최근, 인터넷 기술의 급격한 발전으로 인하여 다수의 정보원들을 처리 대상으로 하는 인터넷 질 의의 사용이 점차 확대되고 있다. 인터넷 질의 처리를 위해서는 여러 정보원들에 분산된 전체 데이타분포를 함축적으로 표현한 통합 요약정보가 필요하다 본 논문에서는 웨이블릿 변환을 기반으로 한 통합 요약정보의 관리 및 이를 이용한 인터넷 질의 최적처리에 관하여 논의한다. 통합 요약정보의 구성을 위한 가장 단순한 방법은 각 정보원에 분산된 데이타분포들을 합병한 후, 이를 기반으로 퉁합 요약정보를 구성하는 것이다. 그러나 이 방법은 큰 용량의 데이타분포를 전송, 저장. 통합하는 비용이 매우 크므로 실용적이지 야다. 본 논문 에서는 이러한 문점을 극복하기 위하여 웨이블릿 변환을 기반으로 요약정보들을 합병함으로써 통합 요약 정보를 구성하는 새로운 방법과 이를 이용한 인터넷 질의 최적화 방안을 제시한다. 웨이블릿 요약정보는 합 병 조건을 만족하도록 변환되며. 합병 과정이 웨이블릿의 특성으로 인하여 매우 단순하다는 장점을 갖는다 본 논문에서는 제안된 방법으로 구성된 통합 요약정보의 오타 상한선을 정량적으로 유도한다. 제안된 방법에 대한 실험 결과에 의하면, 히스토그램 요약정보의 합병과 웨이블릿 요약정보의 합병을 비교한 선택률 추정 실험은 통합 히스토그램에 비해 퉁합 웨이블릿 요약정보가 1.6 ~ 5.5배 더 정확하다는 결과를 보였다 또한,56개개의 정보원이 참여하는 인터넷 top-N 질의를 처리할 때, 통합 요약정보를 사용하지 않는 방법과 비교하 여 이를 사용하는 경우 약 44배의 성능 개건 효과를 보였다.

  • PDF