• Title/Summary/Keyword: 다차원 데이터베이스

Search Result 147, Processing Time 0.018 seconds

Implementation of a Data Processing Method to Enhance the Quality and Support the What-If Analysis for Traffic History Data (교통이력 데이터의 품질 개선과 What-If 분석을 위한 자료처리 기법의 구현)

  • Lee, Min-Soo;Cheong, Su-Jeong;Choi, Ok-Ju;Meang, Bo-Yeon
    • The KIPS Transactions:PartD
    • /
    • v.17D no.2
    • /
    • pp.87-102
    • /
    • 2010
  • A vast amount of traffic data is produced every day from detection devices but this data includes a considerable amount of errors and missing values. Moreover, this information is periodically deleted before it could be used as important analysis information. Therefore, this paper discusses the implementation of an integrated traffic history database system that continuously stores the traffic data as a multidimensional model and increases the validity and completeness of the data via a flow of processing steps, and provides a what-if analysis function. The implemented system provides various techniques to correct errors and missing data patterns, and a what-if analysis function that enables the analysis of results under various conditions by allowing the flexible definition of various process related environment variables and combinations of the processing flows. Such what-if analysis functions dramatically increase the usability of traffic data but are not provided by other traffic data systems. Experimantal results for cleaning the traffic history data showed that it provides superior performance in terms of validity and completeness.

A Nonunique Composite Foreign Key-Based Approach to Fact Table Modeling and MDX Query Composing (비유일 외래키 조합 복합키 기반의 사실테이블 모델링과 MDX 쿼리문 작성법)

  • Yu, Han-Ju;Lee, Duck-Sung;Choi, In-Soo
    • Journal of the Korea Society of Computer and Information
    • /
    • v.12 no.1 s.45
    • /
    • pp.177-188
    • /
    • 2007
  • A star schema consists of a central fact table, which is surrounded by one or more dimension tables. Each row in the fact table contains a multi-part primary key(or a composite foreign key) along with one or more columns containing various facts about the data stored in the row. Each of the composit foreign key components is related to a dimensional table. The combination of keys in the fact table creates a composite foreign key that is unique to the fact table record. The composite foreign key, however, is rarely unique to the fact table retold in real-world applications, particularly in financial applications. In order to make the composite foreign key be the determinant in real-world application, some precalculation might be performed in the SQL relational database, and cached in the OLAP database. However, there are many drawbacks to this approach. In some cases, this approach might give users the wrong results. In this paper, an approach to fact table modeling and related MDX query composing, which can be used in real-world applications without performing any precalculation and gives users the correct results, is proposed.

  • PDF

Hippocratic XML Databases: A Model and Access Control Mechanism (히포크라테스 XML 데이터베이스: 모델 및 액세스 통제 방법)

  • Lee Jae-Gil;Han Wook-Shin;Whang Kyu-Young
    • Journal of KIISE:Databases
    • /
    • v.31 no.6
    • /
    • pp.684-698
    • /
    • 2004
  • The Hippocratic database model recently proposed by Agrawal et al. incorporates privacy protection capabilities into relational databases. Since the Hippocratic database is based on the relational database, it needs extensions to be adapted for XML databases. In this paper, we propose the Hippocratic XML database model, an extension of the Hippocratic database model for XML databases and present an efficient access control mechanism under this model. In contrast to relational data, XML data have tree-like hierarchies. Thus, in order to manage these hierarchies of XML data, we extend and formally define such concepts presented in the Hippocratic database model as privacy preferences, privacy policies, privacy authorizations, and usage purposes of data records. Next, we present a new mechanism, which we call the authorization index, that is used in the access control mechanism. This authorization index, which is Implemented using a multi-dimensional index, allows us to efficiently search authorizations implied by the authorization granted on the nearest ancestor using the nearest neighbor search technique. Using synthetic and real data, we have performed extensive experiments comparing query processing time with those of existing access control mechanisms. The results show that the proposed access control mechanism improves the wall clock time by up to 13.6 times over the top-down access control strategy and by up to 20.3 times over the bottom-up access control strategy The major contributions of our paper are 1) extending the Hippocratic database model into the Hippocratic XML database model and 2) proposing an efficient across control mechanism that uses the authorization index and nearest neighbor search technique under this model.

Index-based Searching on Timestamped Event Sequences (타임스탬프를 갖는 이벤트 시퀀스의 인덱스 기반 검색)

  • 박상현;원정임;윤지희;김상욱
    • Journal of KIISE:Databases
    • /
    • v.31 no.5
    • /
    • pp.468-478
    • /
    • 2004
  • It is essential in various application areas of data mining and bioinformatics to effectively retrieve the occurrences of interesting patterns from sequence databases. For example, let's consider a network event management system that records the types and timestamp values of events occurred in a specific network component(ex. router). The typical query to find out the temporal casual relationships among the network events is as fellows: 'Find all occurrences of CiscoDCDLinkUp that are fellowed by MLMStatusUP that are subsequently followed by TCPConnectionClose, under the constraint that the interval between the first two events is not larger than 20 seconds, and the interval between the first and third events is not larger than 40 secondsTCPConnectionClose. This paper proposes an indexing method that enables to efficiently answer such a query. Unlike the previous methods that rely on inefficient sequential scan methods or data structures not easily supported by DBMSs, the proposed method uses a multi-dimensional spatial index, which is proven to be efficient both in storage and search, to find the answers quickly without false dismissals. Given a sliding window W, the input to a multi-dimensional spatial index is a n-dimensional vector whose i-th element is the interval between the first event of W and the first occurrence of the event type Ei in W. Here, n is the number of event types that can be occurred in the system of interest. The problem of‘dimensionality curse’may happen when n is large. Therefore, we use the dimension selection or event type grouping to avoid this problem. The experimental results reveal that our proposed technique can be a few orders of magnitude faster than the sequential scan and ISO-Depth index methods.hods.

A Study on Development of Operational System for Oil Spill Prediction Model (유출유 확산 예측 모델의 상시 운용 체계 개발에 관한 연구)

  • Kim, Hye-Jin;Lee, Moon-Jin;Oh, Se-Woong;Kang, Joon-Mook
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.17 no.4
    • /
    • pp.375-382
    • /
    • 2011
  • There is no system to obtain the basic data and proceed data and user input interface is complex, thus there are some limitation to utilize the oil spill prediction model. It is difficult to build the scientific response strategy in order to respond oil spill accident rapidly because it is impossible to operate the oil spill prediction model any time. In this study, the optimum operational system for oil spil prediction model has been developed considering the present system. External real time data has been linked because of impossibility of building all basic data and minimum database has been build in this study. Through this data system, real time oil spill prediction model can be utilized. And the user interface has been designed to reduce the error of the interface between user and model and the output interface has been proposed to analyze the result of modeling at multidimensional aspect. While the system for oil spill prediction model as the result of this study has some uncertainties because of depending on external data, the thing that we can predict oil spill using operate the model rapidly as soon as the accident occurred can be meaning in the response field.

Implementation of Saemangeum Coastal Environmental Information System Using GIS (지리정보시스템을 이용한 새만금 해양환경정보시스템 구축)

  • Kim, Jin-Ah;Kim, Chang-Sik;Park, Jin-Ah
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.14 no.4
    • /
    • pp.128-136
    • /
    • 2011
  • To monitor and predict the change of coastal environment according to the construction of Saemangeum sea dyke and the development of land reclamation, we have done real-time and periodic ocean observation and numerical simulation since 2002. Saemangeum coastal environmental data can be largely classified to marine meteorology, ocean physics and circulation, water quality, marine geology and marine ecosystem and each part of data has been generated continuously and accumulated over about 10 years. The collected coastal environmental data are huge amounts of heterogeneous dataset and have some characteristics of multi-dimension, multivariate and spatio-temporal distribution. Thus the implementation of information system possible to data collection, processing, management and service is necessary. In this study, through the implementation of Saemangeum coastal environmental information system using geographic information system, it enables the integral data collection and management and the data querying and analysis of enormous and high-complexity data through the design of intuitive and effective web user interface and scientific data visualization using statistical graphs and thematic cartography. Furthermore, through the quantitative analysis of trend changed over long-term by the geo-spatial analysis with geo- processing, it's being used as a tool for provide a scientific basis for sustainable development and decision support in Saemangeum coast. Moreover, for the effective web-based information service, multi-level map cache, multi-layer architecture and geospatial database were implemented together.

Development of Multidimensional Analysis System for Bio-pathways (바이오 패스웨이 다차원 분석 시스템 개발)

  • Seo, Dongmin;Choi, Yunsoo;Jeon, Sun-Hee;Lee, Min-Ho
    • The Journal of the Korea Contents Association
    • /
    • v.14 no.11
    • /
    • pp.467-475
    • /
    • 2014
  • With the development of genomics, wearable device and IT/NT, a vast amount of bio-medical data are generated recently. Also, healthcare industries based on big-data are booming and big-data technology based on bio-medical data is rising rapidly as a core technology for improving the national health and aged society. A pathway is the biological deep knowledge that represents the relations of dynamics and interaction among proteins, genes and cells by a network. A pathway is wildly being used as an important part of a bio-medical big-data analysis. However, a pathway analysis requires a lot of time and effort because a pathway is very diverse and high volume. Also, multidimensional analysis systems for various pathways are nonexistent even now. In this paper, we proposed a pathway analysis system that collects user interest pathways from KEGG pathway database that supports the most widely used pathways, constructs a network based on a hierarchy structure of pathways and analyzes the relations of dynamics and interaction among pathways by clustering and selecting core pathways from the network. Finally, to verify the superiority of our pathway analysis system, we evaluate the performance of our system in various experiments.

Building the Data Mart on Antibiotic Usage for Infection Control (감염관리를 위한 항생제 사용량 데이터마트의 구축)

  • Rheem, Insoo
    • Korean Journal of Clinical Laboratory Science
    • /
    • v.48 no.4
    • /
    • pp.348-354
    • /
    • 2016
  • Data stored in hospital information systems has a great potential to improve adequacy assessment and quality management. Moreover, an establishment of a data warehouse has been known to improve quality management and to offer help to clinicians. This study constructed a data mart that can be used to analyze antibiotic usage as a part of systematic and effective data analysis of infection control information. Metadata was designed by using the XML DTD method after selecting components and evaluation measures for infection control. OLAP-a multidimensional analysis tool-for antibiotic usage analysis was developed by building a data mart through modeling. Experimental data were obtained from data on antibiotic usage at a university hospital in Cheonan area for one month in July of 1997. The major components of infection control metadata were antibiotic resistance information, antibiotic usage information, infection information, laboratory test information, patient information, and infection related costs. Among them, a data mart was constructed by designing a database to apply antibiotic usage information to a star schema. In addition, OLAP was demonstrated by calculating the statistics of antibiotic usage for one month. This study reports the development of a data mart on antibiotic usage for infection control through the implementation of XML and OLAP techniques. Building a conceptual, structured data mart would allow for a rapid delivery and diverse analysis of infection control information.

The Use of Music for Healthy Elderly : Literature Review (건강한 노화를 위한 음악활용 : 국내·외 연구 고찰)

  • Lee, Yea-Bit-Na;Kim, Soo Ji
    • The Journal of the Korea Contents Association
    • /
    • v.16 no.4
    • /
    • pp.335-346
    • /
    • 2016
  • This study aimed to review Korean-written and English-written studies on music intervention for older adults in terms of research variables and type of music application. Electronic databases and music-related journals were searched and a total of 40 studies, 19 Korean-written and 21 English-written studies, were included for analysis. The results showed that music intervention studies for older adults began to be conducted in Korea in the late 1990s and have continuously increased thus far, which is similar to studies overseas. Emotional area, especially depression, was targeted the most frequently in both studies in Korea and overseas. While singing of traditional folk song or popular music were applied the most in Korean studies, listening to classical music or relaxing music was primarily observed in studies overseas. While both studies in Korea and overseas used original music frequently, Korean-written studies used live music and showed tendency to select music based on preferences of participants more than studies overseas. These results indicate that growing population of older adults and increased needs for successful aging have been sufficiently reflected in studies on music-based intervention. Cultural and societal factors might affect the differences between studies in Korea and studies overseas. Future studies on music intervention considering cultural and societal needs of this population from multidimensional approach were also suggested.

Design and Implementation of High-dimensional Index Structure for the support of Concurrency Control (필터링에 기반한 고차원 색인구조의 동시성 제어기법의 설계 및 구현)

  • Lee, Yong-Ju;Chang, Jae-Woo;Kim, Hang-Young;Kim, Myung-Joon
    • The KIPS Transactions:PartD
    • /
    • v.10D no.1
    • /
    • pp.1-12
    • /
    • 2003
  • Recently, there have been many indexing schemes for multimedia data such as image, video data. But recent database applications, for example data mining and multimedia database, are required to support multi-user environment. In order for indexing schemes to be useful in multi-user environment, a concurrency control algorithm is required to handle it. So we propose a concurrency control algorithm that can be applied to CBF (cell-based filtering method), which uses the signature of the cell for alleviating the dimensional curse problem. In addition, we extend the SHORE storage system of Wisconsin university in order to handle high-dimensional data. This extended SHORE storage system provides conventional storage manager functions, guarantees the integrity of high-dimensional data and is flexible to the large scale of feature vectors for preventing the usage of large main memory. Finally, we implement the web-based image retrieval system by using the extended SHORE storage system. The key feature of this system is platform-independent access to the high-dimensional data as well as functionality of efficient content-based queries. Lastly. We evaluate an average response time of point query, range query and k-nearest query in terms of the number of threads.