Search | Korea Science

A Web Services-based Web OLAP Cube Browser (웹 서비스 기반의 웹 OLAP 큐브 브라우저)

Bae, Eun-Joo;Kim, Myung-Mi;Kim, Myung
- Proceedings of the Korea Information Processing Society Conference
- /
- 2002.11c
- /
- pp.1819-1822
- /
- 2002
OLAP(On-line Analytical Processing)은 데이터를 다차원적으로 집계하여 그 결과를 온라인으로 사용자에게 제공함으로써 고부가가치 창출에 사용되는 비즈니스 인텔리젼스 기술 중의 하나이다. 질의 처리 시간 제약 조건을 만족하기 위해 OLAP 시스템들은 다차원적 집계 결과를 미리 계산하여 저장해 둔다. 이 때 저장되는 데이터를 OLAP 큐브(cube)라고 하며, 데이터의 분석은 주로 큐브를 브라우징하면서 이루어진다. 본 연구에서는 웹 서비스를 통해 큐브를 브라우징할 수 있는 ‘웹 OLAP 큐브 브라우저(WOCB)'를 개발하였다. WOCB는 본 연구팀이 개발한 웹 서비스 기반의 OLAP API인 XMLMD를 사용하여 개발되었으며, 웹 서비스를 통해 전송되는 OLAP 큐브 데이터를 XML, HTML, 엑셀, 그래프 등의 다양한 형태로 검색할 수 있는 기능을 제공한다. 본 연구 결과는 이기종 플랫폼 환경에 분산되어 있는 OLAP 서버들을 통합할 때 하나의 컴포넌트로 사용될 수 있다는 장점을 갖는다.
PDF

Efficient Creation of Data Cube Using Hash Table in Data Warehouse (데이터 웨어하우스에서 해쉬 테이블을 이용한 효율적인 데이터 큐브 생성 기법)

Kim Hyungsun;You Byeongseob;Lee JaeDong;Bae Haeyoung
- Proceedings of the Korean Information Science Society Conference
- /
- 2005.11b
- /
- pp.211-213
- /
- 2005
데이터 웨어하우스는 축적된 대량의 데이터를 분석하여 의사결정을 지원하는 시스템이다. 의사결정을 위한 대량의 데이터 분석은 많은 비용을 요구하므로, 질의 처리 성능을 높이고 의사 결정자에게 빠른 응답을 제공하는 효율적인 데이터 큐브 생성 기법이 연구되었다. 기존 기법으로는 Multiway Array 기법과 H-Cubing 기법이 있다. Multiway Array 기법은 다차원 집계 연산에 필요한 모든 데이터를 배열로 저장하는 것으로 데이터의 양이 많아질수록 메모리 사용이 증가한다. H-Cubing 기법은 Hyper-Tree를 기반으로 튜플을 트리로 구축하므로 모든 튜플을 트리로 구축해야 하는 비용이 증가한다. 본 논문에서는 데이터 웨어하우스에서 해쉬 테이블을 이용한 효율적인 데이터 큐브 생성 기법을 제안한다. 제안 기법은 데이터 큐브 생성 시 필드 해쉬 테이블과 레코드 해쉬 테이블을 사용한다. 필드 해쉬 테이블은 저장될 레코드 순서 계산을 위하여 각 필드에 대해 레벨 값을 해쉬 테이블로 관리한다. 레코드 해쉬 테이블은 데이터 큐브 테이블에 저장될 레코드의 순서와 데이터 큐브 테이블에 저장하기 위한 임시 레코드의 위치를 관리한다. 필드 해쉬 테이블을 이용하여 다차원 데이터의 저장될 레코드 순서를 빠르게 찾아 저장함으로서 데이터 큐브의 생성속도가 향상된다. 또한 해쉬 테이블 만을 유지하면 되므로 메모리 사용량이 감소한다. 따라서 해쉬 테이블의 사용으로 데이터의 빠른 검색과 데이터 큐브 생성 요청에 빠른 응답이 가능하다.
PDF

Analysis of Multiple Dimension Hierarchies of OLAP Cubes (OLAP 큐브의 다중 차원계층구조에 대한 분석)

박영선;김지현;임윤선;김명
- Proceedings of the Korean Information Science Society Conference
- /
- 2004.10b
- /
- pp.115-117
- /
- 2004
롤업과 드릴다운은 다차원 데이터 분석을 위한 주요 연산으로, 각 차원에 정의된 계층구조를 통해 상세 데이터로부터 점차적으로 되는 정보를 분석가에게 제공한다. 이러한 연산 속도를 고속화하기 위해 OLAP 시스템은 사전에 집계 테이블들을 생성해 놓는다. 각 차원은 다중 계층구조를 가질 수도 있으며, 이런 경우 집계 테이블들을 모두 생성하게 되면 데이터 폭발 현상이 발생하게 된다. 본 연구에서는 다중계층 구조를 분류하고, 집계 테이블과 데이터 큐브의 크기를 계산하는 모델을 정립하였다. 이를 통해 분석가는 다중 계층구조에 따른 큐브 크기를 미리 예측할 수 있으며 계층 구조의 모양과 개수를 변경하여 데이터의 양을 조절할 수 있다.
PDF

H^-tree/H^-cubing-cubing: Improved Data Cube Structure and Cubing Method for OLAP on Data Stream (H^-tree/H^-cubing: 데이터 스트림의 OLAP를 위한 향상된 데이터 큐브 구조 및 큐빙 기법)

Chen, Xiangrui;Li, Yan;Lee, Dong-Wook;Kim, Gyoung-Bae;Bae, Hae-Young
- The KIPS Transactions:PartD
- /
- v.16D no.4
- /
- pp.475-486
- /
- 2009
Data cube plays an important role in multi-dimensional, multi-level data analysis. Meeting on-line analysis requirements of data stream, several cube structures have been proposed for OLAP on data stream, such as stream cube, flowcube, S-cube. Since it is costly to construct data cube and execute ad-hoc OLAP queries, more research works should be done considering efficient data structure, query method and algorithms. Stream cube uses H-cubing to compute selected cuboids and store the computed cells in an H-tree, which form the cuboids along popular-path. However, the H-tree layoutis disorderly and H-cubing method relies too much on popular path.In this paper, first, we propose $H^*$-tree, an improved data structure, which makes the retrieval operation in tree structure more efficient. Second, we propose an improved cubing method, $H^*$-cubing, with respect to computing the cuboids that cannot be retrieved along popular-path when an ad-hoc OLAP query is executed. $H^*$-tree construction and $H^*$-cubing algorithms are given. Performance study turns out that during the construction step, $H^*$-tree outperforms H-tree with a more desirable trade-off between time and memory usage, and $H^*$-cubing is better adapted to ad-hoc OLAP querieswith respect to the factors such as time and memory space.
https://doi.org/10.3745/KIPSTD.2009.16-D.4.475 인용 PDF KSCI

Volume Ray Casting Acceleration Method using Modified Marching Cubes Tables (변형된 마칭큐브 테이블을 이용한 볼륨 광선 투과법 가속화)

Lim, Suk-Hyun;Kim, Ju-Hwan;Shin, Byeong-Seok
- Journal of KIISE:Computer Systems and Theory
- /
- v.36 no.3
- /
- pp.210-216
- /
- 2009
Several empty-space leaping methods have been proposed for CPU-based volume ray casting. When sample points are located in semi-transparent cells, however, previous leaping methods perform unnecessary resamplings even if the scalar values on those points are confined within transparent range. A semi-transparent cells leaping method for volume ray casting using the Marching Cubes algorithm is proposed to solve this problem in our previous work. When a ray reaches a semi-transparent cell, our method performs in-out test between current sample point and the bounding box enclosing the triangles generated by the Marching Cubes. If the sample point lies on outside of the bounding box, we estimate the point is regarded as transparent. In this case, the ray advances to the next sample point without performing a resampling operation. We can frequently refer the tables for neighboring voxels, however, when we exploit conventional data structures of the Marching Cubes. We propose modified Marching Cubes tables for solving this problem.
PDF KSCI

Discovery-Driven Exploration Method in Lung Cancer 2-DE Gel Images Using the Data Cube (데이터 큐브를 이용한 폐암 2-DE 젤 이미지에서의 예외 탐사)

Shim, Jung-Eun;Lee, Won-Suk
- The KIPS Transactions:PartD
- /
- v.15D no.5
- /
- pp.681-690
- /
- 2008
In proteomics research, the identification of differentially expressed proteins observed under specific conditions is one of key issues. There are several ways to detect the change of a specific protein's expression level such as statistical analysis and graphical visualization. However, it is quiet difficult to handle the spot information of an individual protein manually by these methods, because there are a considerable number of proteins in a tissue sample. In this paper, using database and data mining techniques, the application plan of OLAP data cube and Discovery-driven exploration is proposed. By using data cubes, it is possible to analyze the relationship between proteins and relevant clinical information as well as analyzing the differentially expressed proteins by disease. We propose the measure and exception indicators which are suitable to analyzing protein expression level changes are proposed. In addition, we proposed the reducing method of calculating InExp in Discovery-driven exploration. We also evaluate the utility and effectiveness of the data cube and Discovery-driven exploration in the lung cancer 2-DE gel image.
https://doi.org/10.3745/KIPSTD.2008.15-D.5.681 인용 PDF KSCI

SPEC: Space Efficient Cubes for Data Warehouses (SPEC : 데이타 웨어하우스를 위한 저장 공간 효율적인 큐브)

Chun Seok-Ju;Lee Seok-Lyong;Kang Heum-Geun;Chung Chin-Wan
- Journal of KIISE:Databases
- /
- v.32 no.1
- /
- pp.1-11
- /
- 2005
An aggregation query computes aggregate information over a data cube in the query range specified by a user Existing methods based on the prefix-sum approach use an additional cube called the prefix-sum cube(PC), to store the cumulative sums of data, causing a high space overhead. This space overhead not only leads to extra costs for storage devices, but also causes additional propagations of updates and longer access time on physical devices. In this paper, we propose a new prefix-sum cube called 'SPEC' which drastically reduces the space of the PC in a large data warehouse. The SPEC decreases the update propagation caused by the dependency between values in cells of the PC. We develop an effective algorithm which finds dense sub-cubes from a large data cube. We perform an extensive experiment with respect to various dimensions of the data cube and query sizes, and examine the effectiveness and performance ot our proposed method. Experimental results show that the SPEC significantly reduces the space of the PC while maintaining a reasonable query performance.
PDF KSCI

Particle-Based Extended Marching Cubes with Efficient Quadratic Error Function (효율적인 2차 오차 함수를 이용한 입자 기반 Extended Marching Cubes)

Yu-Bin Kwon;Jong-Hyun Kim
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2024.01a
- /
- pp.387-390
- /
- 2024
본 논문에서는 효율적인 2차 오차 함수를 이용하여 입자 기반에서 EMC(Extended Marching Cubes) 알고리즘을 구현할 수 있는 새로운 알고리즘을 제안한다. Smoothing 커널(Kernels)을 통해 계산한 입자 평균 위치에서 레벨셋(Level-set)을 계산해 스칼라장을 구축한다. 그리고 난 뒤 SPH(Smoothed particle hydrodynamics)기반의 커널을 통해 밀도, 입자 평균 위치를 계산한다. 스칼라장을 이용해 등가 곡면(Isosurface)을 찾고 음함수로 표현된 표면을 구성한다. SPH 커널을 공간에서 미분하면 공간상의 어느 위치에서나 기울기를 계산할 수 있고, 이를 통해 얻어진 법선벡터를 이용하여 일반적인 EMC나 DC(Dual contouring)에서 사용하는 2차 오차 함수를 효율적으로 설계한다. 결과적으로 제안하는 방법은 메쉬와 같이 연결정보다 없는 입자 기반 데이터에서도 EMC 알고리즘을 구현하여 볼륨(Volume) 손실을 줄이고, 복잡한 음함수 표면을 표현할 수 있게 한다.
PDF

An Efficient ROLAP Cube Generation Scheme (효율적인 ROLAP 큐브 생성 방법)

Kim, Myung;Song, Ji-Sook
- Journal of KIISE:Databases
- /
- v.29 no.2
- /
- pp.99-109
- /
- 2002
ROLAP(Relational Online Analytical Processing) is a process and methodology for a multidimensional data analysis that is essential to extract desired data and to derive value-added information from an enterprise data warehouse. In order to speed up query processing, most ROLAP systems pre-compute summary tables. This process is called 'cube generation' and it mostly involves intensive table sorting stages. (1) showed that it is much faster to generate ROLAP summary tables indirectly using a MOLAP(multidimensional OLAP) cube generation algorithm. In this paper, we present such an indirect ROLAP cube generation algorithm that is fast and scalable. High memory utilization is achieved by slicing the input fact table along one or more dimensions before generating summary tables. High speed is achieved by producing summary tables from their smallest parents. We showed the efficiency of our algorithm through experiments.
PDF KSCI

Materialized View Selection for Datacube using Simplified Lattice in Datawarehouse (데이터웨어하우스에서 단순화된 격자를 이용한 데이터큐크의 실체뷰 선택)

Jang, Ji-Suk;Seo, Eun-Ju;Lee, Jeon-Yeong
- Journal of KIISE:Software and Applications
- /
- v.26 no.5
- /
- pp.604-612
- /
- 1999
데이터웨어하우스는 데이터분석을 위한 특수 목적의 데이터베이스로, 주로 데이터분석을 위한 질의(이하 OLAP 질의)가 던져진다. 그런데 OLAP질의는 수백만개의 레코드를 가지 테이블에 대해 하나 이상의 aggregation 함수와 group-by 연산자가 포함되므로, 질의 처리 시간은 수 분에서 수 시간이 걸린다. 이를 개선하기위하여 데이터 큐브를 구현함에 있어서의 문제는 디스크 공간이 한정되어 있기 때문에 평균 질의 처리 시간이 훨씬 짧아진다하더라도 모든 셀들을 실체화할수 없다. 따라서 한정된 디스크 공간을 최대한 활용하면서 가능한 빠른 평균 질의 처리 시간을 얻을 수 있도록 데이터 큐브의 일부만을 실체화 시켜야한다. 본 논문의 주제와 관련된 연구로는 Harinarayan[4] 이 제안한 greedy 알고리즘이 있다. 이 알고리즘은 1) 데이터 큐브를 격자구조로 표현한후, 2) 격자의 위에서부터 아래로 차례로 뷰들을 방문하면서 방문한 뷰가 실체화되 경우 데이터 큐브에 주는 이익을 계산한다. 3) 그 중 가장 이익이 큰 것을 선택한다. 3) 그중 가장 이익이 큰 것을 선택한다. 2)와 3)의 과정은 k개의 뷰를 선택할 경우, k번 반복된다. 이 알고리즘의 운영 시간은 데이터 큐브를 구성하는 뷰의 개수가 n개이고 그 중에서 k개를 실체화할 경우에 O(kn2)이다. 본 논문에서는 운영 시간을 향상시킨 수정된 greedy 알고리즘을 제안한다. 알고리즘 내부에서 실체화할 뷰를 선택할 때 격자를 단순화시킨 트리를 사용함으로써, 알고리즘 운영시간을 O(kn2)에서 O(kn)으로 향상시켰다.

Search Result 46, Processing Time 0.024 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)