• Title/Summary/Keyword: multidimensional data

Search Result 653, Processing Time 0.026 seconds

Restricted maximum likelihood estimation of a censored random effects panel regression model

  • Lee, Minah;Lee, Seung-Chun
    • Communications for Statistical Applications and Methods
    • /
    • v.26 no.4
    • /
    • pp.371-383
    • /
    • 2019
  • Panel data sets have been developed in various areas, and many recent studies have analyzed panel, or longitudinal data sets. Maximum likelihood (ML) may be the most common statistical method for analyzing panel data models; however, the inference based on the ML estimate will have an inflated Type I error because the ML method tends to give a downwardly biased estimate of variance components when the sample size is small. The under estimation could be severe when data is incomplete. This paper proposes the restricted maximum likelihood (REML) method for a random effects panel data model with a censored dependent variable. Note that the likelihood function of the model is complex in that it includes a multidimensional integral. Many authors proposed to use integral approximation methods for the computation of likelihood function; however, it is well known that integral approximation methods are inadequate for high dimensional integrals in practice. This paper introduces to use the moments of truncated multivariate normal random vector for the calculation of multidimensional integral. In addition, a proper asymptotic standard error of REML estimate is given.

A Physical Design Method of Storage Structures for MOLAP Systems of Data Warehouse (데이터 웨어하우스의 다차원 온라인 분석처리 시스템을 위한 저장구조의 물리적 설계기법)

  • Lee Jong-Hak
    • Journal of Korea Multimedia Society
    • /
    • v.8 no.3
    • /
    • pp.297-312
    • /
    • 2005
  • Aggregation is an operation that plays a key role in multidimensional OLAP (MOLAP) systems of data warehouse. Existing aggregation operations in MOLAP have been proposed for file structures such as multidimensional arrays. These tile structures do not work well with skewed distributions. This paper presents a physical design methodology for storage structures ni MOLAP that use the multidimensional tile organizations adapting to a skewed distribution. In uniform data distribution, we first show that the performance of multidimensional analytical processing is highly affected by the similarity of the shapes between query regions and page regions in the domain space of the multidimensional file organizations. And than, in skewed distributions, we reflect the effect of data distributions on the design by using the shapes of the normalized query regions that are weighted with data density of those query regions. Finally, we demonstrate that the physical design methodology theoretically derived is indeed correct in real environments. In the two-dimensional file organizations, the results of experiments indicate that the performance of the proposed method is enhanced by more than seven times over the conventional method. We expect that the performance will be more enhanced when the dimensionality is more than two. The result confirms that the proposed physical design methodology is useful in a practical way.

  • PDF

A Bitmap Index for Chunk-Based MOLAP Cubes (청크 기반 MOLAP 큐브를 위한 비트맵 인덱스)

  • Lim, Yoon-Sun;Kim, Myung
    • Journal of KIISE:Databases
    • /
    • v.30 no.3
    • /
    • pp.225-236
    • /
    • 2003
  • MOLAP systems store data in a multidimensional away called a 'cube' and access them using way indexes. When a cube is placed into disk, it can be Partitioned into a set of chunks of the same side length. Such a cube storage scheme is called the chunk-based MOLAP cube storage scheme. It gives data clustering effect so that all the dimensions are guaranteed to get a fair chance in terms of the query processing speed. In order to achieve high space utilization, sparse chunks are further compressed. Due to data compression, the relative position of chunks cannot be obtained in constant time without using indexes. In this paper, we propose a bitmap index for chunk-based MOLAP cubes. The index can be constructed along with the corresponding cube generation. The relative position of chunks is retained in the index so that chunk retrieval can be done in constant time. We placed in an index block as many chunks as possible so that the number of index searches is minimized for OLAP operations such as range queries. We showed the proposed index is efficient by comparing it with multidimensional indexes such as UB-tree and grid file in terms of time and space.

A Study on the Effective Spatial Data Warehouse (효율적인 공간 데이타 웨어하우스에 관한 연구)

  • 이기영
    • Journal of the Korea Society of Computer and Information
    • /
    • v.3 no.4
    • /
    • pp.126-131
    • /
    • 1998
  • Spatial data warehouse, whose importance is being increased, is composed of huge amounts of historical spatial data for organizational decision making and it also allows users to obtain useful geospatial information through analyzing and summmarizing spatial data. In this paper, we survey effective spatial multidimensional model which is based on virtual scenario for spatial data warehouse modelling. Therefore, we describe spatial multidimensional analytical query which provide multiple analytical functions according tom user's requests.

  • PDF

OLAP System and Performance Evaluation for Analyzing Web Log Data (웹 로그 분석을 위한 OLAP 시스템 및 성능 평가)

  • 김지현;용환승
    • Journal of Korea Multimedia Society
    • /
    • v.6 no.5
    • /
    • pp.909-920
    • /
    • 2003
  • Nowadays, IT for CRM has been growing and developed rapidly. Typical techniques are statistical analysis tools, on-line multidimensional analytical processing (OLAP) tools, and data mining algorithms (such neural networks, decision trees, and association rules). Among customer data, web log data is very important and to use these data efficiently, applying OLAP technology to analyze multi-dimensionally. To make OLAP cube, we have to precalculate multidimensional summary results in order to get fast response. But as the number of dimensions and sparse cells increases, data explosion occurs seriously and the performance of OLAP decreases. In this paper, we presented why the web log data sparsity occurs and then what kinds of sparsity patterns generate in the two and t.he three dimensions for OLAP. Based on this research, we set up the multidimensional data models and query models for benchmark with each sparsity patterns. Finally, we evaluated the performance of three OLAP systems (MS SQL 2000 Analysis Service, Oracle Express and C-MOLAP).

  • PDF

Multidimensional Adaptive Noise Cancellation of Stress ECG Signal

  • Gautam, Alka;Lee, Young-Dong;Chung, Wan-Young
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2008.05a
    • /
    • pp.285-288
    • /
    • 2008
  • In ubiquitous computing environment the biological signal ECG (Electrocardiogram signal) is usually recorded with noise components. Adaptive interference (or noise) canceller do adaptive filtering of the noise reference input to maximally match and subtract out noise or interference from the primary (signal plus noise) input thereby adaptively eliminate unwanted interference from the ECG signal. Measured Stress ECG (or exercise ECG signal) signal have three major noisy component like baseline wander noise, motion artifact noise and EMG (Electro-mayo-cardiogram) noise. These noises are not only distorted signal but also root of incorrect diagnosis while ECG data are analyzed. Motion artifact and EMG noises behave like wide band spectrum signals, and they considerably do overlapping with the ECG spectrum. Here the multidimensional adaptive method used for filtering which is more effective to improve signal to noise ratio.

  • PDF

Multidimensional Differential-Linear Cryptanalysis of ARIA Block Cipher

  • Yi, Wentan;Ren, Jiongjiong;Chen, Shaozhen
    • ETRI Journal
    • /
    • v.39 no.1
    • /
    • pp.108-115
    • /
    • 2017
  • ARIA is a 128-bit block cipher that has been selected as a Korean encryption standard. Similar to AES, it is robust against differential cryptanalysis and linear cryptanalysis. In this study, we analyze the security of ARIA against differential-linear cryptanalysis. We present five rounds of differential-linear distinguishers for ARIA, which can distinguish five rounds of ARIA from random permutations using only 284.8 chosen plaintexts. Moreover, we develop differential-linear attacks based on six rounds of ARIA-128 and seven rounds of ARIA-256. This is the first multidimensional differential-linear cryptanalysis of ARIA and it has lower data complexity than all previous results. This is a preliminary study and further research may obtain better results in the future.

An Analysis of Multidimensional Productivity for the Shipbuilding Performance (조선 성과 측정을 위한 다차원 생산성의 분석)

  • Kim, Yearnmin
    • Korean Management Science Review
    • /
    • v.34 no.2
    • /
    • pp.57-66
    • /
    • 2017
  • The purpose of this study is to analyze the multidimensional productivity of the shipbuilding performance and to explain the role of different factors, such as man-hour, dock period, number of building block, launching process rate, automatic welding percent, and drawing fault rate which are important production-related variables in most shipbuilding companies. The shipbuilding productivity is obtained using Data Envelopment Analysis (DEA) approach. Then, a Tobit model is considered to measure the influence of different factors on the measured productivity. The results reveal that this productivity measure can substitute a representative shipbuilding productivity index (CGT/man-hour) in shipbuilding industries. Also, this multidimensional productivity analysis using DEA and Tobit reveals complex relationships between production-related variables and CGT and sale.

Application of Multidimensional Scaling Method for E-Commerce Personalized Recommendation (전자상거래 개인화 추천을 위한 다차원척도법의 활용)

  • Kim Jong U;Yu Gi Hyeon;Easley Robert F.
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 2002.05a
    • /
    • pp.93-97
    • /
    • 2002
  • In this paper, we propose personalized recommendation techniques based on multidimensional scaling (MDS) method for Business to Consumer Electronic Commerce. The multidimensional scaling method is traditionally used in marketing domain for analyzing customers' perceptional differences about brands and products. In this study, using purchase history data, customers in learning dataset are assigned to specific product categories, and after then using MDS a positioning map is generated to map product categories and alternative advertisements. The positioning map will be used to select personalized advertisement in real time situation. In this paper, we suggest the detail design of personalized recommendation method using MDS and compare with other approaches (random approach, collaborative filtering, and TOP3 approach)

  • PDF

Evaluation of research performances for 28 national universities (국내 28개 국공립대학교의 연구성과에 대한 평가)

  • Jeong, Dong Bin
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.6
    • /
    • pp.1241-1251
    • /
    • 2014
  • Based on the 4 principal research-performance criteria in 28 national universities in Korea, both cluster analysis and multidimensional scaling are performed in this paper. We can classify and/or specialize the initially unknown groups into a group of relatively homogeneous universities and then create new groupings without any preconceived notion of what clusters may arise. Furthermore, the level of similarity of individual universities can be visualized on the multidimensional space so that each university is then assigned coordinates in each of the 2 dimensions. Both types and characteristics of each university can be relatively evaluated and be practically exploited for the policy of the university authority through these results.