• Title/Summary/Keyword: Data Compare

Search Result 8,085, Processing Time 0.048 seconds

Simulation studies to compare bayesian wavelet shrinkage methods in aggregated functional data

  • Alex Rodrigo dos Santos Sousa
    • Communications for Statistical Applications and Methods
    • /
    • v.30 no.3
    • /
    • pp.311-330
    • /
    • 2023
  • The present work describes simulation studies to compare the performances in terms of averaged mean squared error of bayesian wavelet shrinkage methods in estimating component curves from aggregated functional data. Five bayesian methods available in the literature were considered to be compared in the studies: The shrinkage rule under logistic prior, shrinkage rule under beta prior, large posterior mode (LPM) method, amplitude-scale invariant Bayes estimator (ABE) and Bayesian adaptive multiresolution smoother (BAMS). The so called Donoho-Johnstone test functions, logit and SpaHet functions were considered as component functions and the scenarios were defined according to different values of sample size and signal to noise ratio in the datasets. It was observed that the signal to noise ratio of the data had impact on the performances of the methods. An application of the methodology and the results to the tecator dataset is also done.

Bootstrap Confidence Intervals for Regression Coefficients under Censored Data

  • Cho, Kil-Ho;Jeong, Seong-Hwa
    • Journal of the Korean Data and Information Science Society
    • /
    • v.13 no.2
    • /
    • pp.355-363
    • /
    • 2002
  • Using the Buckley-James method, we construct bootstrap confidence intervals for the regression coefficients under the censored data. And we compare these confidence intervals in terms of the coverage probabilities and the expected confidence interval lengths through Monte Carlo simulation.

  • PDF

A Spatial Regression for Hospital Data

  • Choi, Yong-Seok;Kang, Chang-Wan;Choi, Seung-Bae
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.4
    • /
    • pp.1271-1278
    • /
    • 2006
  • Recently, a profit analysis in hospital management is considered as an important marketing concept. When spatial variability is presented, we must analyze the hospital data with spatial statistical methods. In this study, we present a regression model using spatial covariance for adjustment. And we compare the nonspatial model with spatial model.

  • PDF

A Time-Segmented Storage Structure and Migration Strategies for Temporal Data (시간지원 데이터를 위한 분리 저장 구조와 데이터 이동 방법)

  • Yun, Hong-Won;Kim, Gyeong-Seok
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.4
    • /
    • pp.851-867
    • /
    • 1999
  • Numerous proposals for extending the relational data model as well as conceptual and object-oriented data models have been suggested. However, there has been relatively less research in the area of defining segmented storage structure and data migration strategies for temporal data. This paper presents the segmented storage structure in order to increment search performance and the two data migration strategies for segmented storage structure. this paper presents the two data migration strategies : the migration strategy by Time granularity, the migration strategy by LST-GET. In the migration strategy by Time Granularity, the dividing time point to assign the entity versions to the past segment, the current segment, and future segment is defined and the searching and moving process for data validity at a granularity level are described. In the migration strategy by LST-GET, we describe the process how to compute the value of dividing criterion. searching and moving processes are described for migration on the future segment and the current segment and entity versions 새 assign on each segment are defined. We simulate the search performance of the segmented storage structure in order to compare it with conventional storage structure in relational database system. And extensive simulation studies are performed in order to compare the search performance of the migration strategies with the segmented storage structure.

  • PDF

A Study on a Statistical Matching Method Using Clustering for Data Enrichment

  • Kim Soon Y.;Lee Ki H.;Chung Sung S.
    • Communications for Statistical Applications and Methods
    • /
    • v.12 no.2
    • /
    • pp.509-520
    • /
    • 2005
  • Data fusion is defined as the process of combining data and information from different sources for the effectiveness of the usage of useful information contents. In this paper, we propose a data fusion algorithm using k-means clustering method for data enrichment to improve data quality in knowledge discovery in database(KDD) process. An empirical study was conducted to compare the proposed data fusion technique with the existing techniques and shows that the newly proposed clustering data fusion technique has low MSE in continuous fusion variables.

유전자 알고리즘을 활용한 데이터 불균형 해소 기법의 조합적 활용

  • Jang, Yeong-Sik;Kim, Jong-U;Heo, Jun
    • Proceedings of the Korea Inteligent Information System Society Conference
    • /
    • 2007.05a
    • /
    • pp.309-320
    • /
    • 2007
  • The data imbalance problem which can be uncounted in data mining classification problems typically means that there are more or less instances in a class than those in other classes. It causes low prediction accuracy of the minority class because classifiers tend to assign instances to major classes and ignore the minor class to reduce overall misclassification rate. In order to solve the data imbalance problem, there has been proposed a number of techniques based on resampling with replacement, adjusting decision thresholds, and adjusting the cost of the different classes. In this paper, we study the feasibility of the combination usage of the techniques previously proposed to deal with the data imbalance problem, and suggest a combination method using genetic algorithm to find the optimal combination ratio of the techniques. To improve the prediction accuracy of a minority class, we determine the combination ratio based on the F-value of the minority class as the fitness function of genetic algorithm. To compare the performance with those of single techniques and the matrix-style combination of random percentage, we performed experiments using four public datasets which has been generally used to compare the performance of methods for the data imbalance problem. From the results of experiments, we can find the usefulness of the proposed method.

  • PDF

Modelling nonlinear polymer rheology is still challenging

  • Marrucci Giuseppe;Ianniruberto Giovanni
    • Korea-Australia Rheology Journal
    • /
    • v.17 no.3
    • /
    • pp.111-116
    • /
    • 2005
  • The new tube model with variable diameter (Marrucci and Ianniruberto, 2004), recently introduced to interpret new elongational data of polymer melts, is here extended to encompass arbitrary flows, specifically shear flows. The predicted results compare well with existing data of entangled polymer melts. Challenges still remain when the comparison is extended to recent elongational data on entangled solutions by Sridhar.

Nonparametric Test for Money and Income Causality

  • Jeong, Ki-Ho
    • Journal of the Korean Data and Information Science Society
    • /
    • v.15 no.2
    • /
    • pp.485-493
    • /
    • 2004
  • This paper considers the test of money and income causality. Jeong (1991, 2003) developed a nonparametric causality test based on the kernel estimation method. We apply the nonparametric test to USA data of money and income. We also compare the test results with ones of the conventional parametric test.

  • PDF

Adsorption Characteristics of Benzene by Carbonized Cast (탄화분변토를 이용한 Benzene의 흡착특성)

  • 김재홍;손희정;김미룡
    • Journal of environmental and Sanitary engineering
    • /
    • v.14 no.1
    • /
    • pp.97-102
    • /
    • 1999
  • This study was carried out view that reuse of sludge of adsorbent for benzene in carbonized cast compare with activated carbon. Not only the carbonized cast is good than carbonized carbon in cation exchange capacity and 12 adsorption capacity, but also benzene adsorption capacity is no differences compare to activated carbon. As results, benzene adsorption capacity of carbonized cast and activated carbon are decreased as temperature increase($25~70^{\circ}C$).It is compatible in Lamgmuir model. Therefore, carbonized cast is applied general adsorbent. From experimental results and data regression, in model concerning effect of temperature, relative errors between the experimental data and those calculated by the model are within the range of 1.2~7.8%. In relative humidity effect (RH 0.25~0.50) of benzene adsorption, modified Freundlich model : $QB_{enzene}{;\}QB_{enzene},{\}_{RH=0}=1-kRH^{IN}$, relative errors between the experimental data and those calculated by the model are within are range of 0.5-5.1%. The constants k and l/n in equation were found to be 1.25, 1.89 in carbonized cast.

  • PDF

Evaluation of Conversion Action Data Mechanisms in Cost-Per-Action Advertising

  • Tian, Li;Lee, Kyoung-Jun
    • 한국경영정보학회:학술대회논문집
    • /
    • 2008.06a
    • /
    • pp.428-433
    • /
    • 2008
  • The online advertising industry's business model undertakes the change from CPM (cost-per-mille)-based to CPC (cost-per-click)-based. However, due to the problem of 'Click Fraud', CPA (cost-per-action) has been regarded as a new step. For CPA, publishers need to get information after a user clicks an advertisement. Therefore, in CPA, the key is to get Conversion Action Data (CAD). This paper introduces two existing mechanisms for getting CAD, compare their characteristics, and analyze their limitations. Then the two new mechanisms are introduced and their requirements and feasibility are analyzed. Lastly, we compare the existing two and the new two mechanisms, and point out each mechanism's business possibility, value and Application Area. This paper will help publishers choose the most appropriate mechanism on the basis of their situation.

  • PDF