• 제목/요약/키워드: Multivariate simulation

검색결과 173건 처리시간 0.022초

Selection of markers in the framework of multivariate receiver operating characteristic curve analysis in binary classification

  • Sameera, G;Vishnu, Vardhan R
    • Communications for Statistical Applications and Methods
    • /
    • 제26권2호
    • /
    • pp.79-89
    • /
    • 2019
  • Classification models pertaining to receiver operating characteristic (ROC) curve analysis have been extended from univariate to multivariate setup by linearly combining available multiple markers. One such classification model is the multivariate ROC curve analysis. However, not all markers contribute in a real scenario and may mask the contribution of other markers in classifying the individuals/objects. This paper addresses this issue by developing an algorithm that helps in identifying the important markers that are significant and true contributors. The proposed variable selection framework is supported by real datasets and a simulation study, it is shown to provide insight about the individual marker's significance in providing a classifier rule/linear combination with good extent of classification.

A modified test for multivariate normality using second-power skewness and kurtosis

  • Namhyun Kim
    • Communications for Statistical Applications and Methods
    • /
    • 제30권4호
    • /
    • pp.423-435
    • /
    • 2023
  • The Jarque and Bera (1980) statistic is one of the well known statistics to test univariate normality. It is based on the sample skewness and kurtosis which are the sample standardized third and fourth moments. Desgagné and de Micheaux (2018) proposed an alternative form of the Jarque-Bera statistic based on the sample second power skewness and kurtosis. In this paper, we generalize the statistic to a multivariate version by considering some data driven directions. They are directions given by the normalized standardized scaled residuals. The statistic is a modified multivariate version of Kim (2021), where the statistic is generalized using an empirical standardization of the scaled residuals of data. A simulation study reveals that the proposed statistic shows better power when the dimension of data is big.

다변량 공정 모니터링에서 이상신호 발생시 원인 식별에 관한 연구 (Notes on identifying source of out-of-control signals in phase II multivariate process monitoring)

  • 이성임
    • 응용통계연구
    • /
    • 제31권1호
    • /
    • pp.1-11
    • /
    • 2018
  • 최근 다변량 공정관리는 다양한 응용 분야에서 중요해지고 있는 추세이다. 예를 들어, 제조 산업 분야에서는 다변량 품질특성치를 동시에 모니터링할 필요가 있다. 그러나, 다변량 관리도는 이상신호가 발생한 경우 그 원인이 되는 개별적인 변수를 식별하기가 어렵기 때문에, 실제로는 기대만큼 유용하게 쓰이고 있지 않은 형편이다. 이에 본 논문에서는 새로운 관측치에 대한 개별적인 신뢰구간을 사용하여 이상신호의 원인을 탐지하는 세 가지 방법을 소개하고, 시뮬레이션 연구를 통해 이상신호의 원인이 되는 개별적인 변수를 식별하고 해석하는 데 있어 주의할 점이 무엇인지 살펴보기로 한다.

A GEE approach for the semiparametric accelerated lifetime model with multivariate interval-censored data

  • Maru Kim;Sangbum Choi
    • Communications for Statistical Applications and Methods
    • /
    • 제30권4호
    • /
    • pp.389-402
    • /
    • 2023
  • Multivariate or clustered failure time data often occur in many medical, epidemiological, and socio-economic studies when survival data are collected from several research centers. If the data are periodically observed as in a longitudinal study, survival times are often subject to various types of interval-censoring, creating multivariate interval-censored data. Then, the event times of interest may be correlated among individuals who come from the same cluster. In this article, we propose a unified linear regression method for analyzing multivariate interval-censored data. We consider a semiparametric multivariate accelerated failure time model as a statistical analysis tool and develop a generalized Buckley-James method to make inferences by imputing interval-censored observations with their conditional mean values. Since the study population consists of several heterogeneous clusters, where the subjects in the same cluster may be related, we propose a generalized estimating equations approach to accommodate potential dependence in clusters. Our simulation results confirm that the proposed estimator is robust to misspecification of working covariance matrix and statistical efficiency can increase when the working covariance structure is close to the truth. The proposed method is applied to the dataset from a diabetic retinopathy study.

A Cointegration Test Based on Weighted Symmetric Estimator

  • Son Bu-Il;Shin Key-Il
    • Communications for Statistical Applications and Methods
    • /
    • 제12권3호
    • /
    • pp.797-805
    • /
    • 2005
  • Multivariate unit root tests for the VAR(p) model have been commonly used in time series analysis. Several unit root tests were developed and recently Shin(2004) suggested a cointegration test based on weighted symmetric estimator. In this paper, we suggest a multivariate unit root test statistic based on the weighted symmetric estimator. Using a small simulation study, we compare the powers of the new test statistic with the statistics suggested in Shin(2004) and Fuller(1996).

Estimating Parameters in Muitivariate Normal Mixtures

  • Ahn, Sung-Mahn;Baik, Sung-Wook
    • Communications for Statistical Applications and Methods
    • /
    • 제18권3호
    • /
    • pp.357-365
    • /
    • 2011
  • This paper investigates a penalized likelihood method for estimating the parameter of normal mixtures in multivariate settings with full covariance matrices. The proposed model estimates the number of components through the addition of a penalty term to the usual likelihood function and the construction of a penalized likelihood function. We prove the consistency of the estimator and present the simulation results on the multi-dimensional nor-mal mixtures up to the 8-dimension.

CUSUM Chart to Monitor Dispersion Matrix for Multivariate Normal Process

  • 장덕준;권용만;홍연웅
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 한국데이터정보과학회 2003년도 춘계학술대회
    • /
    • pp.89-95
    • /
    • 2003
  • Cumulative sum(CUSUM) control charts for monitoring dispersion matrix under multivariate normal process are proposed. Performances of the proposed CUSUM charts are measured in terms of average run length(ARL) by simulation. Numerical results show that small reference values of the proposed CUSUM chart is more efficient for small shifts in the production process.

  • PDF

Nonparametric Test for Multivariate Location Translation Alternatives

  • Na, Jong-Hwa
    • Communications for Statistical Applications and Methods
    • /
    • 제7권3호
    • /
    • pp.799-809
    • /
    • 2000
  • In this paper we propose a nonparametric one sided test for location parameters in p-variate(p$\geq$2) location translation model. The exact null distributions of test statistics are calculated by permutation principle in the case of relatively small sample sizes and the asymptotic distributions are also considered. The powers of various tests are compared through computer simulation and thep-values with real data are also suggested through example.

  • PDF

A Simultaneous Test for Multivariate Normality and Independence with Application to Univariate Residuals

  • Park, Cheol-Yong
    • Journal of the Korean Data and Information Science Society
    • /
    • 제17권1호
    • /
    • pp.115-122
    • /
    • 2006
  • A test is suggested for detecting deviations from both multivariate normality and independence. This test can be used for assessing the normality and independence of univariate time series residuals. We derive the limiting distribution of the test statistic and a simulation study is conducted to study the accuracy of the limiting distribution in finite samples. Finally, we apply our method to a real data of time series.

  • PDF

Design Centering by Genetic Algorithm and Coarse Simulation

  • Jinkoo Lee
    • 한국CDE학회논문집
    • /
    • 제2권4호
    • /
    • pp.215-221
    • /
    • 1997
  • A new approach in solving design centering problem is presented. Like most stochastic optimization problems, optimal design centering problems have intrinsic difficulties in multivariate intergration of probability density functions. In order to avoid to avoid those difficulties, genetic algorithm and very coarse Monte Carlo simulation are used in this research. The new algorithm performs robustly while producing improved yields. This result implies that the combination of robust optimization methods and approximated simulation schemes would give promising ways for many stochastic optimizations which are inappropriate for mathematical programming.

  • PDF