• Title/Summary/Keyword: multivariate statistical methods

Search Result 463, Processing Time 0.025 seconds

Testing Homogeneity of Diagonal Covariance Matrices of K Multivariate Normal Populations

  • Kim, Hea-Jung
    • Communications for Statistical Applications and Methods
    • /
    • v.6 no.3
    • /
    • pp.929-938
    • /
    • 1999
  • We propose a criterion for testing homogeneity of diagonal covariance matrices of K multivariate normal populations. It is based on a factorization of usual likelihood ratio intended to propose and develop a criterion that makes use of properties of structures of the diagonal convariance matrices. The criterion then leads to a simple test as well as to an accurate asymptotic distribution of the test statistic via general result by Box (1949).

  • PDF

Admissible Hierarchical Bayes Estimators of a Multivariate Normal Mean Shrinking towards a Regression Surface

  • Cho, Byung-Yup;Choi, Kuey-Chung;Chang, In-Hong
    • Communications for Statistical Applications and Methods
    • /
    • v.3 no.2
    • /
    • pp.205-216
    • /
    • 1996
  • Consider the problem of estimating a multivariate normal mean with an unknown covarience matrix under a weighted sum of squared error losses. We first provide hierarchical Bayes estimators which shrink the usual (maximum liklihood, uniformly minimum variance unbiased) estimator towards a regression surface and then prove the admissibility of these estimators using Blyth's (1951) method.

  • PDF

A Post-stratified Estimation in Multivariate Stratified Sampling Surveys

  • Park, Jinwoo
    • Communications for Statistical Applications and Methods
    • /
    • v.6 no.3
    • /
    • pp.755-760
    • /
    • 1999
  • In multivariate stratified sampling surveys it is general to use a few stratification variables which are highly correlated with the important variables at design stage. But there might be some secondary study variables which are not so highly correlated with those stratification variables. In that case it is not efficient to use the same type of estimator due to the secondary variables as the one base on the important variables. A post-stratified estimation is proposed to increase the efficiency of the estimator with existence of secondary variables. The proposed method is illustrated with a set of fishery household population survey data.

  • PDF

Binary classification on compositional data

  • Joo, Jae Yun;Lee, Seokho
    • Communications for Statistical Applications and Methods
    • /
    • v.28 no.1
    • /
    • pp.89-97
    • /
    • 2021
  • Due to boundedness and sum constraint, compositional data are often transformed by logratio transformation and their transformed data are put into traditional binary classification or discriminant analysis. However, it may be problematic to directly apply traditional multivariate approaches to the transformed data because class distributions are not Gaussian and Bayes decision boundary are not polynomial on the transformed space. In this study, we propose to use flexible classification approaches to transformed data for compositional data classification. Empirical studies using synthetic and real examples demonstrate that flexible approaches outperform traditional multivariate classification or discriminant analysis.

Multivariate analysis of longitudinal surveys for population median

  • Priyanka, Kumari;Mittal, Richa
    • Communications for Statistical Applications and Methods
    • /
    • v.24 no.3
    • /
    • pp.255-269
    • /
    • 2017
  • This article explores the analysis of longitudinal surveys in which same units are investigated on several occasions. Multivariate exponential ratio type estimator has been proposed for the estimation of the finite population median at the current occasion in two occasion longitudinal surveys. Information on several additional auxiliary variables, which are stable over time and readily available on both the occasions, has been utilized. Properties of the proposed multivariate estimator, including the optimum replacement strategy, are presented. The proposed multivariate estimator is compared with the sample median estimator when there is no matching from a previous occasion and with the exponential ratio type estimator in successive sampling when information is available on only one additional auxiliary variable. The merits of the proposed estimator are justified by empirical interpretations and validated by a simulation study with the help of some natural populations.

Omnibus tests for multivariate normality based on Mardia's skewness and kurtosis using normalizing transformation

  • Kim, Namhyun
    • Communications for Statistical Applications and Methods
    • /
    • v.27 no.5
    • /
    • pp.501-510
    • /
    • 2020
  • Mardia (Biometrika, 57, 519-530, 1970) defined measures of multivariate skewness and kurtosis. Based on these measures, omnibus test statistics of multivariate normality are proposed using normalizing transformations. The transformations we consider are normal approximation and a Wilson-Hilferty transformation. The normalizing transformation proposed by Enomoto et al. (Communications in Statistics-Simulation and Computation, 49, 684-698, 2019) for the Mardia's kurtosis is also considered. A comparison of power is conducted by a simulation study. As a result, sum of squares of the normal approximation to the Mardia's skewness and the Enomoto's normalizing transformation to the Mardia's kurtosis seems to have relatively good power over the alternatives that are considered.

Multivariate confidence region using quantile vectors

  • Hong, Chong Sun;Kim, Hong Il
    • Communications for Statistical Applications and Methods
    • /
    • v.24 no.6
    • /
    • pp.641-649
    • /
    • 2017
  • Multivariate confidence regions were defined using a chi-square distribution function under a normal assumption and were represented with ellipse and ellipsoid types of bivariate and trivariate normal distribution functions. In this work, an alternative confidence region using the multivariate quantile vectors is proposed to define the normal distribution as well as any other distributions. These lower and upper bounds could be obtained using quantile vectors, and then the appropriate region between two bounds is referred to as the quantile confidence region. It notes that the upper and lower bounds of the bivariate and trivariate quantile confidence regions are represented as a curve and surface shapes, respectively. The quantile confidence region is obtained for various types of distribution functions that are both symmetric and asymmetric distribution functions. Then, its coverage rate is also calculated and compared. Therefore, we conclude that the quantile confidence region will be useful for the analysis of multivariate data, since it is found to have better coverage rates, even for asymmetric distributions.

Independence and maximal volume of d-dimensional random convex hull

  • Son, Won;Park, Seongoh;Lim, Johan
    • Communications for Statistical Applications and Methods
    • /
    • v.25 no.1
    • /
    • pp.79-89
    • /
    • 2018
  • In this paper, we study the maximal property of the volume of the convex hull of d-dimensional independent random vectors. We show that the volume of the random convex hull from a multivariate location-scale family indexed by ${\Sigma}$ is stochastically maximized in simple stochastic order when ${\Sigma}$ is diagonal. The claim can be applied to a broad class of multivariate distributions that include skewed/unskewed multivariate t-distributions. We numerically investigate the proven stochastic relationship between the dependent and independent random convex hulls with the Gaussian random convex hull. The numerical results confirm our theoretical findings and the maximal property of the volume of the independent random convex hull.

A fast approximate fitting for mixture of multivariate skew t-distribution via EM algorithm

  • Kim, Seung-Gu
    • Communications for Statistical Applications and Methods
    • /
    • v.27 no.2
    • /
    • pp.255-268
    • /
    • 2020
  • A mixture of multivariate canonical fundamental skew t-distribution (CFUST) has been of interest in various fields. In particular, interest in the unsupervised learning society is noteworthy. However, fitting the model via EM algorithm suffers from significant processing time. The main cause is due to the calculation of many multivariate t-cdfs (cumulative distribution functions) in E-step. In this article, we provide an approximate, but fast calculation method for the in univariate fashion, which is the product of successively conditional univariate t-cdfs with Taylor's first order approximation. By replacing all multivariate t-cdfs in E-step with the proposed approximate versions, we obtain the admissible results of fitting the model, where it gives 85% reduction time for the 5 dimensional skewness case of the Australian Institution Sport data set. For this approach, discussions about rough properties, advantages and limits are also presented.

Selection of markers in the framework of multivariate receiver operating characteristic curve analysis in binary classification

  • Sameera, G;Vishnu, Vardhan R
    • Communications for Statistical Applications and Methods
    • /
    • v.26 no.2
    • /
    • pp.79-89
    • /
    • 2019
  • Classification models pertaining to receiver operating characteristic (ROC) curve analysis have been extended from univariate to multivariate setup by linearly combining available multiple markers. One such classification model is the multivariate ROC curve analysis. However, not all markers contribute in a real scenario and may mask the contribution of other markers in classifying the individuals/objects. This paper addresses this issue by developing an algorithm that helps in identifying the important markers that are significant and true contributors. The proposed variable selection framework is supported by real datasets and a simulation study, it is shown to provide insight about the individual marker's significance in providing a classifier rule/linear combination with good extent of classification.