• Title/Summary/Keyword: Multivariate methods

Search Result 2,322, Processing Time 0.028 seconds

A Comparison of the Efficiency of Location Estimators in Bivariate t distribution

  • Choi, Byong Su;Lee, Seung-Chun
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.3
    • /
    • pp.895-907
    • /
    • 2003
  • Recent demands for representing the location of multivariate data produce various multivariate medians such as Tukey median, Oja median and spatial median. They are considered as multivariate versions of the median which is widely recognized as a robust alternative to the arithmetic mean. Many studies show that those multivariate median preserve the robustness. However, the effectiveness of those medians is not fully identified. In this note the relative efficiencies of the multivariate medians are investigated in various configurations under the bivariate t-distribution. It is shown that Tukey median outperforms the others in most configurations.

On inference of multivariate means under ranked set sampling

  • Rochani, Haresh;Linder, Daniel F.;Samawi, Hani;Panchal, Viral
    • Communications for Statistical Applications and Methods
    • /
    • v.25 no.1
    • /
    • pp.1-13
    • /
    • 2018
  • In many studies, a researcher attempts to describe a population where units are measured for multiple outcomes, or responses. In this paper, we present an efficient procedure based on ranked set sampling to estimate and perform hypothesis testing on a multivariate mean. The method is based on ranking on an auxiliary covariate, which is assumed to be correlated with the multivariate response, in order to improve the efficiency of the estimation. We showed that the proposed estimators developed under this sampling scheme are unbiased, have smaller variance in the multivariate sense, and are asymptotically Gaussian. We also demonstrated that the efficiency of multivariate regression estimator can be improved by using Ranked set sampling. A bootstrap routine is developed in the statistical software R to perform inference when the sample size is small. We use a simulation study to investigate the performance of the method under known conditions and apply the method to the biomarker data collected in China Health and Nutrition Survey (CHNS 2009) data.

On the second order property of elliptical multivariate regular variation

  • Moosup Kim
    • Communications for Statistical Applications and Methods
    • /
    • v.31 no.4
    • /
    • pp.459-466
    • /
    • 2024
  • Multivariate regular variation is a popular framework of multivariate extreme value analysis. However, a suitable parametric model needs to be introduced for efficient estimation of its spectral measure. In such a view, elliptical distributions have been employed for deriving such models. On the other hand, the second order behavior of multivariate regular variation has to be specified for investigating the property of the estimator. This paper derives such a behavior by imposing a widely adopted second order regular variation condition on the representation of elliptical distributions. As result, the second order variation for the convergence to spectral measure is characterized by a signed measure with a regular varying index. Moreover, it leads to the asymptotic bias of the estimator. For demonstration, multivariate t-distribution is considered.

Statistical Methods for Multivariate Missing Data in Health Survey Research (보건조사연구에서 다변량결측치가 내포된 자료를 효율적으로 분석하기 위한 통계학적 방법)

  • Kim, Dong-Kee;Park, Eun-Cheol;Sohn, Myong-Sei;Kim, Han-Joong;Park, Hyung-Uk;Ahn, Chae-Hyung;Lim, Jong-Gun;Song, Ki-Jun
    • Journal of Preventive Medicine and Public Health
    • /
    • v.31 no.4 s.63
    • /
    • pp.875-884
    • /
    • 1998
  • Missing observations are common in medical research and health survey research. Several statistical methods to handle the missing data problem have been proposed. The EM algorithm (Expectation-Maximization algorithm) is one of the ways of efficiently handling the missing data problem based on sufficient statistics. In this paper, we developed statistical models and methods for survey data with multivariate missing observations. Especially, we adopted the EM algorithm to handle the multivariate missing observations. We assume that the multivariate observations follow a multivariate normal distribution, where the mean vector and the covariance matrix are primarily of interest. We applied the proposed statistical method to analyze data from a health survey. The data set we used came from a physician survey on Resource-Based Relative Value Scale(RBRVS). In addition to the EM algorithm, we applied the complete case analysis, which uses only completely observed cases, and the available case analysis, which utilizes all available information. The residual and normal probability plots were evaluated to access the assumption of normality. We found that the residual sum of squares from the EM algorithm was smaller than those of the complete-case and the available-case analyses.

  • PDF

Copula modelling for multivariate statistical process control: a review

  • Busababodhin, Piyapatr;Amphanthong, Pimpan
    • Communications for Statistical Applications and Methods
    • /
    • v.23 no.6
    • /
    • pp.497-515
    • /
    • 2016
  • Modern processes often monitor more than one quality characteristic that are referred to as multivariate statistical process control (MSPC) procedures. The MSPC is the most rapidly developing sector of statistical process control and increases interest in the simultaneous inspection of several related quality characteristics. Most multivariate detection procedures based on a multi-normality assumptions are independent, but there are many processes that assume non-normality and correlation. Many multivariate control charts have a lack of related joint distribution. Copulas are tool to construct multivariate modelling and formalizing the dependence structure between random variables and applied in several fields. From copula literature review, there are a few copula to apply in MSPC that have multivariate control charts, and represent a successful tool to identify an out-of-control process. This paper presents various types of copulas modelling for the multivariate control chart. The performance measures of the control chart are the average run length (ARL) and the average number of observations to signal (ANOS). Furthermore, a Monte Carlo simulation is shown when the observations were from an exponential distribution.

Influence Analysis of the Liklihood Ratio Test in Multivariate Behrens-Fisher Problem

  • Jung, Kang-Mo;Kim, Myung-Geun
    • Communications for Statistical Applications and Methods
    • /
    • v.6 no.3
    • /
    • pp.939-946
    • /
    • 1999
  • We propose methods for detecting influential observations that have a large influence on the likelihood ratio test statistic for the multivariate Behrens-Fisher problem. For this purpose we derive the influence curve and the derivative influence of the likelihood ratio test statistic. An illustrative example is given to show the effectiveness of the proposed methods on the identification of influential observations.

  • PDF

Detecting Influential Observations in Multivariate Statistical Analysis of Incomplete Data by PCA (주성분분석에 의한 결손 자료의 영향값 검출에 대한 연구)

  • 김현정;문승호;신재경
    • The Korean Journal of Applied Statistics
    • /
    • v.13 no.2
    • /
    • pp.383-392
    • /
    • 2000
  • Since late 1970, methods of influence or sensitivity analysis for detecting influential observations have been studied not only in regression and related methods but also in various multivariate methods. If results of multivariate analyses sometimes depend heavily on a small number of observations, we should be very careful to draw a conclusion. Similar phenomena may also occur in the case of incomplete data. In this research we try to study such influential observations in multivariate statistical analysis of incomplete data. Case of principal component analysis is studied with a numerical example.

  • PDF

Development of Multivariate Analysis System by Using SAS/AF and SCL

  • Han, Sang-Tae;Kang, Hyuncheol;Lee, Seong-Keon;Jang, Myung-Seok;Lee, Duck-Ki;Ryu, Dong-Kyun
    • Communications for Statistical Applications and Methods
    • /
    • v.8 no.2
    • /
    • pp.507-514
    • /
    • 2001
  • In recent years, the development and the embodiment of information analysis system has been sprightly carried out in several fields of study. In this study, as and extension of these studies, we develop a system for multivariate analysis which might be widely used in social and natural sciences. This multivariate analysis system is developed by using multivariate analysis procedures in SAS/STAT software. Also, the system supply users with he environment of GUI(Graphical User Interface), which is constructed with AF(application frame) and SCL(screen control language) of SAS software, in order to help users to use the system with easy.

  • PDF

Multivariate measures of skewness for the scale mixtures of skew-normal distributions

  • Kim, Hyoung-Moon;Zhao, Jun
    • Communications for Statistical Applications and Methods
    • /
    • v.25 no.2
    • /
    • pp.109-130
    • /
    • 2018
  • Several measures of multivariate skewness for scale mixtures of skew-normal distributions are derived. As a special case, those of multivariate skew-t distribution are considered in detail. Furthermore, the similarities, differences, and behavior of these measures are explored for cases of some specific members of the multivariate skew-normal and skew-t distributions using a simulation study. Since some measures are vectors, it is better to take all measures in the same scale when comparing them. In order to attain such a set of comparable indices, the sample version is considered for each of the skewness measures that are taken as test statistics for the hypothesis of t distribution against skew-t distribution. An application is reported for the data set consisting of 71 total glycerol and magnesium contents in Grignolino wine.

Bayesian Analysis of a New Skewed Multivariate Probit for Correlated Binary Response Data

  • Kim, Hea-Jung
    • Journal of the Korean Statistical Society
    • /
    • v.30 no.4
    • /
    • pp.613-635
    • /
    • 2001
  • This paper proposes a skewed multivariate probit model for analyzing a correlated binary response data with covariates. The proposed model is formulated by introducing an asymmetric link based upon a skewed multivariate normal distribution. The model connected to the asymmetric multivariate link, allows for flexible modeling of the correlation structure among binary responses and straightforward interpretation of the parameters. However, complex likelihood function of the model prevents us from fitting and analyzing the model analytically. Simulation-based Bayesian inference methodologies are provided to overcome the problem. We examine the suggested methods through two data sets in order to demonstrate their performances.

  • PDF