• Title/Summary/Keyword: multivariate statistical method

Search Result 294, Processing Time 0.034 seconds

Moments calculation for truncated multivariate normal in nonlinear generalized mixed models

  • Lee, Seung-Chun
    • Communications for Statistical Applications and Methods
    • /
    • v.27 no.3
    • /
    • pp.377-383
    • /
    • 2020
  • The likelihood-based inference in a nonlinear generalized mixed model often requires computing moments of truncated multivariate normal random variables. Many methods have been proposed for the computation using a recurrence relation or the moment generating function; however, these methods rely on high dimensional numerical integrations. The numerical method is known to be inefficient for high dimensional integral in accuracy. Besides the accuracy, the methods demand too much computing time to use them in practical analyses. In this note, a moment calculation method is proposed under an assumption of a certain covariance structure that occurred mostly in generalized mixed models. The method needs only low dimensional numerical integrations.

A Resetting Scheme for Process Parameters using the Mahalanobis-Taguchi System

  • Park, Chang-Soon
    • The Korean Journal of Applied Statistics
    • /
    • v.25 no.4
    • /
    • pp.589-603
    • /
    • 2012
  • Mahalanobis-Taguchi system(MTS) is a statistical tool for classifying the normal group and abnormal group in multivariate data structures. In addition to the classification itself, the MTS uses a method for selecting variables useful for the classification. This method can be used efficiently especially when the abnormal group data are scattered without a specific directionality. When the feedback adjustment procedure through the measurements of the process output for controlling process input variables is not practically possible, the reset procedure can be an alternative one. This article proposes a reset procedure using the MTS. Moreover, a method for identifying input variables to reset is also proposed by the use of the contribution. The identification of the root-cause parameters using the existing dimension-reduced contribution tends to be difficult due to the variety of correlation relationships of multivariate data structures. However, it became possible to provide an improved decision when used together with the location-centered contribution and the individual-parameter contribution.

An importance sampling for a function of a multivariate random variable

  • Jae-Yeol Park;Hee-Geon Kang;Sunggon Kim
    • Communications for Statistical Applications and Methods
    • /
    • v.31 no.1
    • /
    • pp.65-85
    • /
    • 2024
  • The tail probability of a function of a multivariate random variable is not easy to estimate by the crude Monte Carlo simulation. When the occurrence of the function value over a threshold is rare, the accurate estimation of the corresponding probability requires a huge number of samples. When the explicit form of the cumulative distribution function of each component of the variable is known, the inverse transform likelihood ratio method is directly applicable scheme to estimate the tail probability efficiently. The method is a type of the importance sampling and its efficiency depends on the selection of the importance sampling distribution. When the cumulative distribution of the multivariate random variable is represented by a copula and its marginal distributions, we develop an iterative algorithm to find the optimal importance sampling distribution, and show the convergence of the algorithm. The performance of the proposed scheme is compared with the crude Monte Carlo simulation numerically.

Multivariate Statistical Analysis and Prediction for the Flash Points of Binary Systems Using Physical Properties of Pure Substances (순수 성분의 물성 자료를 이용한 2성분계 혼합물의 인화점에 대한 다변량 통계 분석 및 예측)

  • Lee, Bom-Sock;Kim, Sung-Young
    • Journal of the Korean Institute of Gas
    • /
    • v.11 no.3
    • /
    • pp.13-18
    • /
    • 2007
  • The multivariate statistical analysis, using the multiple linear regression(MLR), have been applied to analyze and predict the flash points of binary systems. Prediction for the flash points of flammable substances is important for the examination of the fire and explosion hazards in the chemical process design. In this paper, the flash points are predicted by MLR based on the physical properties of pure substances and the experimental flash points data. The results of regression and prediction by MLR are compared with the values calculated by Raoult's law and Van Laar equation.

  • PDF

Diagnosis of Observations after Fit of Multivariate Skew t-Distribution: Identification of Outliers and Edge Observations from Asymmetric Data

  • Kim, Seung-Gu
    • The Korean Journal of Applied Statistics
    • /
    • v.25 no.6
    • /
    • pp.1019-1026
    • /
    • 2012
  • This paper presents a method for the identification of "edge observations" located on a boundary area constructed by a truncation variable as well as for the identification of outliers and the after fit of multivariate skew $t$-distribution(MST) to asymmetric data. The detection of edge observation is important in data analysis because it provides information on a certain critical area in observation space. The proposed method is applied to an Australian Institute of Sport(AIS) dataset that is well known for asymmetry in data space.

AUTOMATED ELECTROFACIES DETERMINATION USING MULTIVARIATE STATISTICAL ANALYSIS

  • Kim Jungwhan;Lim Jong-Se
    • 한국석유지질학회:학술대회논문집
    • /
    • spring
    • /
    • pp.10-14
    • /
    • 1998
  • A systematic methodology is developed for the electrofacies determination from wireline log data using multivariate statistical analysis. To consider corresponding contribution of each log and reduce the computational dimension, multivariate logs are transformed into a single variable through principal components analysis. Resultant principal components logs are segmented using the statistical zonation method to enhance the efficiency and quality of the interpreted results. Hierarchical cluster analysis is then used to group the segments into electrofacies. Optimal number of groups is determined on the basis of the ratio of within-group variance to total variance and core data. This technique is applied to the wells in the Korea Continental Shelf. The results of field application demonstrate that the prediction of lithology based on the electrofacies classification matches well to the core and the cutting data with high reliability This methodology for electrofacies classification can be used to define the reservoir characteristics which are helpful to the reservoir management.

  • PDF

Bayesian Parameter Estimation using the MCMC method for the Mean Change Model of Multivariate Normal Random Variates

  • Oh, Mi-Ra;Kim, Eoi-Lyoung;Sim, Jung-Wook;Son, Young-Sook
    • Communications for Statistical Applications and Methods
    • /
    • v.11 no.1
    • /
    • pp.79-91
    • /
    • 2004
  • In this thesis, Bayesian parameter estimation procedure is discussed for the mean change model of multivariate normal random variates under the assumption of noninformative priors for all the parameters. Parameters are estimated by Gibbs sampling method. In Gibbs sampler, the change point parameter is generated by Metropolis-Hastings algorithm. We apply our methodology to numerical data to examine it.

A Unit Root Test for Multivariate Autoregressive Model with Multiple Unit Roots

  • Shin, Key-Il
    • Journal of the Korean Statistical Society
    • /
    • v.26 no.3
    • /
    • pp.397-405
    • /
    • 1997
  • Recently maximum likelihood estimators using unconditional likelihood function are used for testing unit roots. When one wants to use this method the determinant term of initial values in the multivariate unconditional likelihood function produces a complicated function of the elements in the coefficient matrix and variance matrix. In this paper an approximation of the determinant term is calculated and based on this aproximation an approximated unconditional likelihood function is calculated. The approximated unconditional maximum likelihood estimators can be used to test for unit roots. When multivariate process has one unit root the limiting distribution obtained by this method and the limiting distribution using exact unconditional likelihood function are the same.

  • PDF

Partially linear multivariate regression in the presence of measurement error

  • Yalaz, Secil;Tez, Mujgan
    • Communications for Statistical Applications and Methods
    • /
    • v.27 no.5
    • /
    • pp.511-521
    • /
    • 2020
  • In this paper, a partially linear multivariate model with error in the explanatory variable of the nonparametric part, and an m dimensional response variable is considered. Using the uniform consistency results found for the estimator of the nonparametric part, we derive an estimator of the parametric part. The dependence of the convergence rates on the errors distributions is examined and demonstrated that proposed estimator is asymptotically normal. In main results, both ordinary and super smooth error distributions are considered. Moreover, the derived estimators are applied to the economic behaviors of consumers. Our method handles contaminated data is founded more effectively than the semiparametric method ignores measurement errors.

A fast approximate fitting for mixture of multivariate skew t-distribution via EM algorithm

  • Kim, Seung-Gu
    • Communications for Statistical Applications and Methods
    • /
    • v.27 no.2
    • /
    • pp.255-268
    • /
    • 2020
  • A mixture of multivariate canonical fundamental skew t-distribution (CFUST) has been of interest in various fields. In particular, interest in the unsupervised learning society is noteworthy. However, fitting the model via EM algorithm suffers from significant processing time. The main cause is due to the calculation of many multivariate t-cdfs (cumulative distribution functions) in E-step. In this article, we provide an approximate, but fast calculation method for the in univariate fashion, which is the product of successively conditional univariate t-cdfs with Taylor's first order approximation. By replacing all multivariate t-cdfs in E-step with the proposed approximate versions, we obtain the admissible results of fitting the model, where it gives 85% reduction time for the 5 dimensional skewness case of the Australian Institution Sport data set. For this approach, discussions about rough properties, advantages and limits are also presented.