• 제목/요약/키워드: Multivariate Statistical Method

검색결과 294건 처리시간 0.018초

Moments calculation for truncated multivariate normal in nonlinear generalized mixed models

  • Lee, Seung-Chun
    • Communications for Statistical Applications and Methods
    • /
    • 제27권3호
    • /
    • pp.377-383
    • /
    • 2020
  • The likelihood-based inference in a nonlinear generalized mixed model often requires computing moments of truncated multivariate normal random variables. Many methods have been proposed for the computation using a recurrence relation or the moment generating function; however, these methods rely on high dimensional numerical integrations. The numerical method is known to be inefficient for high dimensional integral in accuracy. Besides the accuracy, the methods demand too much computing time to use them in practical analyses. In this note, a moment calculation method is proposed under an assumption of a certain covariance structure that occurred mostly in generalized mixed models. The method needs only low dimensional numerical integrations.

A Resetting Scheme for Process Parameters using the Mahalanobis-Taguchi System

  • Park, Chang-Soon
    • 응용통계연구
    • /
    • 제25권4호
    • /
    • pp.589-603
    • /
    • 2012
  • Mahalanobis-Taguchi system(MTS) is a statistical tool for classifying the normal group and abnormal group in multivariate data structures. In addition to the classification itself, the MTS uses a method for selecting variables useful for the classification. This method can be used efficiently especially when the abnormal group data are scattered without a specific directionality. When the feedback adjustment procedure through the measurements of the process output for controlling process input variables is not practically possible, the reset procedure can be an alternative one. This article proposes a reset procedure using the MTS. Moreover, a method for identifying input variables to reset is also proposed by the use of the contribution. The identification of the root-cause parameters using the existing dimension-reduced contribution tends to be difficult due to the variety of correlation relationships of multivariate data structures. However, it became possible to provide an improved decision when used together with the location-centered contribution and the individual-parameter contribution.

An importance sampling for a function of a multivariate random variable

  • Jae-Yeol Park;Hee-Geon Kang;Sunggon Kim
    • Communications for Statistical Applications and Methods
    • /
    • 제31권1호
    • /
    • pp.65-85
    • /
    • 2024
  • The tail probability of a function of a multivariate random variable is not easy to estimate by the crude Monte Carlo simulation. When the occurrence of the function value over a threshold is rare, the accurate estimation of the corresponding probability requires a huge number of samples. When the explicit form of the cumulative distribution function of each component of the variable is known, the inverse transform likelihood ratio method is directly applicable scheme to estimate the tail probability efficiently. The method is a type of the importance sampling and its efficiency depends on the selection of the importance sampling distribution. When the cumulative distribution of the multivariate random variable is represented by a copula and its marginal distributions, we develop an iterative algorithm to find the optimal importance sampling distribution, and show the convergence of the algorithm. The performance of the proposed scheme is compared with the crude Monte Carlo simulation numerically.

순수 성분의 물성 자료를 이용한 2성분계 혼합물의 인화점에 대한 다변량 통계 분석 및 예측 (Multivariate Statistical Analysis and Prediction for the Flash Points of Binary Systems Using Physical Properties of Pure Substances)

  • 이범석;김성영
    • 한국가스학회지
    • /
    • 제11권3호
    • /
    • pp.13-18
    • /
    • 2007
  • 다변량 통계 분석법(Multivariate statistical analysis method)의 대표적 방법인 다중 선형 회귀법(Multiple linear regression. MLR)을 이용하여 2성분계 혼합물의 인화점을 회귀 분석하고 예측하였다. 가연성 물질의 인화점에 대한 예측은 실제 화학 공정 설계에서 화재 및 폭발 위험성을 판단하는 중요한 부분 중의 하나이다. 본 연구에서는 순수 성분의 물성 자료만을 이용하여 2성분계 혼합물의 인화점 실험 자료에 대해 다중 선형 회귀법(MLR)을 수행하였고, 이를 이용하여 새로운 혼합물에 대한 인화점을 예측하였다. 2성분계 혼합물의 인화점에 대한 MLR의 회귀 성능과 새로운 혼합물에 대한 예측 성능을 알아보기 위해, 기존의 인화점 추정 방법인 Raoult의 법칙과 Van Laar식에 의한 추정값과 비교해 보았다.

  • PDF

Diagnosis of Observations after Fit of Multivariate Skew t-Distribution: Identification of Outliers and Edge Observations from Asymmetric Data

  • Kim, Seung-Gu
    • 응용통계연구
    • /
    • 제25권6호
    • /
    • pp.1019-1026
    • /
    • 2012
  • This paper presents a method for the identification of "edge observations" located on a boundary area constructed by a truncation variable as well as for the identification of outliers and the after fit of multivariate skew $t$-distribution(MST) to asymmetric data. The detection of edge observation is important in data analysis because it provides information on a certain critical area in observation space. The proposed method is applied to an Australian Institute of Sport(AIS) dataset that is well known for asymmetry in data space.

AUTOMATED ELECTROFACIES DETERMINATION USING MULTIVARIATE STATISTICAL ANALYSIS

  • Kim Jungwhan;Lim Jong-Se
    • 한국석유지질학회:학술대회논문집
    • /
    • 한국석유지질학회 1998년도 제5차 학술발표회 발표논문집
    • /
    • pp.10-14
    • /
    • 1998
  • A systematic methodology is developed for the electrofacies determination from wireline log data using multivariate statistical analysis. To consider corresponding contribution of each log and reduce the computational dimension, multivariate logs are transformed into a single variable through principal components analysis. Resultant principal components logs are segmented using the statistical zonation method to enhance the efficiency and quality of the interpreted results. Hierarchical cluster analysis is then used to group the segments into electrofacies. Optimal number of groups is determined on the basis of the ratio of within-group variance to total variance and core data. This technique is applied to the wells in the Korea Continental Shelf. The results of field application demonstrate that the prediction of lithology based on the electrofacies classification matches well to the core and the cutting data with high reliability This methodology for electrofacies classification can be used to define the reservoir characteristics which are helpful to the reservoir management.

  • PDF

Bayesian Parameter Estimation using the MCMC method for the Mean Change Model of Multivariate Normal Random Variates

  • Oh, Mi-Ra;Kim, Eoi-Lyoung;Sim, Jung-Wook;Son, Young-Sook
    • Communications for Statistical Applications and Methods
    • /
    • 제11권1호
    • /
    • pp.79-91
    • /
    • 2004
  • In this thesis, Bayesian parameter estimation procedure is discussed for the mean change model of multivariate normal random variates under the assumption of noninformative priors for all the parameters. Parameters are estimated by Gibbs sampling method. In Gibbs sampler, the change point parameter is generated by Metropolis-Hastings algorithm. We apply our methodology to numerical data to examine it.

A Unit Root Test for Multivariate Autoregressive Model with Multiple Unit Roots

  • Shin, Key-Il
    • Journal of the Korean Statistical Society
    • /
    • 제26권3호
    • /
    • pp.397-405
    • /
    • 1997
  • Recently maximum likelihood estimators using unconditional likelihood function are used for testing unit roots. When one wants to use this method the determinant term of initial values in the multivariate unconditional likelihood function produces a complicated function of the elements in the coefficient matrix and variance matrix. In this paper an approximation of the determinant term is calculated and based on this aproximation an approximated unconditional likelihood function is calculated. The approximated unconditional maximum likelihood estimators can be used to test for unit roots. When multivariate process has one unit root the limiting distribution obtained by this method and the limiting distribution using exact unconditional likelihood function are the same.

  • PDF

Partially linear multivariate regression in the presence of measurement error

  • Yalaz, Secil;Tez, Mujgan
    • Communications for Statistical Applications and Methods
    • /
    • 제27권5호
    • /
    • pp.511-521
    • /
    • 2020
  • In this paper, a partially linear multivariate model with error in the explanatory variable of the nonparametric part, and an m dimensional response variable is considered. Using the uniform consistency results found for the estimator of the nonparametric part, we derive an estimator of the parametric part. The dependence of the convergence rates on the errors distributions is examined and demonstrated that proposed estimator is asymptotically normal. In main results, both ordinary and super smooth error distributions are considered. Moreover, the derived estimators are applied to the economic behaviors of consumers. Our method handles contaminated data is founded more effectively than the semiparametric method ignores measurement errors.

A fast approximate fitting for mixture of multivariate skew t-distribution via EM algorithm

  • Kim, Seung-Gu
    • Communications for Statistical Applications and Methods
    • /
    • 제27권2호
    • /
    • pp.255-268
    • /
    • 2020
  • A mixture of multivariate canonical fundamental skew t-distribution (CFUST) has been of interest in various fields. In particular, interest in the unsupervised learning society is noteworthy. However, fitting the model via EM algorithm suffers from significant processing time. The main cause is due to the calculation of many multivariate t-cdfs (cumulative distribution functions) in E-step. In this article, we provide an approximate, but fast calculation method for the in univariate fashion, which is the product of successively conditional univariate t-cdfs with Taylor's first order approximation. By replacing all multivariate t-cdfs in E-step with the proposed approximate versions, we obtain the admissible results of fitting the model, where it gives 85% reduction time for the 5 dimensional skewness case of the Australian Institution Sport data set. For this approach, discussions about rough properties, advantages and limits are also presented.