• Title/Summary/Keyword: multivariate linear models

Search Result 68, Processing Time 0.021 seconds

Bayes Prediction Density in Linear Models

  • Kim, S.H.
    • Communications for Statistical Applications and Methods
    • /
    • v.8 no.3
    • /
    • pp.797-803
    • /
    • 2001
  • This paper obtained Bayes prediction density for the spatial linear model with non-informative prior. It showed the results that predictive inferences is completely unaffected by departures from the normality assumption in the direction of the elliptical family and the structure of prediction density is unchanged by more than one additional future observations.

  • PDF

Selection of markers in the framework of multivariate receiver operating characteristic curve analysis in binary classification

  • Sameera, G;Vishnu, Vardhan R
    • Communications for Statistical Applications and Methods
    • /
    • v.26 no.2
    • /
    • pp.79-89
    • /
    • 2019
  • Classification models pertaining to receiver operating characteristic (ROC) curve analysis have been extended from univariate to multivariate setup by linearly combining available multiple markers. One such classification model is the multivariate ROC curve analysis. However, not all markers contribute in a real scenario and may mask the contribution of other markers in classifying the individuals/objects. This paper addresses this issue by developing an algorithm that helps in identifying the important markers that are significant and true contributors. The proposed variable selection framework is supported by real datasets and a simulation study, it is shown to provide insight about the individual marker's significance in providing a classifier rule/linear combination with good extent of classification.

Methodology for Determining Functional Forms in Developing Statistical Collision Models (교통사고모형 개발에서의 함수식 도출 방법론에 관한 연구)

  • Baek, Jong-Dae;Hummer, Joseph
    • International Journal of Highway Engineering
    • /
    • v.14 no.5
    • /
    • pp.189-199
    • /
    • 2012
  • PURPOSES: The purpose of this study is to propose a new methodology for developing statistical collision models and to show the validation results of the methodology. METHODS: A new modeling method of introducing variables into the model one by one in a multiplicative form is suggested. A method for choosing explanatory variables to be introduced into the model is explained. A method for determining functional forms for each explanatory variable is introduced as well as a parameter estimating procedure. A model selection method is also dealt with. Finally, the validation results is provided to demonstrate the efficacy of the final models developed using the method suggested in this study. RESULTS: According to the results of the validation for the total and injury collisions, the predictive powers of the models developed using the method suggested in this study were better than those of generalized linear models for the same data. CONCLUSIONS: Using the methodology suggested in this study, we could develop better statistical collision models having better predictive powers. This was because the methodology enabled us to find the relationships between dependant variable and each explanatory variable individually and to find the functional forms for the relationships which can be more likely non-linear.

On an Approximation to the Distribution of Product of Independent Beta Variates

  • Hea Jung Kim
    • Communications for Statistical Applications and Methods
    • /
    • v.1 no.1
    • /
    • pp.81-86
    • /
    • 1994
  • A Chi-square approximation to the distribution of product of independent Beta variates denoted by U is developed. The distribution is commonly used as a test criterion for the general linear hypothesis about the multivariate linear models. The approximation is obtained by fitting a logarithmic function of U to a Chi-square variate in terms of the first three moments. It is compared with the well known approximations due to Box(1949), Rao(1948), and Mudholkar and Trivedi(1980). It is found that the Chi-square approximation compares favorably with the other three approximations.

  • PDF

A spatial heterogeneity mixed model with skew-elliptical distributions

  • Farzammehr, Mohadeseh Alsadat;McLachlan, Geoffrey J.
    • Communications for Statistical Applications and Methods
    • /
    • v.29 no.3
    • /
    • pp.373-391
    • /
    • 2022
  • The distribution of observations in most econometric studies with spatial heterogeneity is skewed. Usually, a single transformation of the data is used to approximate normality and to model the transformed data with a normal assumption. This assumption is however not always appropriate due to the fact that panel data often exhibit non-normal characteristics. In this work, the normality assumption is relaxed in spatial mixed models, allowing for spatial heterogeneity. An inference procedure based on Bayesian mixed modeling is carried out with a multivariate skew-elliptical distribution, which includes the skew-t, skew-normal, student-t, and normal distributions as special cases. The methodology is illustrated through a simulation study and according to the empirical literature, we fit our models to non-life insurance consumption observed between 1998 and 2002 across a spatial panel of 103 Italian provinces in order to determine its determinants. Analyzing the posterior distribution of some parameters and comparing various model comparison criteria indicate the proposed model to be superior to conventional ones.

Elemental analysis of rice using laser-ablation sampling: Determination of rice-polishing degree

  • Yonghoon Lee
    • Analytical Science and Technology
    • /
    • v.37 no.1
    • /
    • pp.12-24
    • /
    • 2024
  • In this study, laser-induced breakdown spectroscopy (LIBS) was used to estimate the degree of rice polishing. As-threshed rice seeds were dehusked and polished for different times, and the resulting grains were analyzed using LIBS. Various atomic, ionic, and molecular emissions were identified in the LIBS spectra. Their correlation with the amount of polished-off matter was investigated. Na I and Rb I emission line intensities showed linear sensitivity in the widest range of polished-off-matter amount. Thus, univariate models based on those lines were developed to predict the weight percent of polished-off matter and showed 3-5 % accuracy performances. Partial least squares-regression (PLS-R) was also applied to develop a multivariate model using Si I, Mg I, Ca I, Na I, K I, and Rb I emission lines. It outperformed the univariate models in prediction accuracy (2 %). Our results suggest that LIBS can be a reliable tool for authenticating the degree of rice polishing, which is closed related to nutrition, shelf life, appearance, and commercial value of rice products.

Short-term Construction Investment Forecasting Model in Korea (건설투자(建設投資)의 단기예측모형(短期豫測模型) 비교(比較))

  • Kim, Kwan-young;Lee, Chang-soo
    • KDI Journal of Economic Policy
    • /
    • v.14 no.1
    • /
    • pp.121-145
    • /
    • 1992
  • This paper examines characteristics of time series data related to the construction investment(stationarity and time series components such as secular trend, cyclical fluctuation, seasonal variation, and random change) and surveys predictibility, fitness, and explicability of independent variables of various models to build a short-term construction investment forecasting model suitable for current economic circumstances. Unit root test, autocorrelation coefficient and spectral density function analysis show that related time series data do not have unit roots, fluctuate cyclically, and are largely explicated by lagged variables. Moreover it is very important for the short-term construction investment forecasting to grasp time lag relation between construction investment series and leading indicators such as building construction permits and value of construction orders received. In chapter 3, we explicate 7 forecasting models; Univariate time series model (ARIMA and multiplicative linear trend model), multivariate time series model using leading indicators (1st order autoregressive model, vector autoregressive model and error correction model) and multivariate time series model using National Accounts data (simple reduced form model disconnected from simultaneous macroeconomic model and VAR model). These models are examined by 4 statistical tools that are average absolute error, root mean square error, adjusted coefficient of determination, and Durbin-Watson statistic. This analysis proves two facts. First, multivariate models are more suitable than univariate models in the point that forecasting error of multivariate models tend to decrease in contrast to the case of latter. Second, VAR model is superior than any other multivariate models; average absolute prediction error and root mean square error of VAR model are quitely low and adjusted coefficient of determination is higher. This conclusion is reasonable when we consider current construction investment has sustained overheating growth more than secular trend.

  • PDF

Parallelism Test of Slope in Simple Linear Regression Models (회귀모형의 기울기에 대한 품행성 검정)

  • Park, Hyun-Wook;Kim, Dong-Jae
    • Communications for Statistical Applications and Methods
    • /
    • v.16 no.1
    • /
    • pp.75-83
    • /
    • 2009
  • Parallelism tests are proposed for slope in the simple linear regression models. In this paper, we suggest the parametric test using HSD testing method (Tukey,1953) and distribution-free test using Kruskal-wallis (1952) for more than three slopes. Monte Carlo simulation study is adapted to compare the power of the proposed methods with Wilks' Lambda multivariate procedure.

KCYP data analysis using Bayesian multivariate linear model (베이지안 다변량 선형 모형을 이용한 청소년 패널 데이터 분석)

  • Insun, Lee;Keunbaik, Lee
    • The Korean Journal of Applied Statistics
    • /
    • v.35 no.6
    • /
    • pp.703-724
    • /
    • 2022
  • Although longitudinal studies mainly produce multivariate longitudinal data, most of existing statistical models analyze univariate longitudinal data and there is a limitation to explain complex correlations properly. Therefore, this paper describes various methods of modeling the covariance matrix to explain the complex correlations. Among them, modified Cholesky decomposition, modified Cholesky block decomposition, and hypersphere decomposition are reviewed. In this paper, we review these methods and analyze Korean children and youth panel (KCYP) data are analyzed using the Bayesian method. The KCYP data are multivariate longitudinal data that have response variables: School adaptation, academic achievement, and dependence on mobile phones. Assuming that the correlation structure and the innovation standard deviation structure are different, several models are compared. For the most suitable model, all explanatory variables are significant for school adaptation, and academic achievement and only household income appears as insignificant variables when cell phone dependence is a response variable.

A recent overview on financial and special time series models (금융 및 특수시계열 모형의 조망)

  • Hwang, S.Y.
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.1
    • /
    • pp.1-12
    • /
    • 2016
  • Contrasted with the standard linear ARMA models, financial time series exhibits non-standard features such as fat-tails, non-normality, volatility clustering and asymmetries which are usually referred to as "stylized facts" in financial time series context (Terasvirta, 2009). We are accordingly led to ad hoc models (apart from ARMA) to accommodate stylized facts (Andersen et al., 2009). The paper aims to give a contemporary overview on financial and special time series models based on the recent literature and on the author's publications. Various models are illustrated including asymmetric models, integer valued models, multivariate models and high frequency models. Selected statistical issues on the models are discussed, bringing some perspectives to the future works in this area.