• Title/Summary/Keyword: Multivariate Regression Model

Search Result 415, Processing Time 0.025 seconds

Note on response dimension reduction for multivariate regression

  • Yoo, Jae Keun
    • Communications for Statistical Applications and Methods
    • /
    • v.26 no.5
    • /
    • pp.519-526
    • /
    • 2019
  • Response dimension reduction in a sufficient dimension reduction (SDR) context has been widely ignored until Yoo and Cook (Computational Statistics and Data Analysis, 53, 334-343, 2008) founded theories for it and developed an estimation approach. Recent research in SDR shows that a semi-parametric approach can outperform conventional non-parametric SDR methods. Yoo (Statistics: A Journal of Theoretical and Applied Statistics, 52, 409-425, 2018) developed a semi-parametric approach for response reduction in Yoo and Cook (2008) context, and Yoo (Journal of the Korean Statistical Society, 2019) completes the semi-parametric approach by proposing an unstructured method. This paper theoretically discusses and provides insightful remarks on three versions of semi-parametric approaches that can be useful for statistical practitioners. It is also possible to avoid numerical instability by presenting the results for an orthogonal transformation of the response variables.

Multivariate Statistical Analysis Approach to Predict the Reactor Properties and the Product Quality of a Direct Esterification Reactor for PET Synthesis (다변량 통계분석법을 이용한 PET 중합공정 중 직접 에스테르화 반응기의 거동 및 생산제품 예측)

  • Kim Sung Young;Chung Chang Bock;Choi Soo Hyoung;Lee Bomsock;Lee Bomsock
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.11 no.6
    • /
    • pp.550-557
    • /
    • 2005
  • The multivariate statistical analysis methods, using both multiple linear regression(MLR) and partial least square(PLS), have been applied to predict the reactor properties and the product quality of a direct esterification reactor for polyethylene terephthalate(PET) synthesis. On the basis of the set of data including the flow rate of water vapor, the flow rate of EG vapor, the concentration of acid end groups of a product and other operating conditions such as temperature, pressure, reaction times and feed monomer mole ratio, two multi-variable analysis methods have been applied. Their regression and prediction abilities also have been compared. The prediction results are critically compared with the actual plant data and the other mathematical model based results in reliability. This paper shows that PLS method approach can be used for the reasonably accurate prediction of a product quality of a direct esterification reactor in PET synthesis process.

The Impact of Audit Characteristics on Firm Performance: An Empirical Study from an Emerging Economy

  • Rahman, Md. Musfiqur;Meah, Mohammad Rajon;Chaudhory, Nasir Uddin
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.6 no.1
    • /
    • pp.59-69
    • /
    • 2019
  • The auditor, an important instrument of corporate governance, ensures the transparency and accountability of the firm to the stakeholders. The objective of this paper is to explore the impact of audit characteristics on firm performance. In this study, external audit quality (BIG4), frequencies of audit committee meetings, and audit committee size are used as the proxies of audit characteristics and firm performance is measured through ROA, profit margin and EPS. A total of 503 firm years are considered as sample size from the listed manufacturing firms of Dhaka Stock Exchange (DSE) during the period of 2013 to 2017 to find out the impact of audit characteristics on firm performance. In this study, multivariate regression analysis is conducted using the pooled OLS method. Moreover, time dummy and lag model of multivariate analysis are also analyzed as robust check. The multivariate regression results find that external audit quality (BIG4) and audit committee size are significantly positively associated with firm performance. This study also finds that there is a significant negative relationship between audit committee meeting and firm performance. This study recommends that the regulatory authority and audit committee should review the frequencies of audit committee meeting to make it more effective to ensure better firm performance.

Variable Selection with Nonconcave Penalty Function on Reduced-Rank Regression

  • Jung, Sang Yong;Park, Chongsun
    • Communications for Statistical Applications and Methods
    • /
    • v.22 no.1
    • /
    • pp.41-54
    • /
    • 2015
  • In this article, we propose nonconcave penalties on a reduced-rank regression model to select variables and estimate coefficients simultaneously. We apply HARD (hard thresholding) and SCAD (smoothly clipped absolute deviation) symmetric penalty functions with singularities at the origin, and bounded by a constant to reduce bias. In our simulation study and real data analysis, the new method is compared with an existing variable selection method using $L_1$ penalty that exhibits competitive performance in prediction and variable selection. Instead of using only one type of penalty function, we use two or three penalty functions simultaneously and take advantages of various types of penalty functions together to select relevant predictors and estimation to improve the overall performance of model fitting.

A study on log-density ratio in logistic regression model for binary data

  • Kahng, Myung-Wook
    • Journal of the Korean Data and Information Science Society
    • /
    • v.22 no.1
    • /
    • pp.107-113
    • /
    • 2011
  • We present methods for studying the log-density ratio, which allow us to select which predictors are needed, and how they should be included in the logistic regression model. Under multivariate normal distributional assumptions, we investigate the form of the log-density ratio as a function of many predictors. The linear, quadratic and crossproduct terms are required in general. If two covariance matrices are equal, then the crossproduct and quadratic terms are not needed. If the variables are uncorrelated, we do not need the crossproduct terms, but we still need the linear and quadratic terms.

A Comparative Study of Estimation by Analogy using Data Mining Techniques

  • Nagpal, Geeta;Uddin, Moin;Kaur, Arvinder
    • Journal of Information Processing Systems
    • /
    • v.8 no.4
    • /
    • pp.621-652
    • /
    • 2012
  • Software Estimations provide an inclusive set of directives for software project developers, project managers, and the management in order to produce more realistic estimates based on deficient, uncertain, and noisy data. A range of estimation models are being explored in the industry, as well as in academia, for research purposes but choosing the best model is quite intricate. Estimation by Analogy (EbA) is a form of case based reasoning, which uses fuzzy logic, grey system theory or machine-learning techniques, etc. for optimization. This research compares the estimation accuracy of some conventional data mining models with a hybrid model. Different data mining models are under consideration, including linear regression models like the ordinary least square and ridge regression, and nonlinear models like neural networks, support vector machines, and multivariate adaptive regression splines, etc. A precise and comprehensible predictive model based on the integration of GRA and regression has been introduced and compared. Empirical results have shown that regression when used with GRA gives outstanding results; indicating that the methodology has great potential and can be used as a candidate approach for software effort estimation.

Repetitive model refinement for structural health monitoring using efficient Akaike information criterion

  • Lin, Jeng-Wen
    • Smart Structures and Systems
    • /
    • v.15 no.5
    • /
    • pp.1329-1344
    • /
    • 2015
  • The stiffness of a structure is one of several structural signals that are useful indicators of the amount of damage that has been done to the structure. To accurately estimate the stiffness, an equation of motion containing a stiffness parameter must first be established by expansion as a linear series model, a Taylor series model, or a power series model. The model is then used in multivariate autoregressive modeling to estimate the structural stiffness and compare it to the theoretical value. Stiffness assessment for modeling purposes typically involves the use of one of three statistical model refinement approaches, one of which is the efficient Akaike information criterion (AIC) proposed in this paper. If a newly added component of the model results in a decrease in the AIC value, compared to the value obtained with the previously added component(s), it is statistically justifiable to retain this new component; otherwise, it should be removed. This model refinement process is repeated until all of the components of the model are shown to be statistically justifiable. In this study, this model refinement approach was compared with the two other commonly used refinement approaches: principal component analysis (PCA) and principal component regression (PCR) combined with the AIC. The results indicate that the proposed AIC approach produces more accurate structural stiffness estimates than the other two approaches.

Implicit Treatment of Technical Specification and Thermal Hydraulic Parameter Uncertainties in Gaussian Process Model to Estimate Safety Margin

  • Fynan, Douglas A.;Ahn, Kwang-Il
    • Nuclear Engineering and Technology
    • /
    • v.48 no.3
    • /
    • pp.684-701
    • /
    • 2016
  • The Gaussian process model (GPM) is a flexible surrogate model that can be used for nonparametric regression for multivariate problems. A unique feature of the GPM is that a prediction variance is automatically provided with the regression function. In this paper, we estimate the safety margin of a nuclear power plant by performing regression on the output of best-estimate simulations of a large-break loss-of-coolant accident with sampling of safety system configuration, sequence timing, technical specifications, and thermal hydraulic parameter uncertainties. The key aspect of our approach is that the GPM regression is only performed on the dominant input variables, the safety injection flow rate and the delay time for AC powered pumps to start representing sequence timing uncertainty, providing a predictive model for the peak clad temperature during a reflood phase. Other uncertainties are interpreted as contributors to the measurement noise of the code output and are implicitly treated in the GPM in the noise variance term, providing local uncertainty bounds for the peak clad temperature. We discuss the applicability of the foregoing method to reduce the use of conservative assumptions in best estimate plus uncertainty (BEPU) and Level 1 probabilistic safety assessment (PSA) success criteria definitions while dealing with a large number of uncertainties.

MARS inverse analysis of soil and wall properties for braced excavations in clays

  • Zhang, Wengang;Zhang, Runhong;Goh, Anthony. T.C.
    • Geomechanics and Engineering
    • /
    • v.16 no.6
    • /
    • pp.577-588
    • /
    • 2018
  • A major concern in deep excavation project in soft clay deposits is the potential for adjacent buildings to be damaged as a result of the associated excessive ground movements. In order to accurately determine the wall deflections using a numerical procedure such as the finite element method, it is critical to use the correct soil parameters such as the stiffness/strength properties. This can be carried out by performing an inverse analysis using the measured wall deflections. This paper firstly presents the results of extensive plane strain finite element analyses of braced diaphragm walls to examine the influence of various parameters such as the excavation geometry, soil properties and wall stiffness on the wall deflections. Based on these results, a multivariate adaptive regression splines (MARS) model was developed for inverse parameter identification of the soil relative stiffness ratio. A second MARS model was also developed for inverse parameter estimation of the wall system stiffness, to enable designers to determine the appropriate wall size during the preliminary design phase. Soil relative stiffness ratios and system stiffness values derived via these two different MARS models were found to compare favourably with a number of field and published records.

Prediction of the compressive strength of self-compacting concrete using surrogate models

  • Asteris, Panagiotis G.;Ashrafian, Ali;Rezaie-Balf, Mohammad
    • Computers and Concrete
    • /
    • v.24 no.2
    • /
    • pp.137-150
    • /
    • 2019
  • In this paper, surrogate models such as multivariate adaptive regression splines (MARS) and M5P model tree (M5P MT) methods have been investigated in order to propose a new formulation for the 28-days compressive strength of self-compacting concrete (SCC) incorporating metakaolin as a supplementary cementitious materials. A database comprising experimental data has been assembled from several published papers in the literature and the data have been used for training and testing. In particular, the data are arranged in a format of seven input parameters covering contents of cement, coarse aggregate to fine aggregate ratio, water, metakaolin, super plasticizer, largest maximum size and binder as well as one output parameter, which is the 28-days compressive strength. The efficiency of the proposed techniques has been demonstrated by means of certain statistical criteria. The findings have been compared to experimental results and their comparisons shows that the MARS and M5P MT approaches predict the compressive strength of SCC incorporating metakaolin with great precision. The performed sensitivity analysis to assign effective parameters on 28-days compressive strength indicates that cementitious binder content is the most effective variable in the mixture.