• Title/Summary/Keyword: Explanatory variables

Search Result 1,100, Processing Time 0.03 seconds

Comments on the regression coefficients (다중회귀에서 회귀계수 추정량의 특성)

  • Kahng, Myung-Wook
    • The Korean Journal of Applied Statistics
    • /
    • v.34 no.4
    • /
    • pp.589-597
    • /
    • 2021
  • In simple and multiple regression, there is a difference in the meaning of regression coefficients, and not only are the estimates of regression coefficients different, but they also have different signs. Understanding the relative contribution of explanatory variables in a regression model is an important part of regression analysis. In a standardized regression model, the regression coefficient can be interpreted as the change in the response variable with respect to the standard deviation when the explanatory variable increases by the standard deviation in a situation where the values of the explanatory variables other than the corresponding explanatory variable are fixed. However, the size of the standardized regression coefficient is not a proper measure of the relative importance of each explanatory variable. In this paper, the estimator of the regression coefficient in multiple regression is expressed as a function of the correlation coefficient and the coefficient of determination. Furthermore, it is considered in terms of the effect of an additional explanatory variable and additional increase in the coefficient of determination. We also explore the relationship between estimates of regression coefficients and correlation coefficients in various plots. These results are specifically applied when there are two explanatory variables.

Analysis of statistical models for ozone concentrations at the Paju city in Korea (경기도 파주시 오존농도의 통계모형 연구)

  • Lee, Hoon-Ja
    • Journal of the Korean Data and Information Science Society
    • /
    • v.20 no.6
    • /
    • pp.1085-1092
    • /
    • 2009
  • The ozone data is one of the important environmental data for measurement of the atmospheric condition of the country. In this article, the Autoregressive Error (ARE) model and Neural Networks (NN) model have been considered for analyzing the ozone data at the northern part of the Gyeonggi-Do, Paju monitoring site in Korea. In the both ARE model and NN model, seven meteorological variables and four pollution variables are used as the explanatory variables for the ozone data set. The seven meteorological variables are daily maximum temperature, wind speed, relative humidity, rainfall, dew point temperature, steam pressure, and amount of cloud. The four air pollution explanatory variables are Sulfur dioxide ($SO_2$), Nitrogen dioxide ($NO_2$), Cobalt (CO), and Promethium 10 (PM10). The result showed that the NN model is generally better suited for describing the ozone concentration than the ARE model. However, the ARE model will be expected also good when we add the explanatory variables in the model.

  • PDF

Analysis of Time Series Models for Ozone Concentrations at the Uijeongbu City in Korea

  • Lee, Hoon-Ja
    • Journal of the Korean Data and Information Science Society
    • /
    • v.19 no.4
    • /
    • pp.1153-1164
    • /
    • 2008
  • The ozone data is one of the important environmental data for measurement of the atmospheric condition of the country. In this article, the Autoregressive Error (ARE) model have been considered for analyzing the ozone data at the northern part of the Gyeonggi-Do, Uijeongbu monitoring site in Korea. The result showed that both overall and monthly ARE models are suited for describing the ozone concentration. In the ARE model, seven meteorological variables and four pollution variables are used as the as the explanatory variables for the ozone data set. The seven meteorological variables are daily maximum temperature, wind speed, relative humidity, rainfall, dew point temperature, steam pressure, and amount of cloud. The four air pollution explanatory variables are Sulfur dioxide(SO2), Nitrogen dioxide(NO2), Cobalt(CO), and Promethium 10(PM10). Also, the high level ozone data (over 80ppb) have been analyzed four ARE models, General ARE, HL ARE, PM10 add ARE, Temperature add ARE model. The result shows that the General ARE, HL ARE, and PM10 add ARE models are suited for describing the high level of ozone data.

  • PDF

Analysis of time series models for PM10 concentrations at the Suwon city in Korea (경기도 수원시 미세먼지 농도의 시계열모형 연구)

  • Lee, Hoon-Ja
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.6
    • /
    • pp.1117-1124
    • /
    • 2010
  • The PM10 (Promethium 10) data is one of the important environmental data for measurement of the atmospheric condition of the country. In this article, the Autoregressive Error (ARE) model has been considered for analyzing the monthly PM10 data at the southern part of the Gyeonggi-Do, Suwon monitoring site in Korea. In the ARE model, six meteorological variables and four pollution variables are used as the explanatory variables for the PM10 data set. The six meteorological variables are daily maximum temperature, wind speed, relative humidity, rainfall, radiation, and amount of cloud. The four air pollution explanatory variables are sulfur dioxide ($SO_2$), nitrogen dioxide ($NO_2$), carbon monoxide (CO), and ozone ($O_3$). The result showed that the monthly ARE models explained about 13-49% for describing the PM10 concentration.

Utilization of Electrical Conductivity to Improve Prediction Accuracy of Cooking Loss of Pork Loin

  • Kyung Jo;Seonmin Lee;Hyun Gyung Jeong;Dae-Hyun Lee;Sangwon Yoon;Yoonji Chung;Samooel Jung
    • Food Science of Animal Resources
    • /
    • v.43 no.1
    • /
    • pp.113-123
    • /
    • 2023
  • This study investigated the predictability of cooking loss of pork loin through relatively easy and quick measurable quality properties. The pH, color, moisture, protein content, and cooking loss of 100 pork loins were measured. The explanatory variables included in all linear regression models with an adjust-r2 value of ≥0.5 were pH and the protein content. In the linear regression model predicting cooking loss, the highest adjust-r2 value was 0.7, with pH, CIE L*, CIE b*, moisture, and protein content as the explanatory variables. In 30 pork loins, electrical conductivity was additionally measured, and as a result of linear regression analysis for predicting cooking loss, the highest adjust-r2 value was 0.646 with electrical conductivity measured at 40 Hz, with pH and color as the explanatory variables. Ordinal logistic regression analysis was performed to predict the three grades (low, middle, and high) of loin cooking loss using pH, color, and 40 Hz electrical conductivity as the explanatory variables, and the percent concordance was 93.8%. In conclusion, the addition of electrical conductivity as an explanatory variable did not increase the prediction accuracy of the linear regression model for predicting cooking loss; however, it was demonstrated that it is possible to predict and classify the cooking loss grade of pork loin through quality properties that can be measured quickly and easily.

A Study for the Drivers of Movie Box-office Performance (영화흥행 영향요인 선택에 관한 연구)

  • Kim, Yon Hyong;Hong, Jeong Han
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.3
    • /
    • pp.441-452
    • /
    • 2013
  • This study analyzed the relationship between key film and a box office record success factors based on movies released in the first quarter of 2013 in Korea. An over-fitting problem can happen if there are too many explanatory variables inserted to regression model; in addition, there is a risk that the estimator is instable when there is multi-collinearity among the explanatory variables. For this reason, optimal variable selection based on high explanatory variables in box-office performance is of importance. Among the numerous ways to select variables, LASSO estimation applied by a generalized linear model has the smallest prediction error that can efficiently and quickly find variables with the highest explanatory power to box-office performance in order.

Variables Affecting Brand Relationship Quality of Fashion Product Consumers (패션상품 소비자들의 상표관계본질에 영향을 미치는 변인 연구)

  • Chae, Jin-Mie
    • Journal of the Korean Home Economics Association
    • /
    • v.46 no.8
    • /
    • pp.121-133
    • /
    • 2008
  • The purpose of this study was to decipher the BRQ (Brand Relationship Quality) Construct of fashion product consumers and to identify influencing variables. Consumption values of consumers, fashion product attributes and marketing communication instruments were investigated as affecting variables. A questionnaire was distributed to 350 women between the age of 20 and 50 living in Seoul and other metropolitan areas from Mar. 5 to Mar. 18, 2008. Collected data were analyzed by factor analysis, reliability, multiple regression, frequency and percentage using SPSS (Version 15,0) statistical Package. The results were as follows: Firstly, the BRQ Construct of fashion product consumers was proved to be composed of five factors; 'self-connective love', 'pursuing symbol', 'trust', 'intimacy', and 'knowledge'. Secondly, consumption values had a significant effect on the BRQ Construct and their explanatory power was 32%, 'Epistemic value' was the most significant variable affecting BRQ. 'Functional value' had a significant influence on some BRQ factors. Thirdly, explanatory power of fashion product attributes affecting BRQ was 45%. 'Aesthetic attribute' had significant effect on all BRQ factors, while 'functional attribute' showed significant effect on only 'pursuing symbol', 'trust', and 'intimacy'. Finally, marketing communication mixes showed 21% explanatory power and 'word of mouth' and 'display' among marketing communication mixes were found as the most influential variables affecting the BRQ Construct.

Bootstrap Testing for Reliability of Stess-Strength Model with Explanatory Variables

  • Park, Jin-Pyo;Kang, Sang-Gil;Cho, Jang-Sik
    • Journal of the Korean Data and Information Science Society
    • /
    • v.9 no.2
    • /
    • pp.263-273
    • /
    • 1998
  • In this paper, we consider some approximate testings for the reliability of the stress-strength model when the stress X and strength Y each depends linearly on some explanatory variables z and w, respectively. We construct a bootstrap procedure for testing for various values of the reliability and compare the power of the bootstrap test with the test based on Mann-Whitney type estimator by Park et.al.(1996) for small and moderate sample size.

  • PDF

Biplots of Multivariate Data Guided by Linear and/or Logistic Regression

  • Huh, Myung-Hoe;Lee, Yonggoo
    • Communications for Statistical Applications and Methods
    • /
    • v.20 no.2
    • /
    • pp.129-136
    • /
    • 2013
  • Linear regression is the most basic statistical model for exploring the relationship between a numerical response variable and several explanatory variables. Logistic regression secures the role of linear regression for the dichotomous response variable. In this paper, we propose a biplot-type display of the multivariate data guided by the linear regression and/or the logistic regression. The figures show the directional flow of the response variable as well as the interrelationship of explanatory variables.

Bayesian Analysis for a Functional Regression Model with Truncated Errors in Variables

  • Kim, Hea-Jung
    • Journal of the Korean Statistical Society
    • /
    • v.31 no.1
    • /
    • pp.77-91
    • /
    • 2002
  • This paper considers a functional regression model with truncated errors in explanatory variables. We show that the ordinary least squares (OLS) estimators produce bias in regression parameter estimates under misspecified models with ignored errors in the explanatory variable measurements, and then propose methods for analyzing the functional model. Fully parametric frequentist approaches for analyzing the model are intractable and thus Bayesian methods are pursued using a Markov chain Monte Carlo (MCMC) sampling based approach. Necessary theories involved in modeling and computation are provided. Finally, a simulation study is given to illustrate and examine the proposed methods.