• Title/Summary/Keyword: explanatory variable

Search Result 426, Processing Time 0.029 seconds

Analysis of statistical models for ozone concentrations at the Paju city in Korea (경기도 파주시 오존농도의 통계모형 연구)

  • Lee, Hoon-Ja
    • Journal of the Korean Data and Information Science Society
    • /
    • v.20 no.6
    • /
    • pp.1085-1092
    • /
    • 2009
  • The ozone data is one of the important environmental data for measurement of the atmospheric condition of the country. In this article, the Autoregressive Error (ARE) model and Neural Networks (NN) model have been considered for analyzing the ozone data at the northern part of the Gyeonggi-Do, Paju monitoring site in Korea. In the both ARE model and NN model, seven meteorological variables and four pollution variables are used as the explanatory variables for the ozone data set. The seven meteorological variables are daily maximum temperature, wind speed, relative humidity, rainfall, dew point temperature, steam pressure, and amount of cloud. The four air pollution explanatory variables are Sulfur dioxide ($SO_2$), Nitrogen dioxide ($NO_2$), Cobalt (CO), and Promethium 10 (PM10). The result showed that the NN model is generally better suited for describing the ozone concentration than the ARE model. However, the ARE model will be expected also good when we add the explanatory variables in the model.

  • PDF

A Study for the Drivers of Movie Box-office Performance (영화흥행 영향요인 선택에 관한 연구)

  • Kim, Yon Hyong;Hong, Jeong Han
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.3
    • /
    • pp.441-452
    • /
    • 2013
  • This study analyzed the relationship between key film and a box office record success factors based on movies released in the first quarter of 2013 in Korea. An over-fitting problem can happen if there are too many explanatory variables inserted to regression model; in addition, there is a risk that the estimator is instable when there is multi-collinearity among the explanatory variables. For this reason, optimal variable selection based on high explanatory variables in box-office performance is of importance. Among the numerous ways to select variables, LASSO estimation applied by a generalized linear model has the smallest prediction error that can efficiently and quickly find variables with the highest explanatory power to box-office performance in order.

Analysis of statistical models on temperature at the Seosan city in Korea (충청남도 서산시 기온의 통계적 모형 연구)

  • Lee, Hoonja
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.6
    • /
    • pp.1293-1300
    • /
    • 2014
  • The temperature data influences on various policies of the country. In this article, the autoregressive error (ARE) model has been considered for analyzing the monthly and seasonal temperature data at the northern part of the Chungcheong Namdo, Seosan monitoring site in Korea. In the ARE model, five meteorological variables, four greenhouse gas variables and five pollution variables are used as the explanatory variables for the temperature data set. The five meteorological variables are wind speed, rainfall, radiation, amount of cloud, and relative humidity. The four greenhouse gas variables are carbon dioxide ($CO_2$), methane ($CH_4$), nitrous oxide ($N_2O$), and chlorofluorocarbon ($CFC_{11}$). And the five air pollution explanatory variables are particulate matter ($PM_{10}$), sulfur dioxide ($SO_2$), nitrogen dioxide ($NO_2$), ozone ($O_3$), and carbon monoxide (CO). The result showed that the monthly ARE model explained about 39-63% for describing the temperature. However, the ARE model will be expected better when we add the more explanatory variables in the model.

An educational tool for binary logistic regression model using Excel VBA (엑셀 VBA를 이용한 이분형 로지스틱 회귀모형 교육도구 개발)

  • Park, Cheolyong;Choi, Hyun Seok
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.2
    • /
    • pp.403-410
    • /
    • 2014
  • Binary logistic regression analysis is a statistical technique that explains binary response variable by quantitative or qualitative explanatory variables. In the binary logistic regression model, the probability that the response variable equals, say 1, one of the binary values is to be explained as a transformation of linear combination of explanatory variables. This is one of big barriers that non-statisticians have to overcome in order to understand the model. In this study, an educational tool is developed that explains the need of the binary logistic regression analysis using Excel VBA. More precisely, this tool explains the problems related to modeling the probability of the response variable equal to 1 as a linear combination of explanatory variables and then shows how these problems can be solved through some transformations of the linear combination.

A linearity test statistic in a simple linear regression (단순회귀모형에서 선형성 검정통계량)

  • Park, Chun Gun;Lee, Kyeong Eun
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.2
    • /
    • pp.305-315
    • /
    • 2014
  • In a simple linear regression, a linear relationship between an explanatory variable and a response variable can be easily recognized in the scatter plot of them. The lack of fit test for the replicated data is commonly used for testing the linearity but it is not easy to test the linearity when the explanatory variable is not replicated. In this paper, we propose three new test statistics for testing the linearity regardless of replication using the principle of average slope and validate them through several simulations and empirical studies.

Check for regression coefficient using jackknife and bootstrap methods in clinical data (잭나이프 및 붓스트랩 방법을 이용한 임상자료의 회귀계수 타당성 확인)

  • Sohn, Ki-Cheul;Shin, Im-Hee
    • Journal of the Korean Data and Information Science Society
    • /
    • v.23 no.4
    • /
    • pp.643-648
    • /
    • 2012
  • There are lots of analysis to determine the relation between dependent variable and explanatory variables. Often the regression analysis is used to do this, and we can analyze the how much the explanatory variable can be related with dependent variable and how much the regression model can explain the data. But the validation check of regression model is usually determined by coefficient of determination. We should check the validation of regression coefficient with different methods. This paper introduces the method for validation check the regression coefficient using the jackknife regression and bootstrap regression in clinical data.

Predicting Gross Box Office Revenue for Domestic Films

  • Song, Jongwoo;Han, Suji
    • Communications for Statistical Applications and Methods
    • /
    • v.20 no.4
    • /
    • pp.301-309
    • /
    • 2013
  • This paper predicts gross box office revenue for domestic films using the Korean film data from 2008-2011. We use three regression methods, Linear Regression, Random Forest and Gradient Boosting to predict the gross box office revenue. We only consider domestic films with a revenue size of at least KRW 500 million; relevant explanatory variables are chosen by data visualization and variable selection techniques. The key idea of analyzing this data is to construct the meaningful explanatory variables from the data sources available to the public. Some variables must be categorized to conduct more effective analysis and clustering methods are applied to achieve this task. We choose the best model based on performance in the test set and important explanatory variables are discussed.

Analysis of Factors Affecting Successful Bid Price in Public Construction Technical Bidding (공공공사 기술형 입찰에서의 낙찰가격에 미치는 요인 분석)

  • Lee, Jung-Woong;Yi, Sung-Wook
    • Asia-Pacific Journal of Business
    • /
    • v.13 no.1
    • /
    • pp.213-230
    • /
    • 2022
  • Purpose - The purpose of this study is to find out any potential factors for explanatory variables when calculating the virtual successful bid rate in case of no collusion. Design/methodology/approach - An empirical analysis was conducted in this study with a regression analysis that included 725 bid samples under the public construction technical type bidding. Findings - The result of the basic analysis showed that there are several factors affecting the successful bid rate. First, collusion variable; second, government variable; third, successful bidder design score variable and the number of bidder variable among bidding features; fourth, turnkey variable based on the alternative method; fifth, civil works variable and plant works variable based on building work; sixth, asset variable and the fourth-quarter performance difference variable. However, the technical proposal method variable among bidding features was found to be statistically insignificant in column(4). Research implications or Originality - The significance of this research is that new variable such as the government variable and the fourth-quarter performance difference variable were added in the regression model, which showed statistically significant research results.

Variable Selection Based on Mutual Information

  • Huh, Moon-Y.;Choi, Byong-Su
    • Communications for Statistical Applications and Methods
    • /
    • v.16 no.1
    • /
    • pp.143-155
    • /
    • 2009
  • Best subset selection procedure based on mutual information (MI) between a set of explanatory variables and a dependent class variable is suggested. Derivation of multivariate MI is based on normal mixtures. Several types of normal mixtures are proposed. Also a best subset selection algorithm is proposed. Four real data sets are employed to demonstrate the efficiency of the proposals.

An Escalation Model of Muzakki's Trust and Loyalty towards Payment of Zakat at BAZNAS Indonesia

  • ROZIQ, Ahmad;SULISTIYO, Agung Budi;SHULTHONI, Moch.;ANUGERAH, Eza Gusti
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.8 no.3
    • /
    • pp.551-559
    • /
    • 2021
  • This study aims to analyze the effect of (a) sharia governance on transparency, muzakki's trust, and muzakki's loyalty in paying zakat to zakat management organizations, (b) trust and accountability on muzakki loyalty in paying zakat to zakat management organizations, (c) transparency, loyalty and accountability on the amount of zakat payments to zakat management organizations This is explanatory research with data analysis techniques using the smartPLS method. The sample is 117 people who had paid zakat to BAZNAS, Indonesia. The results showed that the sharia governance variable had a significant effect on the accountability, transparency, and muzakki trust variables. The accountability variable has a significant effect on the muzakki loyalty variable and not on the zakat payment variable. The transparency variable has a significant effect on the muzakki loyalty variable and not on the zakat payment variable. The muzakki trust variable has a significant effect on the muzakki loyalty variable, and the loyalty variable has a significant effect on the zakat payment variable. This new model can explain the variables that affect the increase in trust and loyalty in increasing the amount of zakat payments. Muzakki's trust and loyalty improvement model against zakat payment in BAZNAS Indonesia, explains how BAZNAS organizers can increase zakat revenue, loyalty, trust, and good sharia governance.