• 제목/요약/키워드: Explanatory Variables

검색결과 1,129건 처리시간 0.03초

Improving Deep Learning Models Considering the Time Lags between Explanatory and Response Variables

  • Chaehyeon Kim;Ki Yong Lee
    • Journal of Information Processing Systems
    • /
    • 제20권3호
    • /
    • pp.345-359
    • /
    • 2024
  • A regression model represents the relationship between explanatory and response variables. In real life, explanatory variables often affect a response variable with a certain time lag, rather than immediately. For example, the marriage rate affects the birth rate with a time lag of 1 to 2 years. Although deep learning models have been successfully used to model various relationships, most of them do not consider the time lags between explanatory and response variables. Therefore, in this paper, we propose an extension of deep learning models, which automatically finds the time lags between explanatory and response variables. The proposed method finds out which of the past values of the explanatory variables minimize the error of the model, and uses the found values to determine the time lag between each explanatory variable and response variables. After determining the time lags between explanatory and response variables, the proposed method trains the deep learning model again by reflecting these time lags. Through various experiments applying the proposed method to a few deep learning models, we confirm that the proposed method can find a more accurate model whose error is reduced by more than 60% compared to the original model.

Confidence Intervals for the Stress-strength Models with Explanatory Variables

  • Lee, Sangyeol;Park, Eunsik
    • Journal of the Korean Statistical Society
    • /
    • 제27권4호
    • /
    • pp.435-449
    • /
    • 1998
  • In this paper, we consider the problem of constructing the lower cofidence intervals for the reliability P(X < Y z,w), where the stress X and the strength Y are the random variables with explanatory variables z and w, respectively. As an estimator of the reliability, a Mann-Whitney type statistic is considered. It is shown that under regularity conditions, the proposed estimator is asymptotically normal. Based on the result, the distribution free lower confidence intervals are constructed.

  • PDF

Analysis of PM10 Concentration using Auto-Regressive Error Model at Pyeongtaek City in Korea (자기회귀오차모형을 이용한 평택시 PM10 농도 분석)

  • Lee, Hoon-Ja
    • Journal of Korean Society for Atmospheric Environment
    • /
    • 제27권3호
    • /
    • pp.358-366
    • /
    • 2011
  • The purpose of this study was to analyze the monthly and seasonal PM10 data using the Autoregressive Error (ARE) model at the southern part of the Gyeonggi-Do, Pyeongtaek monitoring site in Korea. In the ARE model, six meteorological variables and four pollution variables are used as the explanatory variables. The six meteorological variables are daily maximum temperature, wind speed, amount of cloud, relative humidity, rainfall, and global radiation. The four air pollution variables are sulfur dioxide ($SO_2$), nitrogen dioxide ($NO_2$), carbon monoxide (CO), and ozone ($O_3$). The result shows that monthly ARE models explained about 17~49% of the PM10 concentration. However, the ARE model could be improved if we add the more explanatory variables in the model.

University Students' Economic Distress and Coping Behavior in Meal Management (대학생의 경제적 불안과 식생활 대처행동)

  • 서정희;홍순명
    • Journal of the Korean Home Economics Association
    • /
    • 제38권1호
    • /
    • pp.39-49
    • /
    • 2000
  • This research investigated the effect of socio-economic variables and economic distress variables on the university students' coping behavior in meal management. The data used in this research included 544 university students in Ulsan Areas. The independent explanatory power of socio-economic variables was larger than economic distress variables. But the explanatory power was increased in the regression analysis model that was included both the socio-economic variables and the economic distress variables. The influencing variables that effected the level of coping behavior in meal management were the amount of discretionary expenditure, gender, status of housing, employment distress and income distress.

  • PDF

Analysis of statistical models on temperature at the Seosan city in Korea (충청남도 서산시 기온의 통계적 모형 연구)

  • Lee, Hoonja
    • Journal of the Korean Data and Information Science Society
    • /
    • 제25권6호
    • /
    • pp.1293-1300
    • /
    • 2014
  • The temperature data influences on various policies of the country. In this article, the autoregressive error (ARE) model has been considered for analyzing the monthly and seasonal temperature data at the northern part of the Chungcheong Namdo, Seosan monitoring site in Korea. In the ARE model, five meteorological variables, four greenhouse gas variables and five pollution variables are used as the explanatory variables for the temperature data set. The five meteorological variables are wind speed, rainfall, radiation, amount of cloud, and relative humidity. The four greenhouse gas variables are carbon dioxide ($CO_2$), methane ($CH_4$), nitrous oxide ($N_2O$), and chlorofluorocarbon ($CFC_{11}$). And the five air pollution explanatory variables are particulate matter ($PM_{10}$), sulfur dioxide ($SO_2$), nitrogen dioxide ($NO_2$), ozone ($O_3$), and carbon monoxide (CO). The result showed that the monthly ARE model explained about 39-63% for describing the temperature. However, the ARE model will be expected better when we add the more explanatory variables in the model.

A Comparison on Confidence Intervals for P(X>Y) with Explanatory Variables

  • Lee, In-Suk;Cho, Jang-Sik
    • Journal of Korean Society for Quality Management
    • /
    • 제25권1호
    • /
    • pp.193-203
    • /
    • 1997
  • In this paper, we obtain some a, pp.oximate confidence intervals for the reliability of the stress-strength model when the stress and strength each depend on some explanatory variables, respectively. Also we compare the confidence intervals via Monte Carlo simulation.

  • PDF

A Study on Sex Role Identity and Makeup Behavior (여대생(女大生)의 성역할(性役割) 정체감(正體感)과 화장(化粧) 행동(行動)에 관(關)한 연구(硏究))

  • Kuh, Ja-Myung;Lee, Kwuy-Young
    • Journal of Fashion Business
    • /
    • 제6권2호
    • /
    • pp.124-136
    • /
    • 2002
  • This objective study were to classify the contents of makeup behavior, to investigate the relationship between makeup behavior and sex role identity, and to examine how the makeup behavior, makeup satisfaction was influenced by sex role identity and demographics. To achieve this, the researchers surveyed 162 women for the ages of 18 through 25. The result of this study are followed. 1) Four factor of makeup behavior were sexual attractiveness, aesthetic, psychological dependence and makeup interest. 2) There were significant positive relationship between makeup behavior and sex role identity. 3) Sexual attractiveness were influenced by femininity, income. The explanatory power of the 2 variables were 8.5%. Aesthetic were influenced by masculinity. The explanatory power of the 1 variable was 9.2%. Psychological dependence were influenced by femininity. The explanatory power of the 1 variable was 8.2%. Makeup interest were influenced by masculinity, age. The explanatory power of the 2 variables were 9.0%. 4 Makeup satisfaction were influenced by sexual attractiveness, aesthetic. The explanatory power of the 2 variables were 22.1%.

An educational tool for regression models with dummy variables using Excel VBA (엑셀 VBA을 이용한 가변수 회귀모형 교육도구 개발)

  • Choi, Hyun Seok;Park, Cheolyong
    • Journal of the Korean Data and Information Science Society
    • /
    • 제24권3호
    • /
    • pp.593-601
    • /
    • 2013
  • We often need to include categorial variables as explanatory variables in regression models. The categorial variables in regression models can be quantified through dummy variables. In this study, we provide an education tool using Excel VBA for displaying regression lines along with test results for regression models with a continuous explanatory variable and one or two categorical explanatory variables. The regression lines with test results are provided step by step for the model(s) with interaction(s), the model(s) without interaction(s) but with dummy variables, and the model without dummy variable(s). With this tool, we can easily understand the meaning of dummy variables and interaction effect through graphics and further decide which model is more suited to the data on hand.

Predicting Gross Box Office Revenue for Domestic Films

  • Song, Jongwoo;Han, Suji
    • Communications for Statistical Applications and Methods
    • /
    • 제20권4호
    • /
    • pp.301-309
    • /
    • 2013
  • This paper predicts gross box office revenue for domestic films using the Korean film data from 2008-2011. We use three regression methods, Linear Regression, Random Forest and Gradient Boosting to predict the gross box office revenue. We only consider domestic films with a revenue size of at least KRW 500 million; relevant explanatory variables are chosen by data visualization and variable selection techniques. The key idea of analyzing this data is to construct the meaningful explanatory variables from the data sources available to the public. Some variables must be categorized to conduct more effective analysis and clustering methods are applied to achieve this task. We choose the best model based on performance in the test set and important explanatory variables are discussed.

A Study on the Design of Tolerance for Process Parameter using Decision Tree and Loss Function (의사결정나무와 손실함수를 이용한 공정파라미터 허용차 설계에 관한 연구)

  • Kim, Yong-Jun;Chung, Young-Bae
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • 제39권1호
    • /
    • pp.123-129
    • /
    • 2016
  • In the manufacturing industry fields, thousands of quality characteristics are measured in a day because the systems of process have been automated through the development of computer and improvement of techniques. Also, the process has been monitored in database in real time. Particularly, the data in the design step of the process have contributed to the product that customers have required through getting useful information from the data and reflecting them to the design of product. In this study, first, characteristics and variables affecting to them in the data of the design step of the process were analyzed by decision tree to find out the relation between explanatory and target variables. Second, the tolerance of continuous variables influencing on the target variable primarily was shown by the application of algorithm of decision tree, C4.5. Finally, the target variable, loss, was calculated by a loss function of Taguchi and analyzed. In this paper, the general method that the value of continuous explanatory variables has been used intactly not to be transformed to the discrete value and new method that the value of continuous explanatory variables was divided into 3 categories were compared. As a result, first, the tolerance obtained from the new method was more effective in decreasing the target variable, loss, than general method. In addition, the tolerance levels for the continuous explanatory variables to be chosen of the major variables were calculated. In further research, a systematic method using decision tree of data mining needs to be developed in order to categorize continuous variables under various scenarios of loss function.