• Title/Summary/Keyword: Stepwise 회귀분석

Search Result 445, Processing Time 0.03 seconds

A Prediction Method Combining Clustering Method and Stepwise Regression (군집분석 기법과 단계별 회귀모델을 결합한 예측 방법)

  • Chong Il-gyo;Jun Chi-Hyuck
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 2002.05a
    • /
    • pp.949-952
    • /
    • 2002
  • A regression model is used in predicting the response variable given predictor variables However, in case of large number of predictor variables, a regression model has some problems such as multicollinearity, interpretation of the functional relationship between the response and predictors and prediction accuracy. A clustering method and stepwise regression could be used to reduce the amount of data by grouping predictors having similar properties and by selecting the subset of predictors. respectively. This paper proposes a prediction method combining clustering method and stepwise regression. The proposed method fits a global model and local models and predicts responses given new observations by using both models. The paper also compares the performance of proposed method with stepwise regression via a real data of ample obtained in a steel process.

  • PDF

Applying regional regression analysis of the hydrologic model parameters for assessing climate change impacts in the ungaged watershed (미계측 유역의 기후변화 영향평가를 위한 수문모형 매개변수의 지역회귀분석 적용)

  • Kim, Youngil;Seo, Seung Beom;Kim, Sung Jin;Kim, Young-Oh
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2017.05a
    • /
    • pp.219-219
    • /
    • 2017
  • 상대적으로 유역의 관측 자료가 충분하지 못하거나 검증되지 않았을 경우 미계측 유역으로 정의되며 수문모형의 매개변수 검정을 할 수 없으므로 다른 방법을 고안해야 한다. 이를 위해 기존 연구에서는 지역적 특성을 고려한 지역회기분석을 통해 미계측 유역의 유량을 산정하였는데, 대부분 유역의 특성과 연 평균 유출량 자료의 관계를 이용한 회귀식으로 실시간 유량의 변화를 고려하기 어려웠다. 본 연구에서는 개념적 강우-유출모형으로 많이 사용되고 있는 개념적 수문모형인 GR4J의 매개변수에 대해 미계측 유역의 특성을 고려한 변수들을 이용하여 회귀식을 구하고 그 적용성을 평가하였다. 이를 통해 미계측 유역의 유량 시계열 자료를 생성할 수 있었다. 또한 IPCC에서 발간한 AR5의 RCP 4.5 시나리오를 적용하여 미래 유출량을 산정하였다. 우선 지역회귀분석을 적용하기 위해 수문모형을 이용한 계측 유역의 유출량을 구하였으며 22개의 전국 댐 상류 지점을 기준으로 SCE 알고리즘을 이용하여 GR4J의 최적 매개변수를 구하고 각 유역별로 물리적, 지형적, 기상학적 특성을 고려하여 11개의 변수를 선택하였다. 각 변수간 다중공선성(Multicollinearity)를 고려하기 위해 VIF(Variation Inflation Factor) test를 적용하여 최종 7개의 변수를 선정하고 단계별 회귀방법(Stepwise regression)을 이용하여 GR4J의 매개변수별 회귀식을 생성하였다.

  • PDF

A Convergence Study of Factors Affecting Life Satisfaction for Adolescents with Allergic Disease (알레르기 질환이 있는 청소년의 삶의 만족도 영향요인의 융합연구)

  • Lee, Eun Jee
    • Journal of the Korea Convergence Society
    • /
    • v.10 no.3
    • /
    • pp.355-362
    • /
    • 2019
  • The aim of this study was to identify factors affecting life satisfaction for adolescents with asthma or atopic dermatitis. Korean Child and Youth Panel Survey (KCYPS) Data in 2016 was used. The data were analyzed by Chi-square test, t-test, one-way ANOVA and stepwise multiple linear regression. In multiple stepwise regression analysis, less depression, higher resilience, higher self-esteem, more affectionate parenting behavior, lower age enhances the life satisfaction of adolescents with allergic disease. Educational program is necessary to improve the life satisfaction of adolescents with asthma or atopic dermatitis which is reflecting the result of this study.

The correlation and regression analyses based on variable selection for the university evaluation index (대학 평가지표들에 대한 상관분석과 변수선택에 의한 선형모형추정)

  • Song, Pil-Jun;Kim, Jong-Tae
    • Journal of the Korean Data and Information Science Society
    • /
    • v.23 no.3
    • /
    • pp.457-465
    • /
    • 2012
  • The purpose of this study is to analyze the association between indicators and to find statistical models based on important indicators at 'College Notifier' in Korea Council for University Education. First, Pearson correlation coefficients are used to find statistically significant correlations. By variable selection method, the important indicators are selected and their coefficients are estimated. As variable selection method, backward and stepwise methods are employed.

An Analysis Study on the Doping Intentions of Athletes using Stepwise Regression Analysis

  • Youn-Suk Han;Jong-Hwa Park
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.5
    • /
    • pp.171-177
    • /
    • 2023
  • This study aims to provide useful information for prevention of doping by investigating and verifying relationships among demographic factors such as athletic career and experience of anti-doping education, controlled motivation, attitude toward anti-doping, perceived behavioral control factors and doping intentions to verify factors affecting doping intentions of domestic elite athletes based on the advance studies that have been carried out various theoretical approaches so far. Method: This study analyzed using SPSS 27.0 program. First, this study confirmed a multicollinearity problem by conducting Pearson's correlation analysis to examine correlation between variables. And this study conducted stepwise multiple linear regression to confirm how the variables affect doping intentions. Result: Study results show that all factors such as athletic career, experience of anti-doping education, controlled motivation, attitude toward anti-doping and perceived behavioral control have a significant impact on doping intentions, and this study verified significant impact by putting variables in order of each influence. As a result of verification, this study confirmed that controlled motivation has the greatest influence, and perceived behavioral control toward doping, experience of anti-doping education, attitude toward and athletic career came next in order.

The Prediction of Ship's Powering Performance Using Statistical Analysis and Theoretical Formulation (통계해석과 이론식을 이용한 저항추진성능 추정)

  • Eun-Chan,Kim;Sung-Wan,Hong;Seung-Il,Yang
    • Bulletin of the Society of Naval Architects of Korea
    • /
    • v.26 no.4
    • /
    • pp.14-26
    • /
    • 1989
  • This paper describes the method of statistical analysis and its programs for predicting the ship's powering performance. The equation for the wavemaking resistance coefficient is derived as the sectional area coefficients by using the wavemaking resistance theory and its regression coefficients are determined from the regression analysis of the model test results. The equations for the form factor, wake franction and thrust deduction fraction are derived by purely regression analysis of the principal dimensions, sectional area coefficients and model test results. The statistical analyses are performed using the various descriptive statistic and stepwise regression analysis techniques. The powering performance prognosis program is developed to cover the prediction of resistance coefficients, propulsive coefficients, propeller open-water efficiency and various scale effect corrections.

  • PDF

A Study on Patterning and Grading by the Impact of Traffic Culture Index (교통문화지수 영향요인에 의한 유형화와 영향정도에 관한 연구)

  • Jeong Cheal-Woo;Jung Hun-Young;Ko Sang-Sean
    • Journal of Navigation and Port Research
    • /
    • v.30 no.1 s.107
    • /
    • pp.35-43
    • /
    • 2006
  • This study suggests strategies to prevent traffic accidents by utilizing impact factors per each cluster and the typical patterns of 81 cities based on the statistical analysis of the data concerning the TCI which was developed from the partnership of the Traffic Safety Authority and the Green Traffic Movement Corporation in 2002 and 2003. The Principal Component Analysis and Cluster Analysis on impact factors and TCI result in 4 components and 4 clusters. Also as the results of Stepwise Multiple Regression Analysis examining the relationship between impact factors and TCI, R2 values of these models show high to all clusters. According to the results, we suggest strategies to prevent traffic accidents per cluster concretely and it is necessary to analyze how effective the invested facilities are in reducing traffic accidents in the future.

Taste Characteristics of Kanjang Made with Barley Bran (보리등겨로 제조한 간장의 맛성분 특성)

  • Son, Dong-Hwa;Kwon, O-Jun;Choi, Ung-Kyu;Kwon, O-Jin;Lee, Suk-Il;Im, Moo-Hyeg;Kwon, Kwang-Il;Kim, Sung-Hong;Chung, Yung-Gun
    • Applied Biological Chemistry
    • /
    • v.45 no.1
    • /
    • pp.18-24
    • /
    • 2002
  • This study was conducted to find out optimum conditions for kanjang fermented with barley bran. The correlation between taste components and sensory evaluation score was analyzed with stepwise multiple regression analysis. It was revealed that the taste of kanjang was explained with the mix of free amino acids, free sugars and organic acids. The highest multiple correlation coefficient was obtained from absolute value transformed with logarithm. Thus, stepwise multiple regression analysis was conducted with absolute value transformed with logarithm, for which F-value was highest and standard error of estimation was lowest among the multiple regression models transformed with six variables. The stepwise multiple regression analysis showed that the taste components which most contribute to the quality of taste of kanjang fermented with barley bran was salty taste component followed by palatable taste component, and bitter taste component.

A Multivariate Analysis of Korean Professional Players Salary (한국 프로스포츠 선수들의 연봉에 대한 다변량적 분석)

  • Song, Jong-Woo
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.3
    • /
    • pp.441-453
    • /
    • 2008
  • We analyzed Korean professional basketball and baseball players salary under the assumption that it depends on the personal records and contribution to the team in the previous year. We extensively used data visualization tools to check the relationship among the variables, to find outliers and to do model diagnostics. We used multiple linear regression and regression tree to fit the model and used cross-validation to find an optimal model. We check the relationship between variables carefully and chose a set of variables for the stepwise regression instead of using all variables. We found that points per game, number of assists, number of free throw successes, career are important variables for the basketball players. For the baseball pitchers, career, number of strike-outs per 9 innings, ERA, number of homeruns are important variables. For the baseball hitters, career, number of hits, FA are important variables.