• 제목/요약/키워드: Stepwise Regression Method

검색결과 511건 처리시간 0.023초

군집분석 기법과 단계별 회귀모델을 결합한 예측 방법 (A Prediction Method Combining Clustering Method and Stepwise Regression)

  • 정일교;전치혁
    • 한국경영과학회:학술대회논문집
    • /
    • 대한산업공학회/한국경영과학회 2002년도 춘계공동학술대회
    • /
    • pp.949-952
    • /
    • 2002
  • A regression model is used in predicting the response variable given predictor variables However, in case of large number of predictor variables, a regression model has some problems such as multicollinearity, interpretation of the functional relationship between the response and predictors and prediction accuracy. A clustering method and stepwise regression could be used to reduce the amount of data by grouping predictors having similar properties and by selecting the subset of predictors. respectively. This paper proposes a prediction method combining clustering method and stepwise regression. The proposed method fits a global model and local models and predicts responses given new observations by using both models. The paper also compares the performance of proposed method with stepwise regression via a real data of ample obtained in a steel process.

  • PDF

Quantitative Analysis by Diffuse Reflectance Infrared Fourier Transform and Linear Stepwise Multiple Regression Analysis I -Simultaneous quantitation of ethenzamide, isopropylantipyrine, caffeine, and allylisopropylacetylurea in tablet by DRIFT and linear stepwise multiple regression analysis-

  • Park, Man-Ki;Yoon, Hye-Ran;Kim, Kyoung-Ho;Cho, Jung-Hwan
    • Archives of Pharmacal Research
    • /
    • 제11권2호
    • /
    • pp.99-113
    • /
    • 1988
  • Quantitation of ethenzamide, isopropylantipyrine and caffeine takes about 41 hrs by conventional GC method. Quantitation of allylisoprorylacetylurea takes about 40 hrs by conventional UV method. But quantitation of them takes about 6 hrs by DRIFT developing method. Each standard and sample sieved, powdered and acquired DRIFT spectrum. Out of them peak of each component was selected and ratio of each peak to standard peak was acquired, and then linear stepwise multiple regression was performed with these data and concentration. Reflectance value, Kubelka-Munk equation and Inverse-Kubelka-Munk equation were modified by us. Inverse-Kubelka-Munk equation completed the deficit of Kubelka-Munk equation. Correlation coefficients acquired by conventioanl GC and UV against DRIFT were more than 0.95.

  • PDF

Analysis of Client Propensity in Cyber Counseling Using Bayesian Variable Selection

  • Pi, Su-Young
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • 제6권4호
    • /
    • pp.277-281
    • /
    • 2006
  • Cyber counseling, one of the most compatible type of consultation for the information society, enables people to reveal their mental agonies and private problems anonymously, since it does not require face-to-face interview between a counsellor and a client. However, there are few cyber counseling centers which provide high quality and trustworthy service, although the number of cyber counseling center has highly increased. Therefore, this paper is intended to enable an appropriate consultation for each client by analyzing client propensity using Bayesian variable selection. Bayesian variable selection is superior to stepwise regression analysis method in finding out a regression model. Stepwise regression analysis method, which has been generally used to analyze individual propensity in linear regression model, is not efficient since it is hard to select a proper model for its own defects. In this paper, based on the case database of current cyber counseling centers in the web, we will analyze clients' propensities using Bayesian variable selection to enable individually target counseling and to activate cyber counseling programs.

지식에 관한 간호결과도구의 타당성 조사 (Validation of Nursing Care Sensitive Outcomes related to Knowledge)

  • 이은주
    • 대한간호학회지
    • /
    • 제33권5호
    • /
    • pp.625-632
    • /
    • 2003
  • Purpose: The purpose of this study was to assess the importance and sensitivity to nursing interventions of four nursing sensitive nursing outcomes selected from the Nursing Outcomes Classification (NOC). Outcomes for this study were 'Knowledge: Diet', 'Knowledge: Disease Process', 'Knowledge: Energy Conservation', and 'Knowledge: Health Behaviors'. Method: Data were collected from 183 nurses working in 2 university hospitals. Fehring method was used to estimate outcome and indicators' content and sensitivity validity. Multiple and stepwise regression were used to evaluate relationships between each outcome and its indicators. Result: Results confirmed the importance and nursing sensitivity of outcomes and their indicators. Key indicators of each outcomes were found by multiple regression. 'Knowledge: Diet' was suggested for adding new indicators because the variance explained by indicators was relatively low. Not all of the indicators selected for stepwise regression model were rated for highly in Fehring method. The R² statistics of the stepwise regression models were between 18 and 63% in importance by selected indicators and between 34 and 68% in contribution by selected indicators. Conclusion: This study refined what outcomes and indicators will be useful in clinical practice. Further research will be required for the revision of outcome and indicators of NOC. However, this study refined what outcomes and indicators will be useful in clinical practice.

A Climate Prediction Method Based on EMD and Ensemble Prediction Technique

  • Bi, Shuoben;Bi, Shengjie;Chen, Xuan;Ji, Han;Lu, Ying
    • Asia-Pacific Journal of Atmospheric Sciences
    • /
    • 제54권4호
    • /
    • pp.611-622
    • /
    • 2018
  • Observed climate data are processed under the assumption that their time series are stationary, as in multi-step temperature and precipitation prediction, which usually leads to low prediction accuracy. If a climate system model is based on a single prediction model, the prediction results contain significant uncertainty. In order to overcome this drawback, this study uses a method that integrates ensemble prediction and a stepwise regression model based on a mean-valued generation function. In addition, it utilizes empirical mode decomposition (EMD), which is a new method of handling time series. First, a non-stationary time series is decomposed into a series of intrinsic mode functions (IMFs), which are stationary and multi-scale. Then, a different prediction model is constructed for each component of the IMF using numerical ensemble prediction combined with stepwise regression analysis. Finally, the results are fit to a linear regression model, and a short-term climate prediction system is established using the Visual Studio development platform. The model is validated using temperature data from February 1957 to 2005 from 88 weather stations in Guangxi, China. The results show that compared to single-model prediction methods, the EMD and ensemble prediction model is more effective for forecasting climate change and abrupt climate shifts when using historical data for multi-step prediction.

Relationship between Aiming Patterns and Scores in Archery Shooting

  • Quan, ChengHao;Lee, Sangmin
    • 한국운동역학회지
    • /
    • 제26권4호
    • /
    • pp.353-360
    • /
    • 2016
  • Objective: The aim of this study was to investigate the relationship between aiming patterns and scores in archery shooting. Method: Four (N = 4) elementary-level archers from middle school participated in this study. Aiming pattern was defined by averaged acceleration data measured from accelerometers attached on the body during the aiming phase in archery shooting. Stepwise multiple regression analysis was used to test whether a model incorporating aiming patterns from all nine accelerometers could predict the scores. In order to extract period of interest (POI) data from raw data, a Dynamic Time Warping (DTW)-based extraction method was presented. Results: Regression models for all four subjects are conducted with different significance levels and variables. The significance levels of the regression models are 0.12%, 1.61%, 0.55%, and 0.4% respectively; the $R^2$ of the regression models is 64.04%, 27.93%, 72.02%, and 45.62% respectively; and the maximum significance levels of parameters in the regression models are 1.26%, 4.58%, 5.1%, and 4.98% respectively. Conclusion: Our results indicated that the relationship between aiming patterns and scores was described by a regression model. Analysis of the significance levels, variables, and parameters of the regression model showed that our approach - regression analysis with DTW - is an effective way to raise scores in archery shooting.

뇌혈관질환자에게 적용가능한 간호결과 분류체계의 타당성 검증 (Validation of the Nursing Outcomes Classification on Cerebrovascular Patients)

  • 김영화;소향숙;이은주;고은
    • 성인간호학회지
    • /
    • 제20권3호
    • /
    • pp.489-499
    • /
    • 2008
  • Purpose: The purpose of this study was to assess the importance and contribution of 9 nursing outcomes and their indicators that could be applied to cerebrovascular patients. Methods: Data were collected from 175 neurosurgical nurses working at two university affiliated hospitals and five secondary hospitals located in Gwang-ju. The Fehring method was used to estimate outcome content validity(OCV) and outcome sensitivity validity(OSV) of nursing outcomes and their indicators. Stepwise regression was used to evaluate relationship between outcome and its indicators. Results: The core outcomes identified by the OCV were Tissue Perfusion: Cerebral, Nutritional Status, Neurological Status, and Wound Healing: Primary Intention, whereas highly supportive outcomes identified by the OSV were Oral Health, Self-Care: ADL, and Nutritional Status. All the critical indicators selected for Fehring method were not included in stepwise regression model. By stepwise regression analysis, the indicators explained outcomes from 19% to 52% in importance and from 21% to 45% in contribution. Conclusion: This study identified core and supportive outcomes and their indicators which could be useful to assess the physical status of cerebrovascular patients. Further research is needed for the revision and development of nursing outcomes and their indicators at neurological nursing area.

  • PDF

Evaluation of Sigumjang Aroma by Stepwise Multiple Regression Analysis of Gas Chromatographic Profiles

  • Choi, Ung-Kyu;Kwon, O-Jun;Lee, Eun-Jeong;Son, Dong-Hwa;Cho, Young-Je;Im, Moo-Hyeog;Chung, Yung-Gun
    • Journal of Microbiology and Biotechnology
    • /
    • 제10권4호
    • /
    • pp.476-481
    • /
    • 2000
  • A linear correlation, by the stepwise multiple regression analysis, was found between the sensory test of Sigumjang aroma and the gas chromatographic data which were transformed with logarithm. GC data is the most objective method to evaluate Sigumjang aroma. A multiple correlation coefficient and a determination coefficient of more than 0.9 were obtained at the 9th and 13th steps, respectively. At step 31, the coefficient of determination level of 0.95 was attained. The accuracy of its estimation became higher as the number of the variables entered into the regression model increased. Over 90% of the Sigumjang aroma was explained by 13 compounds indentified on GC. The contributing proportion of the peak 26 was the highest followed by peaks 57 (9.27%), 29 (7.51%), 54 (6.01%), 8 (5.99%), 49 (4.97%), and 13 (4.11%).

  • PDF

중소하천유역의 임계지속시간 결정에 관한 연구 (Study on the Critical Storm Duration Decision of the Rivers Basin)

  • 안승섭;이효정;정도준
    • 한국환경과학회지
    • /
    • 제16권11호
    • /
    • pp.1301-1312
    • /
    • 2007
  • The objective of this study is to propose a critical storm duration forecasting model on storm runoff in small river basin. The critical storm duration data of 582 sub-basin which introduced disaster impact assessment report on the National Emergency Management Agency during the period from 2004 to 2007 were collected, analyzed and studied. The stepwise multiple regression method are used to establish critical storm duration forecasting models(Linear and exponential type). The results of multiple regression analysis discriminated the linear type more than exponential type. The results of multiple linear regression analysis between the critical storm duration and 5 basin characteristics parameters such as basin area, main stream length, average slope of main stream, shape factor and CN showed more than 0.75 of correlation in terms of the multi correlation coefficient.

회귀분석에 기초한 균등화 방법에 관한 연구 (A study on equating method based on regression analysis)

  • 조장식
    • Journal of the Korean Data and Information Science Society
    • /
    • 제21권3호
    • /
    • pp.513-521
    • /
    • 2010
  • 대부분의 대학들은 교수업적평가를 위해 강의평가제도를 실시하고 있다. 그러나 강의평가의 결과는 강좌규모, 강의형태, 개설학년, 이수구분, 평균평점 등과 같은 개설강좌의 특성에 많은 영향을 받게 된다. 따라서 이러한 각 강좌특성들이 강의평가 결과에 영향을 미치는 효과를 제거하지 않는다면, 담당교수가 강의평가 결과에 대한 공정성과 객관성을 신뢰할 수 없게 만들 정도로 심각한 편의를 갖게 된다. 따라서 강의평가의 공정성을 위해 강좌특성에 따른 편의를 제거하기 위한 사후조정된 점수가 요구된다. 따라서 본 연구에서는 단계적 변수선택법에 의한 회귀분석을 이용하여 강의평가 결과에 대한 균등화 방법을 이용하여 사후조정된 점수를 계산하는 방법을 제안한다. 그리고 제안된 방법은 기존의 방법과 비교를 하였다.