• 제목/요약/키워드: stepwise multiple linear regression analysis

검색결과 89건 처리시간 0.026초

Quantitative Analysis by Diffuse Reflectance Infrared Fourier Transform and Linear Stepwise Multiple Regression Analysis I -Simultaneous quantitation of ethenzamide, isopropylantipyrine, caffeine, and allylisopropylacetylurea in tablet by DRIFT and linear stepwise multiple regression analysis-

  • Park, Man-Ki;Yoon, Hye-Ran;Kim, Kyoung-Ho;Cho, Jung-Hwan
    • Archives of Pharmacal Research
    • /
    • 제11권2호
    • /
    • pp.99-113
    • /
    • 1988
  • Quantitation of ethenzamide, isopropylantipyrine and caffeine takes about 41 hrs by conventional GC method. Quantitation of allylisoprorylacetylurea takes about 40 hrs by conventional UV method. But quantitation of them takes about 6 hrs by DRIFT developing method. Each standard and sample sieved, powdered and acquired DRIFT spectrum. Out of them peak of each component was selected and ratio of each peak to standard peak was acquired, and then linear stepwise multiple regression was performed with these data and concentration. Reflectance value, Kubelka-Munk equation and Inverse-Kubelka-Munk equation were modified by us. Inverse-Kubelka-Munk equation completed the deficit of Kubelka-Munk equation. Correlation coefficients acquired by conventioanl GC and UV against DRIFT were more than 0.95.

  • PDF

Quantitative Analysis by Derivative Spectrophotometry (III) -Simultaneous quantitation of vitamin B group and vitamin C in by multiple linear regression analysis-

  • Park, Man-Ki;Cho, Jung-Hwan
    • Archives of Pharmacal Research
    • /
    • 제11권1호
    • /
    • pp.45-51
    • /
    • 1988
  • The feature of resolution enhancement by derivative operation is linked to one of the multivariate analysis, which is multiple linear regression with two options, all possible and stepwise regression. Examined samples were synthetic mixtures of 5 vitamins, thiamine mononitrate, riboflavin phosphate, nicotinamide, pyridoxine hydrochloride and ascorbic acid. All components in mixture were quantified with reasonably good accuracy and precision. Whole data processing procedure was accomplished on-line by the development of three computer programs written in APPLESOFT BASIC language.

  • PDF

중소하천유역의 임계지속시간 결정에 관한 연구 (Study on the Critical Storm Duration Decision of the Rivers Basin)

  • 안승섭;이효정;정도준
    • 한국환경과학회지
    • /
    • 제16권11호
    • /
    • pp.1301-1312
    • /
    • 2007
  • The objective of this study is to propose a critical storm duration forecasting model on storm runoff in small river basin. The critical storm duration data of 582 sub-basin which introduced disaster impact assessment report on the National Emergency Management Agency during the period from 2004 to 2007 were collected, analyzed and studied. The stepwise multiple regression method are used to establish critical storm duration forecasting models(Linear and exponential type). The results of multiple regression analysis discriminated the linear type more than exponential type. The results of multiple linear regression analysis between the critical storm duration and 5 basin characteristics parameters such as basin area, main stream length, average slope of main stream, shape factor and CN showed more than 0.75 of correlation in terms of the multi correlation coefficient.

Evaluation of Sigumjang Aroma by Stepwise Multiple Regression Analysis of Gas Chromatographic Profiles

  • Choi, Ung-Kyu;Kwon, O-Jun;Lee, Eun-Jeong;Son, Dong-Hwa;Cho, Young-Je;Im, Moo-Hyeog;Chung, Yung-Gun
    • Journal of Microbiology and Biotechnology
    • /
    • 제10권4호
    • /
    • pp.476-481
    • /
    • 2000
  • A linear correlation, by the stepwise multiple regression analysis, was found between the sensory test of Sigumjang aroma and the gas chromatographic data which were transformed with logarithm. GC data is the most objective method to evaluate Sigumjang aroma. A multiple correlation coefficient and a determination coefficient of more than 0.9 were obtained at the 9th and 13th steps, respectively. At step 31, the coefficient of determination level of 0.95 was attained. The accuracy of its estimation became higher as the number of the variables entered into the regression model increased. Over 90% of the Sigumjang aroma was explained by 13 compounds indentified on GC. The contributing proportion of the peak 26 was the highest followed by peaks 57 (9.27%), 29 (7.51%), 54 (6.01%), 8 (5.99%), 49 (4.97%), and 13 (4.11%).

  • PDF

한국 프로스포츠 선수들의 연봉에 대한 다변량적 분석 (A Multivariate Analysis of Korean Professional Players Salary)

  • 송종우
    • 응용통계연구
    • /
    • 제21권3호
    • /
    • pp.441-453
    • /
    • 2008
  • 프로스포츠 선수들의 연봉은 선수들의 개인 성적과 팀에 대한 기여도 등으로 결정된다는 가정하에 프로농구와 프로야구 선수들의 전년도 성적으로 다음해 연봉을 예측 분석하였다. 분석에 있어서 data visualization 기법을 통해 변수사이의 관계, 이상점 발견, 모형진단등을 하였다. 다중선형회귀 모형(Multiple Linear Regression)과 트리모형(Regression Tree)을 이용해서 자료를 분석하고 모델간 비교를 했으며, Cross-Validation을 이용해서 최적모델을 선택하였다. 특히, 자동으로 변수선택을 하는 stepwise regression방법을 그냥 사용하기보다는 먼저 설명변수들 사이의 관계나 설명변수와 반응변수 사이의 관계등을 조사하고 나서 이를 통해 선택된 변수들을 가지고 stepwise regression과 regression tree 방법론을 이용해서 적절한 변수 및 최종 모형을 선택하였다. 분석결과, 프로농구의 경우에는 경기당 득점, 어시스트, 자유투 성공수, 경력 등이 중요한 변수였고, 프로야구 투수의 경우에는 경력, 9이닝 당 삼진 수, 방어율, 피홈런 수 등이 중요한 변수였고, 프로야구 타자의 경우에는 경력, 안타 수, FA(자유계약)유무 여부 등이 중요한 변수였다.

다중선형회귀모형에서의 변수선택기법 평가 (Evaluating Variable Selection Techniques for Multivariate Linear Regression)

  • 류나현;김형석;강필성
    • 대한산업공학회지
    • /
    • 제42권5호
    • /
    • pp.314-326
    • /
    • 2016
  • The purpose of variable selection techniques is to select a subset of relevant variables for a particular learning algorithm in order to improve the accuracy of prediction model and improve the efficiency of the model. We conduct an empirical analysis to evaluate and compare seven well-known variable selection techniques for multiple linear regression model, which is one of the most commonly used regression model in practice. The variable selection techniques we apply are forward selection, backward elimination, stepwise selection, genetic algorithm (GA), ridge regression, lasso (Least Absolute Shrinkage and Selection Operator) and elastic net. Based on the experiment with 49 regression data sets, it is found that GA resulted in the lowest error rates while lasso most significantly reduces the number of variables. In terms of computational efficiency, forward/backward elimination and lasso requires less time than the other techniques.

Interpretation of Relationship Between Sesame Yield and It's components under Early Sowing Cropping Condition

  • Shim Kang-Bo;Kang Churl-Whan;Seong Jae-Duck;Hwang Chung-Dong;Suh Duck-Yong
    • 한국작물학회지
    • /
    • 제51권4호
    • /
    • pp.269-273
    • /
    • 2006
  • Multiple linear regression analysis was conducted to interpretate the relationship between sesame grain yield and its components under early sowing cropping condition. The t test showed that stem length, number of capsules per plant, 1000 seeds weight and seed weight per plant gave significant contribution to sesame grain yield, therefore those variables were assumed to mostly influenced components to grain yield of sesame. In the stepwise regression analysis, the predicted equation for sesame grain yield per square meter (Y) was Y = -7.900 + 0.150X1 + 0.461X5 + 15.553X6 + 8.543X7. Meanwhile, F value showed that stem length, number of capsules per plant and seed weight per plant gave significant contribution to sesame grain yield, while 1000 seeds weight did not significantly show. Based on the results, it is reasonable to assume that high yield. potential of sesame under early sowing cropping condition would be obtained by selecting breeding lines with long stem length, number of capsules per plant, and seed weight per plant, which was different result at the late sowing cropping condition in which days to flowering and maturity were assumed to be more affected factors to the sesame grain yield.

가족생활주기에 따른 맞벌이 남녀의 대처전략과 결혼만족도 연구 (A Study on the Coping Strategies and Marital Satisfaction of Dual-Earner Men and Women Across the Family Life Cycle)

  • 이은희
    • 한국사회복지학
    • /
    • 제45권
    • /
    • pp.288-314
    • /
    • 2001
  • The purpose of this study is to examine the strategies that may influence the marital satisfaction of dual-earner men and women. General linear model, Pearson's correlation analysis, Stepwise multiple regression were employed for data analysis. the subjects are 396 dual-earner men and women. The result from the research were as follows: 1) coping strategy use differs significantly by life cycle stage. 2) The following strategies significantly correlated with the level of marital satisfaction: cognitive restructuring, delegation. using social support, modifying standards, personal time reducing. 3) The result of stepwise multiple regression analysis indicated that strategies which predict the level of marital satisfaction were cognitive restructuring, delegating, using social support, personal time reducing. these finding give us significant practical implications for social work intervention.

  • PDF

대학 급식소의 식수예측 모델 개발 (Development of a Forecasting Model for University Food Services)

  • 정라나;양일선;백승희
    • 대한지역사회영양학회지
    • /
    • 제8권6호
    • /
    • pp.910-918
    • /
    • 2003
  • The purposes of this study were to develop a model for university foodservices and to provide management strategies for reducing costs, and increasing productivity and customer satisfaction. The results of this study were as follows : 1) The demands in university food services varied depending on the time series. A fixed pattern was discovered for specific times of the month and semesters. The demand tended to constantly decrease from the beginning of a specific semester to the end, from March to June and from September to December. Moreover, the demand was higher during the first semester than the second semester, within school term than during vacation periods, and during the summer vacation than the winter. 2) Pearson's simple correlation was done between actual customer demand and the factors relating to forecasting the demand. There was a high level of correlation between the actual demand and the demand that had occurred in the previous weeks. 3) By applying the stepwise multiple linear regression analysis to two different university food services providing multiple menu items, a model was developed in terms of four different time series(first semester, second semester, summer vacation, and winter vacation). Customer preference for specific menu items was found to be the most important factor to be considered in forecasting the demand.

실시간 수위 예측을 위한 다중선형회귀 모형의 비교 (Comparison of Different Multiple Linear Regression Models for Real-time Flood Stage Forecasting)

  • 최승용;한건연;김병현
    • 대한토목학회논문집
    • /
    • 제32권1B호
    • /
    • pp.9-20
    • /
    • 2012
  • 최근 수위 예측을 위한 개념적 기반, 수문학적, 물리적 기반 모형 등의 단점을 극복하고자 홍수예측을 위해 자료지향형 모형 중의 하나인 다중선형회귀 모형이 널리 도입되고 있다. 본 연구의 목적은 이러한 다중선형회귀 모형의 서로 다른 회귀계수 선정 방법에 따른 홍수예측 성능을 비교 검토하고 이를 통해 적절한 다중회귀 홍수예측 모형을 구축하는 것이다. 이를 위해 입력자료의 자기상관분석을 통해 독립변수의 시간 규모를 결정한 후 최소 자승법, 가중 최소 자승법, 단계별 선택법의 각기 다른 회귀계수 산정 방법을 이용한 홍수예측 모형을 구축하고 중랑천 유역의 다양한 홍수사상에 대해 적용하였다. 구축된 모형들의 성능을 평가하기 위해 평균제곱근오차, Nash-Suttcliffe 효율계수, 평균절대오차, 수정 결정계수와 같이 4개의 통계지표들을 사용하였다. 모의결과 단계별 선택법을 이용한 다중선형회귀 홍수예측 모형이 가장 정확한 예측 결과를 보였고, 최소자승법을 이용한 홍수예측 모형이 가중 최소자승법을 이용한 홍수예측 모형보다 좀 더 나은 예측 결과를 나타냈다.