• Title/Summary/Keyword: stepwise multiple linear regression analysis

Search Result 89, Processing Time 0.027 seconds

Quantitative Analysis by Diffuse Reflectance Infrared Fourier Transform and Linear Stepwise Multiple Regression Analysis I -Simultaneous quantitation of ethenzamide, isopropylantipyrine, caffeine, and allylisopropylacetylurea in tablet by DRIFT and linear stepwise multiple regression analysis-

  • Park, Man-Ki;Yoon, Hye-Ran;Kim, Kyoung-Ho;Cho, Jung-Hwan
    • Archives of Pharmacal Research
    • /
    • v.11 no.2
    • /
    • pp.99-113
    • /
    • 1988
  • Quantitation of ethenzamide, isopropylantipyrine and caffeine takes about 41 hrs by conventional GC method. Quantitation of allylisoprorylacetylurea takes about 40 hrs by conventional UV method. But quantitation of them takes about 6 hrs by DRIFT developing method. Each standard and sample sieved, powdered and acquired DRIFT spectrum. Out of them peak of each component was selected and ratio of each peak to standard peak was acquired, and then linear stepwise multiple regression was performed with these data and concentration. Reflectance value, Kubelka-Munk equation and Inverse-Kubelka-Munk equation were modified by us. Inverse-Kubelka-Munk equation completed the deficit of Kubelka-Munk equation. Correlation coefficients acquired by conventioanl GC and UV against DRIFT were more than 0.95.

  • PDF

Quantitative Analysis by Derivative Spectrophotometry (III) -Simultaneous quantitation of vitamin B group and vitamin C in by multiple linear regression analysis-

  • Park, Man-Ki;Cho, Jung-Hwan
    • Archives of Pharmacal Research
    • /
    • v.11 no.1
    • /
    • pp.45-51
    • /
    • 1988
  • The feature of resolution enhancement by derivative operation is linked to one of the multivariate analysis, which is multiple linear regression with two options, all possible and stepwise regression. Examined samples were synthetic mixtures of 5 vitamins, thiamine mononitrate, riboflavin phosphate, nicotinamide, pyridoxine hydrochloride and ascorbic acid. All components in mixture were quantified with reasonably good accuracy and precision. Whole data processing procedure was accomplished on-line by the development of three computer programs written in APPLESOFT BASIC language.

  • PDF

Study on the Critical Storm Duration Decision of the Rivers Basin (중소하천유역의 임계지속시간 결정에 관한 연구)

  • Ahn, Seung-Seop;Lee, Hyeo-Jung;Jung, Do-June
    • Journal of Environmental Science International
    • /
    • v.16 no.11
    • /
    • pp.1301-1312
    • /
    • 2007
  • The objective of this study is to propose a critical storm duration forecasting model on storm runoff in small river basin. The critical storm duration data of 582 sub-basin which introduced disaster impact assessment report on the National Emergency Management Agency during the period from 2004 to 2007 were collected, analyzed and studied. The stepwise multiple regression method are used to establish critical storm duration forecasting models(Linear and exponential type). The results of multiple regression analysis discriminated the linear type more than exponential type. The results of multiple linear regression analysis between the critical storm duration and 5 basin characteristics parameters such as basin area, main stream length, average slope of main stream, shape factor and CN showed more than 0.75 of correlation in terms of the multi correlation coefficient.

Evaluation of Sigumjang Aroma by Stepwise Multiple Regression Analysis of Gas Chromatographic Profiles

  • Choi, Ung-Kyu;Kwon, O-Jun;Lee, Eun-Jeong;Son, Dong-Hwa;Cho, Young-Je;Im, Moo-Hyeog;Chung, Yung-Gun
    • Journal of Microbiology and Biotechnology
    • /
    • v.10 no.4
    • /
    • pp.476-481
    • /
    • 2000
  • A linear correlation, by the stepwise multiple regression analysis, was found between the sensory test of Sigumjang aroma and the gas chromatographic data which were transformed with logarithm. GC data is the most objective method to evaluate Sigumjang aroma. A multiple correlation coefficient and a determination coefficient of more than 0.9 were obtained at the 9th and 13th steps, respectively. At step 31, the coefficient of determination level of 0.95 was attained. The accuracy of its estimation became higher as the number of the variables entered into the regression model increased. Over 90% of the Sigumjang aroma was explained by 13 compounds indentified on GC. The contributing proportion of the peak 26 was the highest followed by peaks 57 (9.27%), 29 (7.51%), 54 (6.01%), 8 (5.99%), 49 (4.97%), and 13 (4.11%).

  • PDF

A Multivariate Analysis of Korean Professional Players Salary (한국 프로스포츠 선수들의 연봉에 대한 다변량적 분석)

  • Song, Jong-Woo
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.3
    • /
    • pp.441-453
    • /
    • 2008
  • We analyzed Korean professional basketball and baseball players salary under the assumption that it depends on the personal records and contribution to the team in the previous year. We extensively used data visualization tools to check the relationship among the variables, to find outliers and to do model diagnostics. We used multiple linear regression and regression tree to fit the model and used cross-validation to find an optimal model. We check the relationship between variables carefully and chose a set of variables for the stepwise regression instead of using all variables. We found that points per game, number of assists, number of free throw successes, career are important variables for the basketball players. For the baseball pitchers, career, number of strike-outs per 9 innings, ERA, number of homeruns are important variables. For the baseball hitters, career, number of hits, FA are important variables.

Evaluating Variable Selection Techniques for Multivariate Linear Regression (다중선형회귀모형에서의 변수선택기법 평가)

  • Ryu, Nahyeon;Kim, Hyungseok;Kang, Pilsung
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.42 no.5
    • /
    • pp.314-326
    • /
    • 2016
  • The purpose of variable selection techniques is to select a subset of relevant variables for a particular learning algorithm in order to improve the accuracy of prediction model and improve the efficiency of the model. We conduct an empirical analysis to evaluate and compare seven well-known variable selection techniques for multiple linear regression model, which is one of the most commonly used regression model in practice. The variable selection techniques we apply are forward selection, backward elimination, stepwise selection, genetic algorithm (GA), ridge regression, lasso (Least Absolute Shrinkage and Selection Operator) and elastic net. Based on the experiment with 49 regression data sets, it is found that GA resulted in the lowest error rates while lasso most significantly reduces the number of variables. In terms of computational efficiency, forward/backward elimination and lasso requires less time than the other techniques.

Interpretation of Relationship Between Sesame Yield and It's components under Early Sowing Cropping Condition

  • Shim Kang-Bo;Kang Churl-Whan;Seong Jae-Duck;Hwang Chung-Dong;Suh Duck-Yong
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.51 no.4
    • /
    • pp.269-273
    • /
    • 2006
  • Multiple linear regression analysis was conducted to interpretate the relationship between sesame grain yield and its components under early sowing cropping condition. The t test showed that stem length, number of capsules per plant, 1000 seeds weight and seed weight per plant gave significant contribution to sesame grain yield, therefore those variables were assumed to mostly influenced components to grain yield of sesame. In the stepwise regression analysis, the predicted equation for sesame grain yield per square meter (Y) was Y = -7.900 + 0.150X1 + 0.461X5 + 15.553X6 + 8.543X7. Meanwhile, F value showed that stem length, number of capsules per plant and seed weight per plant gave significant contribution to sesame grain yield, while 1000 seeds weight did not significantly show. Based on the results, it is reasonable to assume that high yield. potential of sesame under early sowing cropping condition would be obtained by selecting breeding lines with long stem length, number of capsules per plant, and seed weight per plant, which was different result at the late sowing cropping condition in which days to flowering and maturity were assumed to be more affected factors to the sesame grain yield.

A Study on the Coping Strategies and Marital Satisfaction of Dual-Earner Men and Women Across the Family Life Cycle (가족생활주기에 따른 맞벌이 남녀의 대처전략과 결혼만족도 연구)

  • Lee, Eun-Hee
    • Korean Journal of Social Welfare
    • /
    • v.45
    • /
    • pp.288-314
    • /
    • 2001
  • The purpose of this study is to examine the strategies that may influence the marital satisfaction of dual-earner men and women. General linear model, Pearson's correlation analysis, Stepwise multiple regression were employed for data analysis. the subjects are 396 dual-earner men and women. The result from the research were as follows: 1) coping strategy use differs significantly by life cycle stage. 2) The following strategies significantly correlated with the level of marital satisfaction: cognitive restructuring, delegation. using social support, modifying standards, personal time reducing. 3) The result of stepwise multiple regression analysis indicated that strategies which predict the level of marital satisfaction were cognitive restructuring, delegating, using social support, personal time reducing. these finding give us significant practical implications for social work intervention.

  • PDF

Development of a Forecasting Model for University Food Services (대학 급식소의 식수예측 모델 개발)

  • 정라나;양일선;백승희
    • Korean Journal of Community Nutrition
    • /
    • v.8 no.6
    • /
    • pp.910-918
    • /
    • 2003
  • The purposes of this study were to develop a model for university foodservices and to provide management strategies for reducing costs, and increasing productivity and customer satisfaction. The results of this study were as follows : 1) The demands in university food services varied depending on the time series. A fixed pattern was discovered for specific times of the month and semesters. The demand tended to constantly decrease from the beginning of a specific semester to the end, from March to June and from September to December. Moreover, the demand was higher during the first semester than the second semester, within school term than during vacation periods, and during the summer vacation than the winter. 2) Pearson's simple correlation was done between actual customer demand and the factors relating to forecasting the demand. There was a high level of correlation between the actual demand and the demand that had occurred in the previous weeks. 3) By applying the stepwise multiple linear regression analysis to two different university food services providing multiple menu items, a model was developed in terms of four different time series(first semester, second semester, summer vacation, and winter vacation). Customer preference for specific menu items was found to be the most important factor to be considered in forecasting the demand.

Comparison of Different Multiple Linear Regression Models for Real-time Flood Stage Forecasting (실시간 수위 예측을 위한 다중선형회귀 모형의 비교)

  • Choi, Seung Yong;Han, Kun Yeun;Kim, Byung Hyun
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.32 no.1B
    • /
    • pp.9-20
    • /
    • 2012
  • Recently to overcome limitations of conceptual, hydrological and physics based models for flood stage forecasting, multiple linear regression model as one of data-driven models have been widely adopted for forecasting flood streamflow(stage). The objectives of this study are to compare performance of different multiple linear regression models according to regression coefficient estimation methods and determine most effective multiple linear regression flood stage forecasting models. To do this, the time scale was determined through the autocorrelation analysis of input data and different flood stage forecasting models developed using regression coefficient estimation methods such as LS(least square), WLS(weighted least square), SPW(stepwise) was applied to flood events in Jungrang stream. To evaluate performance of established models, fours statistical indices were used, namely; Root mean square error(RMSE), Nash Sutcliffe efficiency coefficient (NSEC), mean absolute error (MAE), adjusted coefficient of determination($R^{*2}$). The results show that the flood stage forecasting model using SPW(stepwise) parameter estimation can carry out the river flood stage prediction better in comparison with others, and the flood stage forecasting model using LS(least square) parameter estimation is also found to be slightly better than the flood stage forecasting model using WLS(weighted least square) parameter estimation.