• Title/Summary/Keyword: Explanatory model

Search Result 932, Processing Time 0.027 seconds

A credit classification method based on generalized additive models using factor scores of mixtures of common factor analyzers (공통요인분석자혼합모형의 요인점수를 이용한 일반화가법모형 기반 신용평가)

  • Lim, Su-Yeol;Baek, Jang-Sun
    • Journal of the Korean Data and Information Science Society
    • /
    • v.23 no.2
    • /
    • pp.235-245
    • /
    • 2012
  • Logistic discrimination is an useful statistical technique for quantitative analysis of financial service industry. Especially it is not only easy to be implemented, but also has good classification rate. Generalized additive model is useful for credit scoring since it has the same advantages of logistic discrimination as well as accounting ability for the nonlinear effects of the explanatory variables. It may, however, need too many additive terms in the model when the number of explanatory variables is very large and there may exist dependencies among the variables. Mixtures of factor analyzers can be used for dimension reduction of high-dimensional feature. This study proposes to use the low-dimensional factor scores of mixtures of factor analyzers as the new features in the generalized additive model. Its application is demonstrated in the classification of some real credit scoring data. The comparison of correct classification rates of competing techniques shows the superiority of the generalized additive model using factor scores.

Predicting the resting metabolic rate of young and middle-aged healthy Korean adults: A preliminary study

  • Park, Hun-Young;Jung, Won-Sang;Hwang, Hyejung;Kim, Sung-Woo;Kim, Jisu;Lim, Kiwon
    • Korean Journal of Exercise Nutrition
    • /
    • v.24 no.1
    • /
    • pp.9-13
    • /
    • 2020
  • [Purpose] This preliminary study aimed to develop a regression model to estimate the resting metabolic rate (RMR) of young and middle-aged Koreans using various easy-to-measure dependent variables. [Methods] The RMR and the dependent variables for its estimation (e.g. age, height, body mass index, fat-free mass; FFM, fat mass, % body fat, systolic blood pressure, diastolic blood pressure, mean arterial pressure, pulse pressure, and resting heart rate) were measured in 53 young (male n = 18, female n = 16) and middle-aged (male n = 5, female n = 14) healthy adults. Statistical analysis was performed to develop an RMR estimation regression model using the stepwise regression method. [Results] We confirmed that FFM and age were important variables in both the regression models based on the regression coefficients. Mean explanatory power of RMR1 regression models estimated only by FFM was 66.7% (R2) and 66.0% (adjusted R2), while mean standard errors of estimates (SEE) was 219.85 kcal/day. Additionally, mean explanatory power of RMR2 regression models developed by FFM and age were 70.0% (R2) and 68.8% (adjusted R2), while the mean SEE was 210.64 kcal/day. There was no significant difference between the measured RMR by the canopy method using a metabolic gas analyzer and the predicted RMR by RMR1 and RMR2 equations. [Conclusion] This preliminary study developed a regression model to estimate the RMR of young and middle-age healthy Koreans. The regression model was as follows: RMR1 = 24.383 × FFM + 634.310, RMR2 = 23.691 × FFM - 5.745 × age + 852.341.

Development of Long-Term Hospitalization Prediction Model for Minor Automobile Accident Patients (자동차 사고 경상환자의 장기입원 예측 모델 개발)

  • DoegGyu Lee;DongHyun Nam;Sung-Phil Heo
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.28 no.6
    • /
    • pp.11-20
    • /
    • 2023
  • The cost of medical treatment for motor vehicle accidents is increasing every year. In this study, we created a model to predict long-term hospitalization(more than 18 days) among minor patients, which is the main item of increasing traffic accident medical expenses, using five algorithms such as decision tree, and analyzed the factors affecting long-term hospitalization. As a result, the accuracy of the prediction models ranged from 91.377 to 91.451, and there was no significant difference between each model, but the random forest and XGBoost models had the highest accuracy of 91.451. There were significant differences between models in the importance of explanatory variables, such as hospital location, name of disease, and type of hospital, between the long-stay and non-long-stay groups. Model validation was tested by comparing the average accuracy of each model cross-validated(10 times) on the training data with the accuracy of the validation data. To test of the explanatory variables, the chi-square test was used for categorical variables.

Dynamic Model of a Long-term Water Demand Using System Dynamics (시스템 다이나믹스를 이용한 도시 물수요 장기 예측의 동적 모델 연구)

  • Lee, Sangeun;Choi, Dongjin;Park, Heekyungh
    • Journal of Korean Society of Water and Wastewater
    • /
    • v.21 no.1
    • /
    • pp.75-82
    • /
    • 2007
  • When one forecasts urban water demand in a long-term, multivariate model can give more benefits than per capita requirement model. However, the former has shortcomings in that statistically high explanatory power cannot be obtained well, and change in customer behavior cannot be considered. If the past water consumption effects the future water demand, dynamic model may describe real water consumption data better than static model, i.e. the existing multivariate model. On these grounds, this study built dynamic model using system dynamics. From a case study in Seoul and Busan city, dynamic model was expected to forecast water demand more descriptively and reliably.

Characteristics and Forecasting Models of Urban Traffic Generation in Seoul Metropolitan Area (수도권(首都圈)에 있어서 도시교통발생특성(都市交通發生特性)과 그 예측모형(豫測模型))

  • Kim, Dae Oung;Kim, Eon Dong
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.6 no.2
    • /
    • pp.45-55
    • /
    • 1986
  • This study proposes the explanatory indices of urban traffic for the purpose of solving the ambiguity of selection of the explanatory variables, which always raises problems in case of the travel-demand forecasting in the urban transportation planning, and develops optimal urban traffic generation models. The multiple regression models for objective traffic generation are developed by using the proposed explanatory inidces. Objective variables that can be explained by one explanatory variable are modified into simple regression type (Y=bX) in order to ensure the nonnegativity of traffic generation. Similarities are noted in the generaton characteristics of generated traffic from homogeneous land-use activity. Objective variables that can not be explained by multiple variable, such as trip attraction of school and trip generation of social-recreation, are classified by the characteristics of each zone. And traffic generation forecasting models are built as homogeneous zone group, the validity of each model being tested by a statistical method. It is desired that the forecasting precision is in improved by easy and simple method. Accordingly, trip generation rates are calculated from each land-use activity, and trip generation rates for practical application are proposed by considering their stability.

  • PDF

The Study of Decision-Making Model on Small and Medium Sized Management States of Financial Agencies and Monitoring Progressive Insolvency : Case of Mutual Savings Banks

  • Ryu, Ji-Cheol;Lee, Young-Jai
    • Journal of Information Technology Applications and Management
    • /
    • v.15 no.3
    • /
    • pp.43-59
    • /
    • 2008
  • This paper studies small and medium sized financial agency's management states that take advantage of the Korea Federation of Saving Bank's data. It also presents the management state and the decision-making model that monitors progressive insolvency by standardizing transfer path between relevant groups. With this in mind, we extracted explanatory variables for predictions of insolvency by using existing studies of document related insolvency. First of all, we designed a state model based on demarcated groups to take advantage of the self organizing map that groups in line with a neural network. Secondly, we developed a transition model by standardizing the transfer path between individual banks in a state model. Finally, we presented a decision-making model that integrated the state model and the transition model. This paper will provide groundwork for methods of insolvency prevention to businesses in order for them to have a smooth management system in the financial agencies.

  • PDF

Development of the Roundwood Demand Prediction Model

  • Kim, Dong-Jun
    • Journal of Korean Society of Forest Science
    • /
    • v.95 no.2
    • /
    • pp.203-208
    • /
    • 2006
  • This study compared the roundwood demand prediction accuracy of econometric and time-series models using Korean data. The roundwood was divided into softwood and hardwood by species. The econometric model of roundwood demand was specified with four explanatory variables; own price, substitute price, gross domestic product, dummy. The time-series model was specified with lagged endogenous variable. The dummy variable reflected the abrupt decrease in roundwood demand in the late 1990's in the case of softwood roundwood, and the boom of plywood export in the late 1970's in the case of hardwood roundwood. On the other hand, the prediction accuracy was estimated on the basis of Residual Mean Square Errors(RMSE). The results showed that the softwood roundwood demand prediction can be performed more accurately by econometric model than by time-series model. However, the hardwood roundwood demand prediction accuracy was similar in the case of using econometric and time-series model.

The Causal Relationships among Nurses' Perceived Autonomy, Job Satisfaction and Realated Variables (임상간호사의 자율성과 직무만족 관련요인의 인과관계 분석)

  • Lee, Sang-Mi
    • Journal of Korean Academy of Nursing Administration
    • /
    • v.6 no.1
    • /
    • pp.109-122
    • /
    • 2000
  • The present study examined the causal relationships among nurses' perceived autonomy, job satisfaction, work environment (work overload, role conflict, situational support, head nurses' leadership), personal aspects(experiences, need for achievement, professional knowledge and skill) by constructing and testing a theoretical framework. Based on literature review nurses' perceived autonomy and job satisfaction were conceived of as outcomes of the interplay among work environment and personal characteristics. Work environment factors involved work overload, role conflict, situational support, and head nurses' leadership (task oriented leadership, relation oriented leadership). Personal charateristics included experiences, need for achievement, and professional knowledge and skill. Three large general hospital in Chonbuk were selected to participate. The total sample of 516 registered nurses represents a response rate of 92 percent. Data for this study was collected from July to September in 1998 by Questionnaire. Path analyses with LISREL 7.16 program were used to test the fit of the proposed conceptual model to the data and to examine the causal relationship among variables. The result showed that both the proposed model and the modified model fit the data excellently. It needs to be notified, however, that path analisis can not count measurement errors; measurement error can attenuate estimates of coefficient and explanatory power. Nevertheless the model revealed relatively high explanatory power. 42 percent of nurses' perceived autonomy was explained by predicted variables; 32 percent of nurses' job satisfaction was explained by by predicted variables. Tn predicting nurses' perceived autonomy the findings of this study clearly demonstrated the work overload might be the most important variable of all the antecedent variables. Head nurses' relation oriented leadership, situational supports, need for achievement, and role conflict were also found to be important determinants for nurses' perceived autonomy. As for the job satisfaction, role conflict, situational supports, need for the achievement, and head nurses' relation oriented leadership were in turn important predictors. Unexpectedly the result showed perceived autonomy have few influence on job satisfaction. The results were discussed, including directions for the future research and practical implication drawn from the research were suggested.

  • PDF

Unbiasedness or Statistical Efficiency: Comparison between One-stage Tobit of MLE and Two-step Tobit of OLS

  • Park, Sun-Young
    • International Journal of Human Ecology
    • /
    • v.4 no.2
    • /
    • pp.77-87
    • /
    • 2003
  • This paper tried to construct statistical and econometric models on the basis of economic theory in order to discuss the issue of statistical efficiency and unbiasedness including the sample selection bias correcting problem. Comparative analytical tool were one stage Tobit of Maximum Likelihood estimation and Heckman's two-step Tobit of Ordinary Least Squares. The results showed that the adequacy of model for the analysis on demand and choice, we believe that there is no big difference in explanatory variables between the first selection model and the second linear probability model. Since the Lambda, the self- selectivity correction factor, in the Type II Tobit is not statistically significant, there is no self-selectivity in the Type II Tobit model, indicating that Type I Tobit model would give us better explanation in the demand for and choice which is less complicated statistical method rather than type II model.

Development of the Plywood Demand Prediction Model

  • Kim, Dong-Jun
    • Journal of Korean Society of Forest Science
    • /
    • v.97 no.2
    • /
    • pp.140-143
    • /
    • 2008
  • This study compared the plywood demand prediction accuracy of econometric and vector autoregressive models using Korean data. The econometric model of plywood demand was specified with three explanatory variables; own price, construction permit area, dummy. The vector autoregressive model was specified with lagged endogenous variable, own price, construction permit area and dummy. The dummy variable reflected the abrupt decrease in plywood consumption in the late 1990's. The prediction accuracy was estimated on the basis of Residual Mean Squared Error, Mean Absolute Percentage Error and Theil's Inequality Coefficient. The results showed that the plywood demand prediction can be performed more accurately by econometric model than by vector autoregressive model.