• Title/Summary/Keyword: Binary Logistic Model

Search Result 160, Processing Time 0.027 seconds

A study on log-density with log-odds graph for variable selection in logistic regression (로지스틱회귀모형의 변수선택에서 로그-오즈 그래프를 통한 로그-밀도비 연구)

  • Kahng, Myung-Wook;Shin, Eun-Young
    • Journal of the Korean Data and Information Science Society
    • /
    • v.23 no.1
    • /
    • pp.99-111
    • /
    • 2012
  • The log-density ratio of the conditional densities of the predictors given the response variable provides useful information for variable selection in the logistic regression model. In this paper, we consider the predictors that are needed and how they should be included in the model. If the conditional distributions are skewed, the distributions can be considered as gamma distributions. Under this assumption, linear and log terms are generally included in the model. The log-odds graph is a very useful graphical tool in this study. A graphical study is presented which shows that if the conditional distributions of x|y for the two groups overlap significantly, we need both the linear and quadratic terms. On the contrary, if they are well separated, only the linear or log term is needed in the model.

Modeling the Natural Occurrence of Selected Dipterocarp Genera in Sarawak, Borneo

  • Teo, Stephen;Phua, Mui-How
    • Journal of Forest and Environmental Science
    • /
    • v.28 no.3
    • /
    • pp.170-178
    • /
    • 2012
  • Dipterocarps or Dipterocarpaceae is a commercially important timber producing and dominant keystone tree family in the rain forests of Borneo. Borneo's landscape is changing at an unprecedented rate in recent years which affects this important biodiversity. This paper attempts to model the natural occurrence (distribution including those areas with natural forests before being converted to other land uses as opposed to current distribution) of dipterocarp species in Sarawak which is important for forest biodiversity conservation and management. Local modeling method of Inverse Distance Weighting was compared with commonly used statistical method (Binary Logistic Regression) to build the best natural distribution models for three genera (12 species) of dipterocarps. Database of species occurrence data and pseudoabsence data were constructed and divided into two halves for model building and validation. For logistic regression modeling, climatic, topographical and edaphic parameters were used. Proxy variables were used to represent the parameters which were highly (p>0.75) correlated to avoid over-fitting. The results show that Inverse Distance Weighting produced the best and consistent prediction with an average accuracy of over 80%. This study demonstrates that local interpolation method can be used for the modeling of natural distribution of dipterocarp species. The Inverse Distance Weighted was proven a better method and the possible reasons are discussed.

A Comparative Study on Prediction Performance of the Bankruptcy Prediction Models for General Contractors in Korea Construction Industry

  • Seung-Kyu Yoo;Jae-Kyu Choi;Ju-Hyung Kim;Jae-Jun Kim
    • International conference on construction engineering and project management
    • /
    • 2011.02a
    • /
    • pp.432-438
    • /
    • 2011
  • The purpose of the present thesis is to develop bankruptcy prediction models capable of being applied to the Korean construction industry and to deduce an optimal model through comparative evaluation of final developed models. A study population was selected as general contractors in the Korean construction industry. In order to ease the sample securing and reliability of data, it was limited to general contractors receiving external audit from the government. The study samples are divided into a bankrupt company group and a non-bankrupt company group. The bankruptcy, insolvency, declaration of insolvency, workout and corporate reorganization were used as selection criteria of a bankrupt company. A company that is not included in the selection criteria of the bankrupt company group was selected as a non-bankrupt company. Accordingly, the study sample is composed of a total of 112 samples and is composed of 48 bankrupt companies and 64 non-bankrupt companies. A financial ratio was used as early predictors for development of an estimation model. A total of 90 financial ratios were used and were divided into growth, profitability, productivity and added value. The MDA (Multivariate Discriminant Analysis) model and BLRA (Binary Logistic Regression Analysis) model were used for development of bankruptcy prediction models. The MDA model is an analysis method often used in the past bankruptcy prediction literature, and the BLRA is an analysis method capable of avoiding equal variance assumption. The stepwise (MDA) and forward stepwise method (BLRA) were used for selection of predictor variables in case of model construction. Twenty two variables were finally used in MDA and BLRA models according to timing of bankruptcy. The ROC-Curve Analysis and Classification Analysis were used for analysis of prediction performance of estimation models. The correct classification rate of an individual bankruptcy prediction model is as follows: 1) one year ago before the event of bankruptcy (MDA: 83.04%, BLRA: 93.75%); 2) two years ago before the event of bankruptcy (MDA: 77.68%, BLRA: 78.57%); 3) 3 years ago before the event of bankruptcy (MDA: 84.82%, BLRA: 91.96%). The AUC (Area Under Curve) of an individual bankruptcy prediction model is as follows. : 1) one year ago before the event of bankruptcy (MDA: 0.933, BLRA: 0.978); 2) two years ago before the event of bankruptcy (MDA: 0.852, BLRA: 0.875); 3) 3 years ago before the event of bankruptcy (MDA: 0.938, BLRA: 0.975). As a result of the present research, accuracy of the BLRA model is higher than the MDA model and its prediction performance is improved.

  • PDF

A Study on the Change of Quality in a Residential Sector of Single Person Households in Seoul during the COVID-19: Analyze Variable Importance and Causality with Artificial Neural Networks and Logistic Regression Analysis (서울시 1인 가구의 코로나 19 전후 주거의 질 변화 연구: 인공신 경망과 로지스틱 회귀모형을 활용한 변수 중요도 및 인과관계 분석)

  • Jaebin, Lim;Kiseong, Jeong
    • Land and Housing Review
    • /
    • v.14 no.1
    • /
    • pp.67-82
    • /
    • 2023
  • Using the Artificial Neural Network model and Binary Logistic Regression model, this study investigates influence factors on the quality of life in terms of housing environment during the COVID-19 in Seoul. The results show that the lower the satisfaction level of housing policy, the lower the quality of life in the employment field and the lower the quality of residential field. On the other hand, permanent workers and self-employed respondents have experienced improvement in residential quality during the pandemic. A limitation of this study is associated with disentangling the causal relationship using the 'black box' characteristics of ANN method.

Exploring the Predictive Factors of Passing the Korean Physical Therapist Licensing Examination (한국 물리치료사 국가 면허시험 합격 여부의 예측요인 탐색)

  • Kim, So-Hyun;Cho, Sung-Hyoun
    • Journal of The Korean Society of Integrative Medicine
    • /
    • v.10 no.3
    • /
    • pp.107-117
    • /
    • 2022
  • Purpose : The purpose of this study was to establish a model of the predictive factors for success or failure of examinees undertaking the Korean physical therapist licensing examination (KPTLE). Additionally, we assessed the pass/fail cut-off point. Methods : We analyzed the results of 10,881 examinees who undertook the KPTLE, using data provided by the Korea Health Personnel Licensing Examination Institute. The target variable was the test result (pass or fail), and the input variables were: sex, age, test subject, and total score. Frequency analysis, chi-square test, descriptive statistics, independent t-test, correlation analysis, binary logistic regression, and receiver operating characteristic (ROC) curve analyses were performed on the data. Results : Sex and age were not significant predictors of attaining a pass (p>.05). The test subjects with the highest probability of passing were, in order, medical regulation (MR) (Odds ratio (OR)=2.91, p<.001), foundations of physical therapy (FPT) (OR=2.86, p<.001), diagnosis and evaluation for physical therapy (DEPT) (OR=2.74, p<.001), physical therapy intervention (PTI) (OR=2.66, p<.001), and practical examination (PE) (OR=1.24, p<.001). The cut-off points for each subject were: FPT, 32.50; DEPT, 29.50; PTI, 44.50; MR, 14.50; and PE, 50.50. The total score (TS) was 164.50. The sensitivity, specificity, and the classification accuracy of the prediction model was 99 %, 98 %, and 99 %, respectively, indicating high accuracy. Area under the curve (AUC) values for each subject were: FPT, .958; DEPT, .968; PTI, .984; MR, .885; PE, .962; and TS, .998, indicating a high degree of fit. Conclusion : In our study, the predictive factors for passing KPTLE were identified, and the optimal cut-off point was calculated for each subject. Logistic regression was adequate to explain the predictive model. These results will provide universities and examinees with useful information for predicting their success or failure in the KPTLE.

A Study on Decision Factors Affecting Utilization of Elderly Welfare Center: Focus on Gimpo City (노인복지관 이용 결정요인에 관한 연구: 김포시 노인을 중심으로)

  • Won, Il;Kim, Keunhong;Kim, SungHyun
    • 한국노년학
    • /
    • v.38 no.2
    • /
    • pp.351-364
    • /
    • 2018
  • The purpose of this study is to learn about the decision factors affecting utilization of elderly welfare center of the elderly living in Gimpo city. The reason of the study is that the elderly welfare center as a provider of general welfare services could not only thinking about the state policy but also need to consider about the inherent role and function of the elderly. Especially for these elders living in rural areas, although the number of elderly welfare centers of the whole country has greatly increased in last 10 years, the effect and function of the facility are almost the same and they are still lack of leisure activities. This issue become a serious problem nowadays. For the above reasons, this article conducts a social survey of 360 elderly people over the age of 65 who lives in the Gimpo city which is a rural-urban type city. The research method is to examine the relationship between the predisposing factors, enabling factors and need factors of Andersen's behavior model with binary logistic regression analysis and the decision tree analysis. The result of binary logistic regression shows the most of factors of Andersen's model is significant. The factors of age, gender, education level in predisposing factors; monthly income in enabling factors and the reserve for old life, the preparation of economic activity for old life in need factors are significant. Then the result of decision tree analysis shows the interaction between factors; when the education level in predisposing factors is higher, the possibility of using of elderly welfare center becomes bigger. Also as the level of healthy promoting preparation in the need factors gets lower, the possibility of using of elderly welfare center still becomes bigger. Although differences were found in the interpretation of the results of regression analysis and decision tree analysis, the results of this study can still provide support for the necessity of elderly welfare centers providing integrated welfare services.

A Dynamic Shortest Path Finding Model using Hierarchical Road Networks (도로 위계 구조를 고려한 동적 최적경로 탐색 기법개발)

  • Kim, Beom-Il;Lee, Seung-Jae
    • Journal of Korean Society of Transportation
    • /
    • v.23 no.6 s.84
    • /
    • pp.91-102
    • /
    • 2005
  • When it comes to the process of information storage, people are likely to organize individual information into the forms of groups rather than independent attributes, and put them together in their brains. Likewise, in case of finding the shortest path, this study suggests that a Hierarchical Road Network(HRN) model should be selected to browse the most desirable route, since the HRN model takes the process mentioned above into account. Moreover, most of drivers make a decision to select a route from origin to destination by road hierarchy. It says that the drivers feel difference between the link travel tine which was measured by driving and the theoretical link travel time. There is a different solution which has predicted the link travel time to solve this problem. By using this solution, the link travel time is predicted based on link conditions from time to time. The predicated link travel time is used to search the shortest path. Stochastic Process model uses the historical patterns of travel time conditions on links. The HRN model has compared favorably with the conventional shortest path finding model in tern of calculated speeds. Even more, the result of the shortest path using the HRN model has more similar to the survey results which was conducted to the taxi drivers. Taxi drivers have a strong knowledge of road conditions on the road networks and they are more likely to select a shortest path according to the real common sense.

Assessment of Freeway Crash Risk using Probe Vehicle Accelerometer (프로브차량 가속도센서를 이용한 고속도로 교통사고 위험도 평가기법)

  • Park, Jae-Hong;Oh, Cheol;Kang, Kyeong-Pyo
    • International Journal of Highway Engineering
    • /
    • v.13 no.2
    • /
    • pp.49-56
    • /
    • 2011
  • Understanding various casual factors affecting the occurrence of freeway traffic crash is a backbone of deriving effective countermeasures. The first step toward understanding such factors is to identify crash risks on freeways. Unlike existing studies, this study focused on the unsafe vehicle maneuvering that can be detected by in-vehicle sensors. The recent advancement of sensor technologies allows us to gather and analyze detailed microscopic events leading to crash occurrence such as the abrupt change in acceleration. This study used an accelerometer to capture the unsafe events. A set of candidate variables representing unsafe events were derived from analyzing acceleration data obtained by the accelerometer. Then, the crash risk was modeled by the binary logistic regression technique. The probabilistic outcome of crash risk can be provided by the proposed model. An application of the methodology assessing crash risk was presented, and further research items for the successful field implementation were also discussed.

Methodology for Determining Delineator Placement and Operation Based on User's Satisfaction (이용자 만족도를 고려한 델리네이터 설치 및 운용 방법론에 관한 연구)

  • Park, Jae-Hong;Oh, Cheol;Kim, Young-Gul
    • International Journal of Highway Engineering
    • /
    • v.12 no.1
    • /
    • pp.39-46
    • /
    • 2010
  • Delineator is a useful device to support driver's safer maneuver. Effective placement and operation of the delineator would lead to prevent traffic accidents on the roads. This study evaluates the effectiveness of parameters associated with delineator placement and operation, which include spacing, height and size, from the point of user's satisfaction. Also, this study devises a methodology for determining such parameters using binary logistic regression technique. The proposed model is capable of producing probabilistic measure of user's satisfaction according to the various parameters. The outcome of this study would be useful fundamentals for more effective placement and operation of delineators.

Expression of p53 Breast Cancer in Kurdish Women in the West of Iran: a Reverse Correlation with Lymph Node Metastasis

  • Payandeh, Mehrdad;Sadeghi, Masoud;Sadeghi, Edris;Madani, Seyed-Hamid
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.17 no.3
    • /
    • pp.1261-1264
    • /
    • 2016
  • Background: In breast cancer (BC), it has been suggested that nuclear overexpression of p53 protein might be an indicator of poor prognosis. The aim of the current study was to evaluate the expression of p53 BC in Kurdish women from the West of Iran and its correlation with other clinicopathology figures. Materials and Methods: In the present retrospective study, 231 patients were investigated for estrogen receptor (ER) and progesterone receptor (PR) positivity, defined as ${\geq}10%$ positive tumor cells with nuclear staining. A binary logistic regression model was selected using Akaike Information Criteria (AIC) in stepwise selection for determination of important factors. Results: ER, PR, the human epidermal growth factor receptor 2 (HER2) and p53 were positive in 58.4%, 55.4%, 59.7% and 45% of cases, respectively. Ki67 index was divided into two groups: 54.5% had Ki67<20% and 45.5% had Ki67 ${\geq}20%$. Of 214 patients, 137(64%) had lymph node metastasis and of 186 patients, 122(65.6%) had vascular invasion. Binary logistic regression analysis showed that there was inverse significant correlation between lymph node metastasis (P=0.008, OR 0.120 and 95%CI 0.025-0.574), ER status (P=0.006, OR 0.080, 95%CI 0.014-0.477) and a direct correlation between HER2 (P=005, OR 3.047, 95%CI 1.407-6.599) with the expression of p53. Conclusions: As in a number of studies, expression of p53 had a inverse correlation with lymph node metastasis and ER status and also a direct correlation with HER2 status. Also, p53-positivity is more likely in triple negative BC compared to other subtypes.