• Title/Summary/Keyword: 이항 로지스틱

Search Result 125, Processing Time 0.023 seconds

Logistic Regressions with Sensory Evaluation Data about Hanwoo Steer Beef (한우 거세우 고기 관능평가 데이터의 로지스틱 회귀분석)

  • Lee, Hye-Jung;Kim, Jae-Hee
    • The Korean Journal of Applied Statistics
    • /
    • v.23 no.5
    • /
    • pp.857-870
    • /
    • 2010
  • This study was conducted to investigate the relationship between the socio-demographic factors and the Korean consumers palatability evaluation grades with Hanwoo sensory evaluation data from 2006 to 2008 by National Institute of Animal Science. The dichotomy logistic regression model and the multinomial logistic regression model are fitted with the independent variables such as the consumer living location, age, gender occupation, monthly income, beef cut and the the palatability grade as the categorical dependent variable and tenderness, 리avor and juiciness as the continuous dependent variable. Stepwise variable selection procedure is incorporated to find the final model and odds ratios are calculated to nd the associations between categories.

Assessment of Freeway Crash Risk using Probe Vehicle Accelerometer (프로브차량 가속도센서를 이용한 고속도로 교통사고 위험도 평가기법)

  • Park, Jae-Hong;Oh, Cheol;Kang, Kyeong-Pyo
    • International Journal of Highway Engineering
    • /
    • v.13 no.2
    • /
    • pp.49-56
    • /
    • 2011
  • Understanding various casual factors affecting the occurrence of freeway traffic crash is a backbone of deriving effective countermeasures. The first step toward understanding such factors is to identify crash risks on freeways. Unlike existing studies, this study focused on the unsafe vehicle maneuvering that can be detected by in-vehicle sensors. The recent advancement of sensor technologies allows us to gather and analyze detailed microscopic events leading to crash occurrence such as the abrupt change in acceleration. This study used an accelerometer to capture the unsafe events. A set of candidate variables representing unsafe events were derived from analyzing acceleration data obtained by the accelerometer. Then, the crash risk was modeled by the binary logistic regression technique. The probabilistic outcome of crash risk can be provided by the proposed model. An application of the methodology assessing crash risk was presented, and further research items for the successful field implementation were also discussed.

Characteristics and Influencing Factors of Red Light Running (RLR) Crashes (신호위반사고의 특성과 영향요인 분석)

  • Park, Jeong Soon;Jung, Yong Il;Kim, Yun Hwan
    • Journal of Korean Society of Transportation
    • /
    • v.32 no.3
    • /
    • pp.198-206
    • /
    • 2014
  • According to the statistics of the National Police Agency, red light running (RLR) crashes represent a significant safety issue throughout Korea. This study deals with the RLR crashes occurred at signalized intersections in Cheongju. The objectives of this study are to comparatively analyze the characteristics of between RLR crashes and the Non-RLR crashes, and to find out factors using a Binary Logistic Regression(BLR) model. In pursuing the above, the study gives particular attentions to testing the differences between the above two groups with the data of 2,246 RLR/ 3,884 Non-RLR crashes (2007-2011). The main results are as follows. First, many RLR crashes were occurred in the nighttime and in going straight. Second, the difference between RLR and Non-RLR crashes were clearly defined by crash type, maneuver of vehicle before crash, age of driver (30s, 50s), alcohol use and accident pattern. Finally, a statistically significant model (Hosmer and Lemeshow test : 7.052, p-value : 0.531) was developed through the BLR model.

Analysis of Traffic Crash Severity on Freeway Using Hierarchical Binomial Logistic Model (계층 이항 로지스틱모형에 의한 고속도로 교통사고 심각도 분석)

  • Mun, Sung-Ra;Lee, Young-Ihn
    • International Journal of Highway Engineering
    • /
    • v.13 no.4
    • /
    • pp.199-209
    • /
    • 2011
  • In the study of traffic safety, the analysis on factors affecting crash severity and the understanding about their relationship is important to be planning and execute to improve safety of road and traffic facilities. The purpose of this study is to develop a hierarchical binomial logistic model to identify the significant factors affecting fatal injuries and vehicle damages of traffic crashes on freeway. Two models on death and total vehicle damage are developed. The hierarchical structure of response variable is composed of two level, crash-occupant and crash-vehicle. As a result, we have gotten the crash-level random effect from these hierarchical structure as well as the fixed effect of covariates, namely odds ratio. The crash on the main line and in-out section have greater damage than other facilities. Injuries and vehicle damages are severe in case of traffic violations, centerline invasion and speeding. Also, collision crash and fire occurrence is more severe damaged than other crash types. The surrounding environment of surface conditions by climate and visibility conditions by day and night is a significant factor on crash occurrence. On the orher hand, the geometric condition of road isn't.

Cost Performance Evaluation Framework through Analysis of Unstructured Construction Supervision Documents using Binomial Logistic Regression (비정형 공사감리문서 정보와 이항 로지스틱 회귀분석을 이용한 건축 현장 비용성과 평가 프레임워크 개발)

  • Kim, Chang-Won;Song, Taegeun;Lee, Kiseok;Yoo, Wi Sung
    • Journal of the Korea Institute of Building Construction
    • /
    • v.24 no.1
    • /
    • pp.121-131
    • /
    • 2024
  • This research explores the potential of leveraging unstructured data from construction supervision documents, which contain detailed inspection insights from independent third-party monitors of building construction processes. With the evolution of analytical methodologies, such unstructured data has been recognized as a valuable source of information, offering diverse insights. The study introduces a framework designed to assess cost performance by applying advanced analytical methods to the unstructured data found in final construction supervision reports. Specifically, key phrases were identified using text mining and social network analysis techniques, and these phrases were then analyzed through binomial logistic regression to assess cost performance. The study found that predictions of cost performance based on unstructured data from supervision documents achieved an accuracy rate of approximately 73%. The findings of this research are anticipated to serve as a foundational resource for analyzing various forms of unstructured data generated within the construction sector in future projects.

Model assessment with residual plot in logistic regression (로지스틱회귀에서 잔차산점도를 이용한 모형평가)

  • Kahng, Myung Wook
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.1
    • /
    • pp.141-150
    • /
    • 2015
  • Graphical paradigms for assessing the adequacy of models in logistic regression are discussed. The residual plot has been widely used as a graphical tool for evaluating the adequacy of the model. However, this approach works well only for linear models with constant variance, and the alternative approach, the marginal model plot, has its defects as well. We suggest a Chi-residual plot that overcomes the potential shortcomings of the marginal model plot.

Exploring interaction using 3-D residual plots in logistic regression model (3차원 잔차산점도를 이용한 로지스틱회귀모형에서 교호작용의 탐색)

  • Kahng, Myung-Wook
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.1
    • /
    • pp.177-185
    • /
    • 2014
  • Under bivariate normal distribution assumptions, the interaction and quadratic terms are needed in the logistic regression model with two predictors. However, depending on the correlation coefficient and the variances of two conditional distributions, the interaction and quadratic terms may not be necessary. Although the need for these terms can be determined by comparing the two scatter plots, it is not as useful for interaction terms. We explore the structure and usefulness of the 3-D residual plot as a tool for dealing with interaction in logistic regression models. If predictors have an interaction effect, a 3-D residual plot can show the effect. This is illustrated by simulated and real data.

Log-density Ratio with Two Predictors in a Logistic Regression Model (로지스틱 회귀모형에서 이변량 정규분포에 근거한 로그-밀도비)

  • Kahng, Myung Wook;Yoon, Jae Eun
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.1
    • /
    • pp.141-149
    • /
    • 2013
  • We present methods for studying the log-density ratio that enables the selection of the predictors and the form to be included in the logistic regression model. Under bivariate normal distributional assumptions, we investigate the form of the log-density ratio as a function of two predictors. If two covariance matrices are equal, then the crossproduct and quadratic terms are not needed. If the variables are uncorrelated, we do not need the crossproduct terms, but we still need the linear and quadratic terms. We also explore other conditions in which the crossproduct and quadratic terms are not needed in the logistic regression model.

Comparison of Bias Correction Methods for the Rare Event Logistic Regression (희귀 사건 로지스틱 회귀분석을 위한 편의 수정 방법 비교 연구)

  • Kim, Hyungwoo;Ko, Taeseok;Park, No-Wook;Lee, Woojoo
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.2
    • /
    • pp.277-290
    • /
    • 2014
  • We analyzed binary landslide data from the Boeun area with logistic regression. Since the number of landslide occurrences is only 9 out of 5000 observations, this can be regarded as a rare event data. The main issue of logistic regression with the rare event data is a serious bias problem in regression coefficient estimates. Two bias correction methods were proposed before and we quantitatively compared them via simulation. Firth (1993)'s approach outperformed and provided the most stable results for analyzing the rare-event binary data.

Penalized logistic regression models for determining the discharge of dyspnea patients (호흡곤란 환자 퇴원 결정을 위한 벌점 로지스틱 회귀모형)

  • Park, Cheolyong;Kye, Myo Jin
    • Journal of the Korean Data and Information Science Society
    • /
    • v.24 no.1
    • /
    • pp.125-133
    • /
    • 2013
  • In this paper, penalized binary logistic regression models are employed as statistical models for determining the discharge of 668 patients with a chief complaint of dyspnea based on 11 blood tests results. Specifically, the ridge model based on $L^2$ penalty and the Lasso model based on $L^1$ penalty are considered in this paper. In the comparison of prediction accuracy, our models are compared with the logistic regression models with all 11 explanatory variables and the selected variables by variable selection method. The results show that the prediction accuracy of the ridge logistic regression model is the best among 4 models based on 10-fold cross-validation.