• 제목/요약/키워드: logistic regression model

검색결과 1,517건 처리시간 0.032초

로지스틱모형에서 그래픽을 이용한 회귀와 모형평가 (Graphical regression and model assessment in logistic model)

  • 강명욱;김부용;홍주희
    • Journal of the Korean Data and Information Science Society
    • /
    • 제21권1호
    • /
    • pp.21-32
    • /
    • 2010
  • 그래픽적 회귀는 모형에 대한 가정을 하지 않고 회귀정보를 모두 포함하는 충분요약그림을 찾아내는 분석 방법으로 모든 회귀정보를 저차원의 그림으로 표현할 수 있게 하는 데에 그 목적이 있다. 잔차산점도를 이용한 모형의 평가는 적용 범위가 선형회귀모형에 국한되는 문제점이 있기 때문에 일반화선형모형에서는 그 대안으로 주변모형 산점도를 이용하여 모형의 적절성을 평가한다. 본 논문에서는 일반화선형모형 중에서 이진반응변수를 갖는 로지스틱모형에서의 그래픽적 회귀 방법과 주변모형 산점도를 이용한 모형평가 방법을 알아본다.

Prediction of Galloping Accidents in Power Transmission Line Using Logistic Regression Analysis

  • Lee, Junghoon;Jung, Ho-Yeon;Koo, J.R.;Yoon, Yoonjin;Jung, Hyung-Jo
    • Journal of Electrical Engineering and Technology
    • /
    • 제12권2호
    • /
    • pp.969-980
    • /
    • 2017
  • Galloping is one of the most serious vibration problems in transmission lines. Power lines can be extensively damaged owing to aerodynamic instabilities caused by ice accretion. In this study, the accident probability induced by galloping phenomenon was analyzed using logistic regression analysis. As former studies have generally concluded, main factors considered were local weather factors and physical factors of power delivery systems. Since the number of transmission towers outnumbers the number of weather observatories, interpolation of weather factors, Kriging to be more specific, has been conducted in prior to forming galloping accident estimation model. Physical factors have been provided by Korea Electric Power Corporation, however because of the large number of explanatory variables, variable selection has been conducted, leaving total 11 variables. Before forming estimation model, with 84 provided galloping cases, 840 non-galloped cases were chosen out of 13 billion cases. Prediction model for accidents by galloping has been formed with logistic regression model and validated with 4-fold validation method, corresponding AUC value of ROC curve has been used to assess the discrimination level of estimation models. As the result, logistic regression analysis effectively discriminated the power lines that experienced galloping accidents from those that did not.

로지스틱 회귀분석과 퍼지 기법을 이용한 산사태 취약성 지도작성: 보은군을 대상으로 (Landslide susceptibility mapping using Logistic Regression and Fuzzy Set model at the Boeun Area, Korea)

  • 알-마문;장동호
    • 한국지형학회지
    • /
    • 제23권2호
    • /
    • pp.109-125
    • /
    • 2016
  • This study aims to identify the landslide susceptible zones of Boeun area and provide reliable landslide susceptibility maps by applying different modeling methods. Aerial photographs and field survey on the Boeun area identified landslide inventory map that consists of 388 landslide locations. A total ofseven landslide causative factors (elevation, slope angle, slope aspect, geology, soil, forest and land-use) were extracted from the database and then converted into raster. Landslide causative factors were provided to investigate about the spatial relationship between each factor and landslide occurrence by using fuzzy set and logistic regression model. Fuzzy membership value and logistic regression coefficient were employed to determine each factor's rating for landslide susceptibility mapping. Then, the landslide susceptibility maps were compared and validated by cross validation technique. In the cross validation process, 50% of observed landslides were selected randomly by Excel and two success rate curves (SRC) were generated for each landslide susceptibility map. The result demonstrates the 84.34% and 83.29% accuracy ratio for logistic regression model and fuzzy set model respectively. It means that both models were very reliable and reasonable methods for landslide susceptibility analysis.

공간통합 모델을 적용한 암괴류 및 애추 지형 분포가능지 추출 (Extraction of Potential Area for Block Stream and Talus Using Spatial Integration Model)

  • 이성호;장동호
    • 한국지형학회지
    • /
    • 제26권2호
    • /
    • pp.1-14
    • /
    • 2019
  • This study analyzed the relativity between block stream and talus distributions by employing a likelihood ratio approach. Possible distribution sites for each debris slope landform were extracted by applying a spatial integration model, in which we combined fuzzy set model, Bayesian predictive model, and logistic regression model. Moreover, to verify model performance, a success rate curve was prepared by cross-validation. The results showed that elevation, slope, curvature, topographic wetness index, geology, soil drainage, and soil depth were closely related to the debris slope landform sites. In addition, all spatial integration models displayed an accuracy of over 90%. The accuracy of the distribution potential area map of the block stream was highest in the logistic regression model (93.79%). Eventually, the accuracy of the distribution potential area map of the talus was also highest in the logistic regression model (97.02%). We expect that the present results will provide essential data and propose methodologies to improve the performance of efficient and systematic micro-landform studies. Moreover, our research will potentially help to enhance field research and topographic resource management.

An Analysis of Factors Relating to Agricultural Machinery Farm-Work Accidents Using Logistic Regression

  • Kim, Byounggap;Yum, Sunghyun;Kim, Yu-Yong;Yun, Namkyu;Shin, Seung-Yeoub;You, Seokcheol
    • Journal of Biosystems Engineering
    • /
    • 제39권3호
    • /
    • pp.151-157
    • /
    • 2014
  • Purpose: In order to develop strategies to prevent farm-work accidents relating to agricultural machinery, influential factors were examined in this paper. The effects of these factors were quantified using logistic regression. Methods: Based on the results of a survey on farm-work accidents conducted by the National Academy of Agricultural Science, 21 tentative independent variables were selected. To apply these variables to regression, the presence of multicollinearity was examined by comparing correlation coefficients, checking the statistical significance of the coefficients in a simple linear regression model, and calculating the variance inflation factor. A logistic regression model and determination method of its goodness of fit was defined. Results: Among 21 independent variables, 13 variables were not collinear each other. The results of a logistic regression analysis using these variables showed that the model was significant and acceptable, with deviance of 714.053. Parameter estimation results showed that four variables (age, power tiller ownership, cognizance of the government's safety policy, and consciousness of safety) were significant. The logistic regression model predicted that the former two increased accident odds by 1.027 and 8.506 times, respectively, while the latter two decreased the odds by 0.243 and 0.545 times, respectively. Conclusions: Prevention strategies against factors causing an accident, such as the age of farmers and the use of a power tiller, are necessary. In addition, more efficient trainings to elevate the farmer's consciousness about safety must be provided.

로지스틱 회귀 모형을 이용한 무선인터넷 콘텐츠 서비스의 life cycle 분석 및 예측 (A Study on Life Cycle analysis and prediction of Contents Service in the Wireless Internet)

  • 박지홍;전준현
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2005년도 추계종합학술대회
    • /
    • pp.1161-1164
    • /
    • 2005
  • In this paper, we proposed the technique to estimate the life cycle of Internet content services based on the logistic regression model. In this paper, to define parameters of Internet contents estimating life cycle by logistic regression model, we used market size, traffic amount, page view and session-visit number as the parameters of Internet contents estimating life cycle by logistic regression model. In this paper, to compare the performance of our proposed scheme, we estimated life cycle for the download services of bell sound & character contents in mobile network. As a result, using our proposed logistic regression, we were able to estimate exactly the life cycle of the download services of bell sound & character contents.

  • PDF

단계별 비행훈련 성패 예측 모형의 성능 비교 연구 (Comparison of Classification Models for Sequential Flight Test Results)

  • 손소영;조용관;최성옥;김영준
    • 대한인간공학회지
    • /
    • 제21권1호
    • /
    • pp.1-14
    • /
    • 2002
  • The main purpose of this paper is to present selection criteria for ROK Airforce pilot training candidates in order to save costs involved in sequential pilot training. We use classification models such Decision Tree, Logistic Regression and Neural Network based on aptitude test results of 288 ROK Air Force applicants in 1994-1996. Different models are compared in terms of classification accuracy, ROC and Lift-value. Neural network is evaluated as the best model for each sequential flight test result while Logistic regression model outperforms the rest of them for discriminating the last flight test result. Therefore we suggest a pilot selection criterion based on this logistic regression. Overall. we find that the factors such as Attention Sharing, Speed Tracking, Machine Comprehension and Instrument Reading Ability having significant effects on the flight results. We expect that the use of our criteria can increase the effectiveness of flight resources.

로지스틱회귀에서 잔차산점도를 이용한 모형평가 (Model assessment with residual plot in logistic regression)

  • 강명욱
    • Journal of the Korean Data and Information Science Society
    • /
    • 제26권1호
    • /
    • pp.141-150
    • /
    • 2015
  • 로지스틱회귀에서 모형을 평가하거나 진단할 때 가설검정이 주로 사용되지만 이것만으로는 놓칠 수 있는 부분이 많고 이에 대한 보완을 위하여 그래픽적 방법의 사용이 요구된다. 그래프를 이용한 모형의 적절성 평가를 위한 도구로 잔차산점도가 널리 이용되고 있으나 적용 범위가 선형회귀에 국한되는 문제점이 있다. 해결 방안으로 주변모형산점도를 이용하여 모형의 적절성을 평가하는 방법이 있으나 역시 문제점을 가지고 있다. 본 논문에서는 주변모형산점도의 대안으로 카이잔차산점도를 제안하고 그 효용성을 알아본다.

Nonlinear Regression on Cold Tolerance Data for Brassica Napus

  • Yang, Woohyeong;Choi, Myeong Seok;Ahn, Sung Jin
    • Journal of the Korean Data Analysis Society
    • /
    • 제20권6호
    • /
    • pp.2721-2731
    • /
    • 2018
  • This study purposes to derive the predictive model for the cold tolerance of Brassica napus, using the data collected in the Tree Breeding Lab of Gyeongsang National University during July and August of 2016. Three Brassica napus samples were treated at each of low temperatures from $4^{\circ}C$ to $-12^{\circ}C$ by decrement of $4^{\circ}C$, step by step, and electrolyte leakage levels were measured at each stage. Electrolyte leakages were observed tangibly from $-4^{\circ}C$. We tried to fit the six nonlinear regression models to the electrolyte leakage data of Brassica napus: 3-parameter logistic model, baseline logistic model, 4-parameter logistic model, (4-1)-parameter logistic model, 3-parameter Gompertz model, and (3-1)-parameter Gompertz model. The baseline levels of the electrolyte leakage estimated by these models were 4.81%, 4.07%, 4.19%, 4.07%, 4.55%, and 0%, respectively. The estimated median lethal temperature, LT50, were $-5.87^{\circ}C$, $-6.31^{\circ}C$, $-6.05^{\circ}C$, $-6.35^{\circ}C$, $-4.98^{\circ}C$, and $-5.15^{\circ}C$, respectively. We compared and discussed the measures of goodness of fit to select the appropriate nonlinear regression model.

2007년 한국프로야구에서 도루성공모형 (Steal Success Model for 2007 Korean Professional Baseball Games)

  • 홍종선;최정민
    • 응용통계연구
    • /
    • 제21권3호
    • /
    • pp.455-468
    • /
    • 2008
  • 야구경기의 승패에 영향을 미치는 중요한 요인으로 간주되는 도루의 성공모형을 개발하기 위하여 2007년 한국프로야구 기록자료를 바탕으로 로지스틱 회귀모형들을 제안한다. 또한 한국프로야구의 도루성공과 실패에 대해 판별분석을 실시하고 분류 기준값을 결정하였으며, 판별분석 분류표를 이용해 로지스틱 회귀분석과 판별분석의 효율성을 비교한다. 전체적인 모형의 정확도는 로지스틱 회귀모형이 판별분석보다 더 좋은 것으로 나타났고, 연속형 자료를 범주형으로 변환한 자료에 대한 로지스틱 회귀모형도 유사한 효율성을 갖고있다.