• 제목/요약/키워드: Logistic Regression (LR)

검색결과 53건 처리시간 0.019초

An Introduction to Logistic Regression: From Basic Concepts to Interpretation with Particular Attention to Nursing Domain

  • Park, Hyeoun-Ae
    • 대한간호학회지
    • /
    • 제43권2호
    • /
    • pp.154-164
    • /
    • 2013
  • Purpose: The purpose of this article is twofold: 1) introducing logistic regression (LR), a multivariable method for modeling the relationship between multiple independent variables and a categorical dependent variable, and 2) examining use and reporting of LR in the nursing literature. Methods: Text books on LR and research articles employing LR as main statistical analysis were reviewed. Twenty-three articles published between 2010 and 2011 in the Journal of Korean Academy of Nursing were analyzed for proper use and reporting of LR models. Results: Logistic regression from basic concepts such as odds, odds ratio, logit transformation and logistic curve, assumption, fitting, reporting and interpreting to cautions were presented. Substantial shortcomings were found in both use of LR and reporting of results. For many studies, sample size was not sufficiently large to call into question the accuracy of the regression model. Additionally, only one study reported validation analysis. Conclusion: Nursing researchers need to pay greater attention to guidelines concerning the use and reporting of LR models.

Evaluating seismic liquefaction potential using multivariate adaptive regression splines and logistic regression

  • Zhang, Wengang;Goh, Anthony T.C.
    • Geomechanics and Engineering
    • /
    • 제10권3호
    • /
    • pp.269-284
    • /
    • 2016
  • Simplified techniques based on in situ testing methods are commonly used to assess seismic liquefaction potential. Many of these simplified methods were developed by analyzing liquefaction case histories from which the liquefaction boundary (limit state) separating two categories (the occurrence or non-occurrence of liquefaction) is determined. As the liquefaction classification problem is highly nonlinear in nature, it is difficult to develop a comprehensive model using conventional modeling techniques that take into consideration all the independent variables, such as the seismic and soil properties. In this study, a modification of the Multivariate Adaptive Regression Splines (MARS) approach based on Logistic Regression (LR) LR_MARS is used to evaluate seismic liquefaction potential based on actual field records. Three different LR_MARS models were used to analyze three different field liquefaction databases and the results are compared with the neural network approaches. The developed spline functions and the limit state functions obtained reveal that the LR_MARS models can capture and describe the intrinsic, complex relationship between seismic parameters, soil parameters, and the liquefaction potential without having to make any assumptions about the underlying relationship between the various variables. Considering its computational efficiency, simplicity of interpretation, predictive accuracy, its data-driven and adaptive nature and its ability to map the interaction between variables, the use of LR_MARS model in assessing seismic liquefaction potential is promising.

A CTR Prediction Approach for Text Advertising Based on the SAE-LR Deep Neural Network

  • Jiang, Zilong;Gao, Shu;Dai, Wei
    • Journal of Information Processing Systems
    • /
    • 제13권5호
    • /
    • pp.1052-1070
    • /
    • 2017
  • For the autoencoder (AE) implemented as a construction component, this paper uses the method of greedy layer-by-layer pre-training without supervision to construct the stacked autoencoder (SAE) to extract the abstract features of the original input data, which is regarded as the input of the logistic regression (LR) model, after which the click-through rate (CTR) of the user to the advertisement under the contextual environment can be obtained. These experiments show that, compared with the usual logistic regression model and support vector regression model used in the field of predicting the advertising CTR in the industry, the SAE-LR model has a relatively large promotion in the AUC value. Based on the improvement of accuracy of advertising CTR prediction, the enterprises can accurately understand and have cognition for the needs of their customers, which promotes the multi-path development with high efficiency and low cost under the condition of internet finance.

A Comparison of Urban Growth Probability Maps using Frequency Ratio and Logistic Regression Methods

  • Park, So-Young;Jin, Cheung-Kil;Kim, Shin-Yup;Jo, Gyung-Cheol;Choi, Chul-Uong
    • 한국조경학회지
    • /
    • 제38권5_2호
    • /
    • pp.194-205
    • /
    • 2010
  • To predict urban growth according to changes in landcover, probability factors werecal culated and mapped. Topographic, geographic and social and political factors were used as prediction variables for constructing probability maps of urban growth. Urban growth-related factors included elevation, slope, aspect, distance from road,road ratio, distance from the main city, land cover, environmental rating and legislative rating. Accounting for these factors, probability maps of urban growth were constr uctedusing frequency ratio (FR) and logistic regression (LR) methods and the effectiveness of the results was verified by the relative operating characteristic (ROC). ROC values of the urban growth probability index (UGPI) maps by the FR and LR models were 0.937 and 0.940, respectively. The LR map had a slightly higher ROC value than the FR map, but the numerical difference was slight, with both models showing similar results. The FR model is the simplest tool for probability analysis of urban growth, providing a faster and easier calculation process than other available tools. Additionally, the results can be easily interpreted. In contrast, for the LR model, only a limited amount of input data can be processed by the statistical program and a separate conversion process for input and output data is necessary. In conclusion, although the FR model is the simplest way to analyze the probability of urban growth, the LR model is more appropriate because it allows for quantitative analysis.

로지스틱 회귀 모델을 이용한 우면산 산사태 취약성도 제작 및 현장조사를 통한 사후검증 (Susceptibility Mapping of Umyeonsan Using Logistic Regression (LR) Model and Post-validation through Field Investigation)

  • 이선민;이명진
    • 대한원격탐사학회지
    • /
    • 제33권6_2호
    • /
    • pp.1047-1060
    • /
    • 2017
  • 현대사회에서 지속적으로 진행되고 있는 지구 온난화 현상은 비정상적인 기상 현상을 빈번히 발생시키고 있다. 특히 21세기에는 폭우와 같이 수문학적 측면에서 물의 특성이 전과 다르고, 수문학적 재해의 강도와 빈도가 증가하고 있다. 그 중 도시 지역에서는 재해로 인한 피해가 극대화될 가능성이 크기 때문에 피해를 대비하기 위한 재해에 대한 예측이 필요하다. 따라서 본 연구에서는 우리나라의 대표적인 도시 자연 재해인 산사태를 로지스틱 회귀(Logistic regression, LR) 모델을 이용하여 분석하고 현장조사를 통해 산사태 이후의 관리 현황을 조사 및 검증하였다. 현장조사 대상 지역은 기존에 산사태 발생지역 및 본 연구의 연구결과로부터 산사태 취약성이 높게 나타난 지역을 중심으로 수행하였다. 기존 산사태 발생지 데이터는 2011년 우면산 산사태 당시의 현장조사 자료 및 항공사진 비교분석을 통해 추출하였다. 산사태 관련 요인은 항공사진으로부터 제작된 지형도와 임상도에서 추출하였다. 산사태 취약성 지도는 산사태에 영향을 미치는 총 13개 요인을 통해 구성된 공간 데이터베이스에 LR 모델을 적용하여 제작되었다. 마지막으로 ROC(Receiver operating characteristic) 곡선을 이용해 산사태 취약성 지도를 검증한 결과 77.79%의 정확도를 나타냈다. 추가적으로, 연구결과에 나타난 산사태 취약지역에 대해 2011년 산사태 이후 산사태가 어떻게 관리되었는지를 확인하기 위해 현장조사를 수행하였다. 본 연구의 결과는 국내 도시 산사태 관리에 관한 정책 수립에 있어 과학적 근거로 활용할 수 있을 것으로 기대된다.

가중치 세분화 기반의 로지스틱 회귀분석 모델 (Fine-Grain Weighted Logistic Regression Model)

  • 이창환
    • 전자공학회논문지
    • /
    • 제53권9호
    • /
    • pp.77-81
    • /
    • 2016
  • 로지스틱 회귀분석은 오랫동안 다양한 분야에서 예측을 위한 기술 혹은 변수 간의 관계를 설명하기 위하여 사용되어 왔다. 로지스틱 회귀분석에서 각 속성은 목적 값에 대한 중요도를 가지는데 본 연구에서는 이를 세분화하여 각 속성의 값에 따라서 중요도를 부여하는 새로운 방법을 제시한다. 점진적 하강법을 이용하여 알고리즘의 성능을 최대화하는 각 속성값 가중치의 값을 계산하였다. 제안된 방법은 다양한 데이터를 이용하여 실험하였고 본 연구의 속성값 기반 로지스틱 회귀분석 방법은 기존의 로지스틱 회귀분석보다 우수한 학습 능력을 보임을 알 수 있었다.

전자의무기록을 이용한 욕창발생 예측 베이지안 네트워크 모델 개발 (Predictive Bayesian Network Model Using Electronic Patient Records for Prevention of Hospital-Acquired Pressure Ulcers)

  • 조인숙;정은자
    • 대한간호학회지
    • /
    • 제41권3호
    • /
    • pp.423-431
    • /
    • 2011
  • Purpose: The study was designed to determine the discriminating ability of a Bayesian network (BN) for predicting risk for pressure ulcers. Methods: Analysis was done using a retrospective cohort, nursing records representing 21,114 hospital days, 3,348 patients at risk for ulcers, admitted to the intensive care unit of a tertiary teaching hospital between January 2004 and January 2007. A BN model and two logistic regression (LR) versions, model-I and .II, were compared, varying the nature, number and quality of input variables. Classification competence and case coverage of the models were tested and compared using a threefold cross validation method. Results: Average incidence of ulcers was 6.12%. Of the two LR models, model-I demonstrated better indexes of statistical model fits. The BN model had a sensitivity of 81.95%, specificity of 75.63%, positive and negative predictive values of 35.62% and 96.22% respectively. The area under the receiver operating characteristic (AUROC) was 85.01% implying moderate to good overall performance, which was similar to LR model-I. However, regarding case coverage, the BN model was 100% compared to 15.88% of LR. Conclusion: Discriminating ability of the BN model was found to be acceptable and case coverage proved to be excellent for clinical use.

랜덤 포리스트를 이용한 비제어 급성 출혈성 쇼크의 흰쥐에서의 생존 예측 (A Survival Prediction Model of Rats in Uncontrolled Acute Hemorrhagic Shock Using the Random Forest Classifier)

  • 최준열;김성권;구정모;김덕원
    • 대한의용생체공학회:의공학회지
    • /
    • 제33권3호
    • /
    • pp.148-154
    • /
    • 2012
  • Hemorrhagic shock is a primary cause of deaths resulting from injury in the world. Although many studies have tried to diagnose accurately hemorrhagic shock in the early stage, such attempts were not successful due to compensatory mechanisms of humans. The objective of this study was to construct a survival prediction model of rats in acute hemorrhagic shock using a random forest (RF) model. Heart rate (HR), mean arterial pressure (MAP), respiration rate (RR), lactate concentration (LC), and peripheral perfusion (PP) measured in rats were used as input variables for the RF model and its performance was compared with that of a logistic regression (LR) model. Before constructing the models, we performed 5-fold cross validation for RF variable selection, and forward stepwise variable selection for the LR model to examine which variables were important for the models. For the LR model, sensitivity, specificity, accuracy, and area under the receiver operating characteristic curve (ROC-AUC) were 0.83, 0.95, 0.88, and 0.96, respectively. For the RF models, sensitivity, specificity, accuracy, and AUC were 0.97, 0.95, 0.96, and 0.99, respectively. In conclusion, the RF model was superior to the LR model for survival prediction in the rat model.

Artificial Neural Network for Prediction of Distant Metastasis in Colorectal Cancer

  • Biglarian, Akbar;Bakhshi, Enayatollah;Gohari, Mahmood Reza;Khodabakhshi, Reza
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제13권3호
    • /
    • pp.927-930
    • /
    • 2012
  • Background and Objectives: Artificial neural networks (ANNs) are flexible and nonlinear models which can be used by clinical oncologists in medical research as decision making tools. This study aimed to predict distant metastasis (DM) of colorectal cancer (CRC) patients using an ANN model. Methods: The data of this study were gathered from 1219 registered CRC patients at the Research Center for Gastroenterology and Liver Disease of Shahid Beheshti University of Medical Sciences, Tehran, Iran (January 2002 and October 2007). For prediction of DM in CRC patients, neural network (NN) and logistic regression (LR) models were used. Then, the concordance index (C index) and the area under receiver operating characteristic curve (AUROC) were used for comparison of neural network and logistic regression models. Data analysis was performed with R 2.14.1 software. Results: The C indices of ANN and LR models for colon cancer data were calculated to be 0.812 and 0.779, respectively. Based on testing dataset, the AUROC for ANN and LR models were 0.82 and 0.77, respectively. This means that the accuracy of ANN prediction was better than for LR prediction. Conclusion: The ANN model is a suitable method for predicting DM and in that case is suggested as a good classifier that usefulness to treatment goals.

BCI에서 기계 학습을 위한 간질 뇌파 특징 선택을 통한 차원 감소 방법 분석 (Analysis of Dimensionality Reduction Methods Through Epileptic EEG Feature Selection for Machine Learning in BCI)

  • 양통;;임창균
    • 한국전자통신학회논문지
    • /
    • 제13권6호
    • /
    • pp.1333-1342
    • /
    • 2018
  • 지금까지 뇌파(Electroencephalography - EEG)는 뇌전증 진단 및 치료를 위한 가장 중요하고 편리한 방법이었다. 그러나 뇌전증 뇌파 신호의 파형 특성은 매우 약하고 비 정지 상태이며 배경 노이즈가 강하기 때문에 식별하기가 어렵다. 이 논문에서는 간질 뇌파의 특징 선택을 통한 차원 감소를 통한 분류 방법의 효과를 분석한다. 우리는 차원 감소를 위해 주 요소 분석, 커널 요소 분석, 선형 판별 분석 방법을 사용하였다. 차원 감소방법의 성능 분석을 위해 Support Vector Machine: SVM), Logistic Regression(: LR), K-Nearestneighbor(: K-NN), Decision Tree(: DR), Random Forest(: RF) 분류 방법들을 사용해 평가하였다. 실험 결과에 따르면, PCA는 SVM, LR 및 K-NN에서 75% 정확도를 나타냈다. KPCA는 SVM과 K-KNN에서 85%의 성능을 보였으며 LDA는 K-NN를 이용했을 때 100 %의 정확도 보여주었다. 따라서 LDA를 이용한 차원 감소가 뇌전증 EEG 신호에 대한 최고의 분류 결과 보여주었다.