• Title/Summary/Keyword: Logistic models

A Study on Diagnostics Method for Categorical Data (범주형 자료의 진단방법에 관한 연구)

  • 이선규;조범석
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.18 no.33
    • /
    • pp.93-102
    • /
    • 1995
  • This study is concerned with diagnostic methods for cross-classified categorical data using a logistic regression model, a binary response model for cell proportions. Under this model, goodness of fit can be examined using Pearson's $\chi^2$ test statistic and the likelihood ratio statistic. These statistics assume that the sample was drawn under a with-replacement sampling model, so they are often inappropriate for analyzing contingency tables built from sample survey data collected under complex sampling schemes. We examine diagnostic procedures for detecting outlying cell proportions and influential observations in the design space of a logistic regression model that takes the survey design effects into account.
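
The two goodness-of-fit statistics named in this abstract can be sketched as follows. This is a minimal illustration under simple random sampling, the assumption the abstract says breaks down for complex surveys. The function names and the cell counts are hypothetical and are not taken from the paper.

```python
import math

def pearson_chi2(observed, expected):
    """Pearson statistic: sum over cells of (O - E)^2 / E."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

def likelihood_ratio_g2(observed, expected):
    """Likelihood ratio statistic: 2 * sum of O * ln(O / E).

    Cells with O == 0 contribute nothing, by the usual convention.
    """
    return 2.0 * sum(o * math.log(o / e) for o, e in zip(observed, expected) if o > 0)

# Hypothetical cell counts and fitted (expected) counts, for illustration only.
observed = [30, 20, 50]
expected = [25.0, 25.0, 50.0]

print("Pearson X^2:", pearson_chi2(observed, expected))
print("Likelihood ratio G^2:", likelihood_ratio_g2(observed, expected))
```

Both statistics are compared against a chi-square reference distribution when the sampling assumption holds. A design-effect correction of the kind the abstract refers to would adjust them, and that adjustment is not shown here.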

Modeling Exponential Growth in Population using Logistic, Gompertz and ARIMA Model: An Application on New Cases of COVID-19 in Pakistan

  • Omar, Zara;Tareen, Ahsan
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.1
    • /
    • pp.192-200
    • /
    • 2021
  • In mid-December 2019, the coronavirus began to spread from China. It has caused fatalities globally, and the WHO has declared it a pandemic. Several methods can fit data of this type, which reach a peak and then flatten over time. The main aim of the paper is to find the best, or a nearly appropriate, model for such data. Three different models were fitted to data on confirmed coronavirus patients in Pakistan up to 20 November 2020. Using data obtained from the National Institute of Health (NIH), Islamabad, we produced forecasts of COVID-19 confirmed cases, deaths, and recoveries in Pakistan using the logistic model, the Gompertz model, and the Auto-Regressive Integrated Moving Average (ARIMA) model. The fitted models revealed high exponential growth in the number of confirmed cases, deaths, and recoveries in Pakistan.
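
As a rough illustration of two of the growth curves named in this abstract, the sketch below evaluates a logistic curve and a Gompertz curve that share the same ceiling. The parameterization (ceiling `K`, rate `r`, inflection time `t0`) is one common textbook form and is an assumption here. The abstract does not state which form the authors fitted, and the parameter values are illustrative rather than estimates from the paper.

```python
import math

def logistic_curve(t, K, r, t0):
    """Logistic growth: symmetric S-curve, ceiling K, inflection at t0 (value K/2)."""
    return K / (1.0 + math.exp(-r * (t - t0)))

def gompertz_curve(t, K, r, t0):
    """Gompertz growth: asymmetric S-curve, ceiling K, inflection at t0 (value K/e)."""
    return K * math.exp(-math.exp(-r * (t - t0)))

# Illustrative parameters only (assumed, not fitted to any data).
K, r, t0 = 1000.0, 0.1, 50.0
for t in (0, 25, 50, 100, 200):
    print(t, round(logistic_curve(t, K, r, t0), 1), round(gompertz_curve(t, K, r, t0), 1))
```

Both curves grow roughly exponentially at first and then flatten toward `K`. The Gompertz curve reaches its inflection point at a lower level (`K/e`) than the logistic curve (`K/2`).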

Performance and Cost Analysis of Supply Chain Models

  • Bause, F.;Fischer, M.;Kemper, P.;Volker, M.
    • Proceedings of the Korea Society for Simulation Conference
    • /
    • 2001.10a
    • /
    • pp.425-434
    • /
    • 2001
  • In this paper we introduce a general framework for the modeling, analysis and costing of logistic networks, including supply chains (SCs). The modeling notation employed, the so-called Process Chain paradigm, was developed specifically for the application field of logistic networks, which includes SCs. We view SCs as discrete event dynamic systems (DEDS) and apply corresponding simulation techniques to derive performance measures of the Process Chain model under investigation. For this purpose, Process Chain models are automatically transformed into the input language of the simulation tool HIT. Subsequently, a cost accounting model using the performance measures is applied to obtain the costs, which are the actual subject of interest. The usefulness and applicability of the approach are illustrated by a typical supply chain example. We investigate the impact of an additional SC channel between a manufacturer and web consumers on the overall supply chain costs.

Selecting Machine Learning Model Based on Natural Language Processing for Shanghanlun Diagnostic System Classification (자연어 처리 기반 『상한론(傷寒論)』 변병진단체계(辨病診斷體系) 분류를 위한 기계학습 모델 선정)

  • Young-Nam Kim
    • 대한상한금궤의학회지
    • /
    • v.14 no.1
    • /
    • pp.41-50
    • /
    • 2022
  • Objective: The purpose of this study is to explore the most suitable machine learning algorithm for Shanghanlun diagnostic system classification using natural language processing (NLP). Methods: A total of 201 data items were collected from 『Shanghanlun』 and 『Clinical Shanghanlun』; 'Taeyangbyeong-gyeolhyung' and 'Eumyangyeokchahunobokbyeong' were excluded to prevent oversampling or undersampling. Data were preprocessed using a Twitter Korean tokenizer and used to train logistic regression, ridge regression, lasso regression, naive Bayes classifier, decision tree, and random forest algorithms. The accuracies of the models were compared. Results: Ridge regression and the naive Bayes classifier showed an accuracy of 0.843, logistic regression and random forest showed an accuracy of 0.804, the decision tree showed an accuracy of 0.745, and lasso regression showed an accuracy of 0.608. Conclusions: Ridge regression and the naive Bayes classifier are suitable NLP machine learning models for Shanghanlun diagnostic system classification.

Prediction of Galloping Accidents in Power Transmission Line Using Logistic Regression Analysis

  • Lee, Junghoon;Jung, Ho-Yeon;Koo, J.R.;Yoon, Yoonjin;Jung, Hyung-Jo
    • Journal of Electrical Engineering and Technology
    • /
    • v.12 no.2
    • /
    • pp.969-980
    • /
    • 2017
  • Galloping is one of the most serious vibration problems in transmission lines. Power lines can be extensively damaged by aerodynamic instabilities caused by ice accretion. In this study, the probability of accidents induced by galloping was analyzed using logistic regression. As former studies have generally concluded, the main factors considered were local weather factors and physical factors of power delivery systems. Since transmission towers outnumber weather observatories, the weather factors were interpolated, specifically by Kriging, prior to forming the galloping accident estimation model. Physical factors were provided by the Korea Electric Power Corporation; because of the large number of explanatory variables, variable selection was conducted, leaving a total of 11 variables. Before forming the estimation model, 840 non-galloping cases were chosen out of 13 billion cases to accompany the 84 provided galloping cases. The prediction model for galloping accidents was built with logistic regression and validated with a 4-fold validation method, and the corresponding AUC value of the ROC curve was used to assess the discrimination level of the estimation models. As a result, logistic regression analysis effectively discriminated the power lines that experienced galloping accidents from those that did not.
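
The AUC used in this abstract to assess discrimination can be read as the probability that a randomly chosen positive case receives a higher predicted score than a randomly chosen negative case. The sketch below computes it directly from that definition. The function name, labels, and scores are hypothetical and unrelated to the paper's data. In a 4-fold validation, a function like this would be applied to the held-out fold each time.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a correct pair.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    correct = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                correct += 1.0
            elif p == n:
                correct += 0.5
    return correct / (len(pos) * len(neg))

# Hypothetical labels (1 = galloping accident) and predicted probabilities.
labels = [1, 1, 0, 0, 0]
scores = [0.9, 0.4, 0.5, 0.2, 0.1]
print("AUC:", roc_auc(labels, scores))
```

The pairwise loop is quadratic in the number of cases, which is fine for a small sample. A rank-based formula gives the same value more efficiently on large data.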

Semiparametric Approach to Logistic Model with Random Intercept (준모수적 방법을 이용한 랜덤 절편 로지스틱 모형 분석)

  • Kim, Mijeong
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.6
    • /
    • pp.1121-1131
    • /
    • 2015
  • Logistic models with a random intercept are useful for analyzing longitudinal binary data. Traditionally, the random intercept of the logistic model is assumed to follow a parametric distribution (such as the normal distribution) and to be independent of the covariates. Such assumptions are strong and restrictive when applied to real data. Recently, Garcia and Ma (2015) derived semiparametric efficient estimators for the logistic model with a random intercept without these assumptions. Their estimator is consistent without assuming any parametric form for the random intercept. In addition, the method is computationally simple. In this paper, we apply this method to analyze toenail infection data. We compare the semiparametric estimator with the maximum likelihood estimator, the penalized quasi-likelihood estimator, and the hierarchical generalized linear estimator.

Landslide susceptibility mapping using Logistic Regression and Fuzzy Set model at the Boeun Area, Korea (로지스틱 회귀분석과 퍼지 기법을 이용한 산사태 취약성 지도작성: 보은군을 대상으로)

  • Al-Mamun, Al-Mamun;JANG, Dong-Ho
    • Journal of The Geomorphological Association of Korea
    • /
    • v.23 no.2
    • /
    • pp.109-125
    • /
    • 2016
  • This study aims to identify the landslide-susceptible zones of the Boeun area and provide reliable landslide susceptibility maps by applying different modeling methods. Aerial photographs and a field survey of the Boeun area were used to compile a landslide inventory map consisting of 388 landslide locations. A total of seven landslide causative factors (elevation, slope angle, slope aspect, geology, soil, forest and land use) were extracted from the database and converted into raster format. The causative factors were used to investigate the spatial relationship between each factor and landslide occurrence using the fuzzy set and logistic regression models. Fuzzy membership values and logistic regression coefficients were employed to determine each factor's rating for landslide susceptibility mapping. The landslide susceptibility maps were then compared and validated by a cross-validation technique. In the cross-validation process, 50% of the observed landslides were selected randomly in Excel, and two success rate curves (SRC) were generated for each landslide susceptibility map. The results show accuracy ratios of 84.34% and 83.29% for the logistic regression model and the fuzzy set model, respectively. This indicates that both models are reliable and reasonable methods for landslide susceptibility analysis.

A Study on Diabetes Management System Based on Logistic Regression and Random Forest

  • ByungJoo Kim
    • International journal of advanced smart convergence
    • /
    • v.13 no.2
    • /
    • pp.61-68
    • /
    • 2024
  • To advance diabetes diagnosis, this study introduces a two-step machine learning approach that combines the probabilistic predictions of Logistic Regression with the classification ability of Random Forest. Diabetes, a pervasive chronic disease affecting millions globally, requires precise and early detection to mitigate long-term complications. Traditional diagnostic methods, while effective, often entail invasive testing and may not fully exploit the patterns hidden in patient data. To address this gap, our research uses Logistic Regression to estimate the likelihood of diabetes and then employs Random Forest to classify individuals into diabetic, pre-diabetic or nondiabetic categories based on the computed probabilities. This methodology not only capitalizes on the strengths of both algorithms (Logistic Regression's proficiency in estimating nuanced probabilities and Random Forest's robustness in classification) but also introduces a refined mechanism to enhance diagnostic accuracy. By applying this model to a comprehensive diabetes dataset, we demonstrate a marked improvement in diagnostic precision, as evidenced by superior performance metrics compared with other machine learning approaches. Our findings underscore the potential of integrating diverse machine learning models to improve clinical decision-making, offering a promising avenue for the early and accurate diagnosis of diabetes and potentially other complex diseases.

Nonlinear Regression Analysis to Determine Infection Models of Colletotrichum acutatum Causing Anthracnose of Chili Pepper Using Logistic Equation

  • Kang, Wee-Soo;Yun, Sung-Chul;Park, Eun-Woo
    • The Plant Pathology Journal
    • /
    • v.26 no.1
    • /
    • pp.17-24
    • /
    • 2010
  • A logistic model describing the combined effects of temperature and wetness period on appressorium formation was developed using laboratory data on percent appressorium formation of Colletotrichum acutatum. In addition, the possible use of the logistic model for forecasting infection risks was evaluated in comparison with a first-order linear model. A simplified equilibrium model for enzymatic reactions was applied to obtain a temperature function for the asymptote parameter (A) of the logistic model. For the position (B) and rate (k) parameters, a reciprocal model was used to calculate the respective temperature functions. The nonlinear logistic model successfully described the response of appressorium formation to the combined effects of temperature and wetness period. In particular, the temperature function for the asymptote parameter A reflected the response of the upper limit of appressorium formation to temperature, which showed the typical temperature response of enzymatic reactions in cells. With both temperature and wetness period as independent variables, the nonlinear logistic model can be used to determine the length of the wetness period required for a given level of appressorium formation under different temperature conditions. The infection model derived from the nonlinear logistic model can be used to calculate infection risks from hourly temperature and wetness period data monitored by automated weather stations in the field. Compared with the nonlinear infection model, the linear infection model always predicted a shorter wetness period for appressorium formation, and resulted in significant under- and over-estimation of the response at low and high temperatures, respectively.
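
The abstract names an asymptote (A), a position (B), and a rate (k) parameter but does not give the equation itself. The sketch below therefore assumes one common three-parameter logistic form, `y = A / (1 + exp(-k * (w - B)))` with wetness period `w`, and shows how such a curve could be inverted to find the wetness period needed for a target level of appressorium formation. The functional form, the function names, and the parameter values are all assumptions for illustration. In the paper, A, B, and k are themselves functions of temperature, which is not modeled here.

```python
import math

def appressorium_formation(w, A, B, k):
    """Assumed logistic response (%) to wetness period w (hours) at a fixed temperature."""
    return A / (1.0 + math.exp(-k * (w - B)))

def wetness_required(y, A, B, k):
    """Invert the assumed logistic curve: wetness period giving response y, for 0 < y < A."""
    if not 0.0 < y < A:
        raise ValueError("target level must lie strictly between 0 and the asymptote A")
    return B - math.log(A / y - 1.0) / k

# Hypothetical parameter values for a single temperature (not from the paper).
A, B, k = 80.0, 6.0, 0.5
target = 40.0  # percent appressorium formation
hours = wetness_required(target, A, B, k)
print("wetness period needed:", hours)
print("check:", appressorium_formation(hours, A, B, k))
```

Under this assumed form, a target equal to half the asymptote is reached exactly at `w = B`, which is why B acts as a position parameter.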

A Comparative Study of Predictive Factors for Passing the National Physical Therapy Examination using Logistic Regression Analysis and Decision Tree Analysis

  • Kim, So Hyun;Cho, Sung Hyoun
    • Physical Therapy Rehabilitation Science
    • /
    • v.11 no.3
    • /
    • pp.285-295
    • /
    • 2022
  • Objective: The purpose of this study is to use logistic regression and decision tree analysis to identify the factors that affect success or failure in the national physical therapy examination, and to build and compare predictive models. Design: Secondary data analysis study. Methods: We analyzed 76,727 subjects from the physical therapy national examination data provided by the Korea Health Personnel Licensing Examination Institute. The target variable was pass or fail, and the input variables were gender, age, graduation status, and examination area. Frequency analysis, the chi-square test, binary logistic regression, and decision tree analysis were performed on the data. Results: In the logistic regression analysis, subjects in their 20s (odds ratio, OR=1, reference), those expected to graduate (OR=13.616, p<0.001), and those from the examination area of Jeju-do (OR=3.135, p<0.001) had a high probability of passing. In the decision tree, the predictive factors with the greatest influence on passing were, in order, graduation status ($\chi^2$=12366.843, p<0.001) and examination area ($\chi^2$=312.446, p<0.001). Logistic regression analysis showed a specificity of 39.6% and a sensitivity of 95.5%, while decision tree analysis showed a specificity of 45.8% and a sensitivity of 94.7%. In classification accuracy, logistic regression and decision tree analysis reached 87.6% and 88.0%, respectively. Conclusions: Both logistic regression and decision tree analysis were adequate to explain the predictive model. Additionally, whether actual test takers passed the national physical therapy examination could be determined by applying the constructed prediction model and prediction rate.
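
The sensitivity, specificity, and accuracy figures reported in this abstract all derive from a confusion matrix. The sketch below shows the arithmetic with hypothetical counts (not the study's data), chosen so that passes greatly outnumber fails. It illustrates how overall accuracy can stay high even when specificity is low.

```python
def classification_rates(tp, fn, tn, fp):
    """Return (sensitivity, specificity, accuracy) from confusion-matrix counts.

    tp/fn: actual passes predicted as pass/fail; tn/fp: actual fails predicted as fail/pass.
    """
    sensitivity = tp / (tp + fn)   # share of actual passes correctly predicted
    specificity = tn / (tn + fp)   # share of actual fails correctly predicted
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    return sensitivity, specificity, accuracy

# Hypothetical counts: 900 actual passes, 100 actual fails.
sens, spec, acc = classification_rates(tp=855, fn=45, tn=40, fp=60)
print(f"sensitivity={sens:.3f} specificity={spec:.3f} accuracy={acc:.3f}")
```

Because accuracy is a weighted average of sensitivity and specificity, with weights given by the class proportions, a model evaluated on mostly passing candidates can score well on accuracy while misclassifying most of those who fail.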