• Title/Summary/Keyword: logistic regression analysis

Search Result 4,175, Processing Time 0.031 seconds

Hazard Map of Road Slope Using a Logistic Regression Model and GIS (Logistic 회귀모형과 GIS기법을 활용한 접도사면 붕괴확률위험도 제작)

  • Kang Ho-Yun;Kwak Young-Joo;Kang In-Joon;Jang Yong-Gu
    • Proceedings of the Korean Society of Surveying, Geodesy, Photogrammetry, and Cartography Conference
    • /
    • 2006.04a
    • /
    • pp.339-344
    • /
    • 2006
  • Slope failures are happen to natural disastrous when they occur in mountainous areas adjoining highways in Korea. The accidents associated with slope failures have increased due to rapid urbanization of mountainous areas. Therefore, Regular maintenance is essential for all slope and conducted to maintain road safety as well as road function. In this study, we take priority of making a database of risk factor of the failure of a slope before assesment and analysis. The purpose of this paper is to recommend a standard of Slope Management Information Sheet(SMIS) like as Hazard Map. The next research, we suggest to pre-estimated model of a road slope using Logistic Regression Model.

  • PDF

Network Traffic Measurement Analysis using Machine Learning

  • Hae-Duck Joshua Jeong
    • Korean Journal of Artificial Intelligence
    • /
    • v.11 no.2
    • /
    • pp.19-27
    • /
    • 2023
  • In recent times, an exponential increase in Internet traffic has been observed as a result of advancing development of the Internet of Things, mobile networks with sensors, and communication functions within various devices. Further, the COVID-19 pandemic has inevitably led to an explosion of social network traffic. Within this context, considerable attention has been drawn to research on network traffic analysis based on machine learning. In this paper, we design and develop a new machine learning framework for network traffic analysis whereby normal and abnormal traffic is distinguished from one another. To achieve this, we combine together well-known machine learning algorithms and network traffic analysis techniques. Using one of the most widely used datasets KDD CUP'99 in the Weka and Apache Spark environments, we compare and investigate results obtained from time series type analysis of various aspects including malicious codes, feature extraction, data formalization, network traffic measurement tool implementation. Experimental analysis showed that while both the logistic regression and the support vector machine algorithm were excellent for performance evaluation, among these, the logistic regression algorithm performs better. The quantitative analysis results of our proposed machine learning framework show that this approach is reliable and practical, and the performance of the proposed system and another paper is compared and analyzed. In addition, we determined that the framework developed in the Apache Spark environment exhibits a much faster processing speed in the Spark environment than in Weka as there are more datasets used to create and classify machine learning models.

A Comparative Analysis of Landslide Susceptibility Assessment by Using Global and Spatial Regression Methods in Inje Area, Korea

  • Park, Soyoung;Kim, Jinsoo
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.33 no.6
    • /
    • pp.579-587
    • /
    • 2015
  • Landslides are major natural geological hazards that result in a large amount of property damage each year, with both direct and indirect costs. Many researchers have produced landslide susceptibility maps using various techniques over the last few decades. This paper presents the landslide susceptibility results from the geographically weighted regression model using remote sensing and geographic information system data for landslide susceptibility in the Inje area of South Korea. Landslide locations were identified from aerial photographs. The eleven landslide-related factors were calculated and extracted from the spatial database and used to analyze landslide susceptibility. Compared with the global logistic regression model, the Akaike Information Criteria was improved by 109.12, the adjusted R-squared was improved from 0.165 to 0.304, and the Moran’s I index of this analysis was improved from 0.4258 to 0.0553. The comparisons of susceptibility obtained from the models show that geographically weighted regression has higher predictive performance.

An educational tool for binary logistic regression model using Excel VBA (엑셀 VBA를 이용한 이분형 로지스틱 회귀모형 교육도구 개발)

  • Park, Cheolyong;Choi, Hyun Seok
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.2
    • /
    • pp.403-410
    • /
    • 2014
  • Binary logistic regression analysis is a statistical technique that explains binary response variable by quantitative or qualitative explanatory variables. In the binary logistic regression model, the probability that the response variable equals, say 1, one of the binary values is to be explained as a transformation of linear combination of explanatory variables. This is one of big barriers that non-statisticians have to overcome in order to understand the model. In this study, an educational tool is developed that explains the need of the binary logistic regression analysis using Excel VBA. More precisely, this tool explains the problems related to modeling the probability of the response variable equal to 1 as a linear combination of explanatory variables and then shows how these problems can be solved through some transformations of the linear combination.

A Study on Determinants of Stockpile Ammunition using Data Mining (데이터 마이닝을 활용한 장기저장탄약 상태 결정요인 분석 연구)

  • Roh, Yu Chan;Cho, Nam-Wook;Lee, Dongnyok
    • Journal of Korean Society for Quality Management
    • /
    • v.48 no.2
    • /
    • pp.297-307
    • /
    • 2020
  • Purpose: The purpose of this study is to analyze the factors that affect ammunition performance by applying data mining techniques to the Ammunition Stockpile Reliability Program (ASRP) data of the 155mm propelling charge. Methods: The ASRP data from 1999 to 2017 have been utilized. Logistic regression and decision tree analysis were used to investigate the factors that affect performance of ammunition. The performance evaluation of each model was conducted through comparison with an artificial neural networks(ANN) model. Results: The results of this study are as follows; logistic regression and the decision tree analysis showed that major defect rate of visual inspection is the most significant factor. Also, muzzle velocity by base charge and muzzle velocity by increment charge are also among the significant factors affecting the performance of 155mm propelling charge. To validate the logistic regression and decision tree models, their classification accuracies have been compared with the results of an ANN model. The results indicate that the logistic regression and decision tree models show sufficient performance which conforms the validity of the models. Conclusion: The main contribution of this paper is that, to our best knowledge, it is the first attempt at identifying the significant factors of ASPR data by using data mining techniques. The approaches suggested in the paper could also be extended to other types ammunition data.

Designing Neural Network Using Genetic Algorithm (유전자 알고리즘을 이용한 신경망 설계)

  • Park, Jeong-Sun
    • The Transactions of the Korea Information Processing Society
    • /
    • v.4 no.9
    • /
    • pp.2309-2314
    • /
    • 1997
  • The study introduces a neural network to predict the bankruptcy of insurance companies. As a method to optimize the network, a genetic algorithm suggests optimal structure and network parameters. The neural network designed by genetic algorithm is compared with discriminant analysis, logistic regression, ID3, and CART. The robust neural network model shows the best performance among those models compared.

  • PDF

Bivariate odd-log-logistic-Weibull regression model for oral health-related quality of life

  • Cruz, Jose N. da;Ortega, Edwin M.M.;Cordeiro, Gauss M.;Suzuki, Adriano K.;Mialhe, Fabio L.
    • Communications for Statistical Applications and Methods
    • /
    • v.24 no.3
    • /
    • pp.271-290
    • /
    • 2017
  • We study a bivariate response regression model with arbitrary marginal distributions and joint distributions using Frank and Clayton's families of copulas. The proposed model is used for fitting dependent bivariate data with explanatory variables using the log-odd log-logistic Weibull distribution. We consider likelihood inferential procedures based on constrained parameters. For different parameter settings and sample sizes, various simulation studies are performed and compared to the performance of the bivariate odd-log-logistic-Weibull regression model. Sensitivity analysis methods (such as local and total influence) are investigated under three perturbation schemes. The methodology is illustrated in a study to assess changes on schoolchildren's oral health-related quality of life (OHRQoL) in a follow-up exam after three years and to evaluate the impact of caries incidence on the OHRQoL of adolescents.

Prediction of Hypertension Complications Risk Using Classification Techniques

  • Lee, Wonji;Lee, Junghye;Lee, Hyeseon;Jun, Chi-Hyuck;Park, Il-Su;Kang, Sung-Hong
    • Industrial Engineering and Management Systems
    • /
    • v.13 no.4
    • /
    • pp.449-453
    • /
    • 2014
  • Chronic diseases including hypertension and its complications are major sources causing the national medical expenditures to increase. We aim to predict the risk of hypertension complications for hypertension patients, using the sample national healthcare database established by Korean National Health Insurance Corporation. We apply classification techniques, such as logistic regression, linear discriminant analysis, and classification and regression tree to predict the hypertension complication onset event for each patient. The performance of these three methods is compared in terms of accuracy, sensitivity and specificity. The result shows that these methods seem to perform similarly although the logistic regression performs marginally better than the others.

Efficiency Analysis and Finance Strategy for an Automotive Parts Maker Using DEA and Logistic Regression Model (DEA와 로지스틱 회귀분석을 이용한 자동차부품기업의 효율성 분석 및 재무전략)

  • Sin, Jeong-Hun;Hwang, Seung-June
    • Journal of the Korean Operations Research and Management Science Society
    • /
    • v.41 no.1
    • /
    • pp.127-143
    • /
    • 2016
  • This study applied DEA analysis to measure the relative efficiency of 35 companies that produce automobile body components. First, the input and output, the improvement target value of the calculated variables, and the reference group for benchmarking for inefficient groups to become efficient groups were established through DEA analysis. In addition, whether inefficiency was due to technical inefficiency or size was analyzed in connection with the cases of the actual companies through the measurement of scale efficiency. Second, a route for efficiency improvement was derived through DEA-Tier analysis by defining the possible group for benchmarking in actuality within the production industry of automobile body components where the primary cooperative company belonged. Third, the financial variables that generate the difference between efficient and inefficient groups were derived through logistic regression analysis. Financial strategies that determine the direction the indices should be improved to allow the inefficient group to become an efficient one were recommended. This research is expected to provide diagnostic methods for management efficiency and the direction of improvement to enhance the management efficiency of automotive parts makers by identifying the causes of the inefficiency of domestic automotive parts makers empirically. The study also provides financial strategies together with the target values of efficiency improvement for each individual company.

The Study of the Influence of Induced Abortion on Secondary Infertility analyzed by Logistic Regression (Logistic Analysis를 이용하여 분석한 인공유산이 속발성불임에 미치는 영향)

  • Lee, Won-Chul
    • Journal of Preventive Medicine and Public Health
    • /
    • v.15 no.1
    • /
    • pp.179-186
    • /
    • 1982
  • The methods controlling the confounding factors were discussed using the data of secondary infertility with induced abortion. Mantel-Haenszel method and logistic model were applied in the analysis to find out which factors were confounding and/or effect modification variables. In the logistic analysis, the main effect of induced abortion, spontaneous abortion, age and interaction effect between induced abortion and spontaneous abortion were chosen as independent variables being regressed into logistic functions. Spontaneons abortion was interpreted as a potential confounder and at the same time potential effect modifier and age was interpreted as potential confounder. Spontaneous abortion was shown to be more important influencing factor than age to the secondary infertility. In the course of logistic analysis, the problem of parameter estimation and hypothesis testing, assessing the fitness of a model, and selection of the best model were briefly explained. For the program of logistic model, FUNCAT Procedure of SAS package was chosen.

  • PDF