• Title/Summary/Keyword: receiver operating characteristic curve

Search Result 520, Processing Time 0.031 seconds

Multivariate Outlier Removing for the Risk Prediction of Gas Leakage based Methane Gas (메탄 가스 기반 가스 누출 위험 예측을 위한 다변량 특이치 제거)

  • Dashdondov, Khongorzul;Kim, Mi-Hye
    • Journal of the Korea Convergence Society
    • /
    • v.11 no.12
    • /
    • pp.23-30
    • /
    • 2020
  • In this study, the relationship between natural gas (NG) data and gas-related environmental elements was performed using machine learning algorithms to predict the level of gas leakage risk without directly measuring gas leakage data. The study was based on open data provided by the server using the IoT-based remote control Picarro gas sensor specification. The naturel gas leaks into the air, it is a big problem for air pollution, environment and the health. The proposed method is multivariate outlier removing method based Random Forest (RF) classification for predicting risk of NG leak. After, unsupervised k-means clustering, the experimental dataset has done imbalanced data. Therefore, we focusing our proposed models can predict medium and high risk so best. In this case, we compared the receiver operating characteristic (ROC) curve, accuracy, area under the ROC curve (AUC), and mean standard error (MSE) for each classification model. As a result of our experiments, the evaluation measurements include accuracy, area under the ROC curve (AUC), and MSE; 99.71%, 99.57%, and 0.0016 for MOL_RF respectively.

Life Risk Assessment of Landslide Disaster in Jinbu Area Using Logistic Regression Model (로지스틱 회귀분석모델을 활용한 평창군 진부 지역의 산사태 재해의 인명 위험 평가)

  • Rahnuma, Bintae Rashid Urmi;Al, Mamun;Jang, Dong-Ho
    • Journal of The Geomorphological Association of Korea
    • /
    • v.27 no.2
    • /
    • pp.65-80
    • /
    • 2020
  • This paper deals with risk assessment of life in a landslide-prone area by a GIS-based modeling method. Landslide susceptibility maps can provide a probability of landslide prone areas to mitigate or proper control this problems and to take any development plan and disaster management. A landslide inventory map of the study area was prepared based on past historical information and aerial photography analysis. A total of 550 landslides have been counted at the whole study area. The extracted landslides were randomly selected and divided into two different groups, 50% of the landslides were used for model calibration and the other were used for validation purpose. Eleven causative factors (continuous and thematic) such as slope, aspect, curvature, topographic wetness index, elevation, forest type, forest crown density, geology, land-use, soil drainage, and soil texture were used in hazard analysis. The correlation between landslides and these factors, pixels were divided into several classes and frequency ratio was also extracted. Eventually, a landslide susceptibility map was constructed using a logistic regression model based on entire events. Moreover, the landslide susceptibility map was plotted with a receiver operating characteristic (ROC) curve and calculated the area under the curve (AUC) and tried to extract a success rate curve. Based on the results, logistic regression produced an 85.18% accuracy, so we believed that the model was reliable and acceptable for the landslide susceptibility analysis on the study area. In addition, for risk assessment, vulnerability scale were added for social thematic data layer. The study area predictive landslide affected pixels 2,000 and 5,000 were also calculated for making a probability table. In final calculation, the 2,000 predictive landslide affected pixels were assumed to run. The total population causalities were estimated as 7.75 person that was relatively close to the actual number published in Korean Annual Disaster Report, 2006.

Texture Analysis of Gray-Scale Ultrasound Images for Staging of Hepatic Fibrosis (간 섬유화 단계 평가를 위한 회색조 초음파 영상 기반 텍스처 분석)

  • Eun Joo Park;Seung Ho Kim;Sang Joon Park;Tae Wook Baek
    • Journal of the Korean Society of Radiology
    • /
    • v.82 no.1
    • /
    • pp.116-127
    • /
    • 2021
  • Purpose To evaluate the feasibility of texture analysis of gray-scale ultrasound (US) images for staging of hepatic fibrosis. Materials and Methods Altogether, 167 patients who had undergone routine US and laboratory tests for a fibrosis-4 (FIB-4) index were included. Texture parameters were measured using a dedicated in-house software. Regions of interest were placed in five different segments (3, 5, 6, 7, 8) for each patient. The FIB-4 index was used as the reference standard for hepatic fibrosis grade. Comparisons of the texture parameters between different fibrosis groups were performed with the Student's t-test or Mann-Whitney U-test. Diagnostic performance was evaluated by receiver operating curve analysis. Results The study population comprised of patients with no fibrosis (FIB-4 < 1.45, n = 50), mild fibrosis (1.45 ≤ FIB-4 ≤ 2.35, n = 37), moderate fibrosis (2.35 < FIB-4 ≤ 3.25, n = 27), and severe fibrosis (FIB-4 > 3.25, n = 53). Skewness in hepatic segment 5 showed a difference between patients with no fibrosis and mild fibrosis (0.2392 ± 0.3361, 0.4134 ± 0.3004, respectively, p = 0.0109). The area under the curve of skewness for discriminating patients with no fibrosis from those with mild fibrosis was 0.660 (95% confidence interval, 0.551-0.758), with an estimated accuracy, sensitivity, specificity of 64%, 87%, 48%, respectively. Conclusion A significant difference was observed regarding skewness in segment 5 between patients with no fibrosis and patients with mild fibrosis.

The Use of Continuous Confidence Judgments in ROC of Digital Radiography (디지털 X선영상 평가에서 연속확신도법 ROC의 적용)

  • Kim, Hark-Sung;Lee, In-Ja;Kim, Sung-Chul
    • Journal of radiological science and technology
    • /
    • v.32 no.2
    • /
    • pp.147-151
    • /
    • 2009
  • In general, the discrete confidence judgments that use five-step assessment method have been used to assess the medical images by ROC. TPF or FPF can be computed easily with this independent reading test. However, during experiments, it happens frequently that adequate distribution for observers is required to smoothly estimate the ROC curve. In addition, data becomes invalid for distribution of the created categories. To solve such problems or to apply the ROC interpretation to data that is not obtained from the experimental observation, the continuous confidence judgements (CCJ) has been proposed, which implements ROC interpretation using continuously-distributed experimental results without category classification has been used. As the use of CCJ to assess medical images was barely reported in Korea, we applied it to the assessment of chest digital images in this study. The results showed that a smooth ROC curve was obtained conveniently by the commercialized program and the characteristic value was measured easily. Therefore, it is recommended that this method can be applied to the assessment of digital medical images.

  • PDF

Diagnostic performance of enzyme-linked immnosorbent assays for diagnosing paratuberculosis in cattle: a meta-analysis

  • Pak, Son-Il
    • Korean Journal of Veterinary Research
    • /
    • v.44 no.4
    • /
    • pp.669-676
    • /
    • 2004
  • To evaluate the diagnostic accuracy of two commercial ELISA tests (Allied- and CSL-ELISA) for the diagnosis of Mycobacterium paratuberculosis in cattle, Meta-analysis using English language papers published during 1990-2001 was performed. Diagnostic odds ratios (DOR) were analyzed using regression analysis together with summary receiver operating characteristic (ROC) curves. The difference in diagnostic performance between the two ELISA systems was evaluated by using linear regression. Publication bias was assessed by funnel plot and linear regression. The pooled sensitivity and specificity were 44% (95% CI, 38 to 51) and 98% (95% CI, 96 to 99) for the random-effect model. The DOR between studies was heterogeneous. The area under the fitted ROC curve (AUC) was 0.72 for the unweighted and 0.77 for the weighted model. Maximum joint sensitivity and specificity for the unweighted and weighted model from their summary ROC curve were 70% and 75%, respectively. Based on the fitted model, at a specificity of 95%, sensitivity was estimated to be 52% for the unweighted and 57% for the weighted model. From the final multivariable model study characteristic, the country was the only significant variable with an explained component variance of 13.3%. There were no significant differences in discriminatory power, sensitivity, and specificity between the two ELISA tests. The overall diagnostic accuracy of two commercial ELISA tests was moderate, as judged by the AUC, maximum joint sensitivity and specificity, and estimates from the fitted model and clinical usefulness of the tests for screening program is limited because of low sensitivity and heterogeneous of DOR. It is, therefore, recommended to use ELISA tests as a parallel testing with other diagnostic tests together to increase test sensitivity in the screening program.

Linear prediction analysis-based method for detecting snapping shrimp noise (선형 예측 분석 기반의 딱총 새우 잡음 검출 기법)

  • Jinuk Park;Jungpyo Hong
    • The Journal of the Acoustical Society of Korea
    • /
    • v.42 no.3
    • /
    • pp.262-269
    • /
    • 2023
  • In this paper, we propose a Linear Prediction (LP) analysis-based feature for detecting Snapping Shrimp (SS) Noise (SSN) in underwater acoustic data. SS is a species that creates high amplitude signals in shallow, warm waters, and its frequent and loud sound is a major source of noise. The proposed feature takes advantage of the characteristic of SSN, which is sudden and rapidly disappearing, by using LP analysis to detect the exact noise interval and reduce the effects of SSN. The error between the predicted and measured value is large and results in effective SSN detection. To further improve performance, a constant false alarm rate detector is incorporated into the proposed feature. Our evaluation shows that the proposed methods outperform the state-of-the-art MultiLayer-Wavelet Packet Decomposition (ML-WPD) in terms of receiver operating characteristic curve and Area Under the Curve (AUC), with the LP analysis-based feature achieving a higher AUC by 0.12 on average and lower computational complexity.

Related Factors of Depression according to Individual Attributes and Regional Environment: Using Multi-Level Analysis (다수준분석을 활용한 개인특성 및 지역환경에 따른 우울증 관련 영향요인 분석)

  • Moon, Seok-Jun;Lee, Ga Ram;Nam, Eun-Woo
    • Health Policy and Management
    • /
    • v.30 no.3
    • /
    • pp.355-365
    • /
    • 2020
  • Background: This study is aimed to verify individual and regional-level factors affecting the depression of Koreans and to develop social programs for improving the depressive status. Methods: This study used individual-level variables from the Korean Community Health Survey (2018) and used the e-regional index of the Korean Statistical Information Service as the regional-level variable. A multi-level logistic regression was executed to identify individual and regional-level variables that were expected to affect the extent of depressive symptoms and to draw the receiver operating characteristic curve to compare the volume of impact between variables from both levels. Results: The results of the multi-level logistic regression analysis in regards to individual-level factors showed that older age, female gender, a lower income level, a lower education level, not having a spouse, the practice of walking, the consumption of breakfast higher levels of stress, and having high blood pressure or diabetes were associated with a greater increase in depressive symptoms. In terms of regional factors, areas with fewer cultural facilities and fewer car registration had higher levels of depressive symptoms. The comparison of area under the curve showed that individual factors had a greater influence than regional factors. Conclusion: This study showed that while both, individual and regional-level factors affect depression, the influence of the latter was relatively weaker as compared to the first. In this sense, it is necessary to develop programs focused on the individual, such as social prescribing at the local or community-level, rather than the city and nation-level approach that are currently prevalent.

Nomogram comparison conducted by logistic regression and naïve Bayesian classifier using type 2 diabetes mellitus (T2D) (제 2형 당뇨병을 이용한 로지스틱과 베이지안 노모그램 구축 및 비교)

  • Park, Jae-Cheol;Kim, Min-Ho;Lee, Jea-Young
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.5
    • /
    • pp.573-585
    • /
    • 2018
  • In this study, we fit the logistic regression model and naïve Bayesian classifier model using 11 risk factors to predict the incidence rate probability for type 2 diabetes mellitus. We then introduce how to construct a nomogram that can help people visually understand it. We use data from the 2013-2015 Korean National Health and Nutrition Examination Survey (KNHANES). We take 3 interactions in the logistic regression model to improve the quality of the analysis and facilitate the application of the left-aligned method to the Bayesian nomogram. Finally, we compare the two nomograms and examine their utility. Then we verify the nomogram using the ROC curve.

A Logistic Model Including Risk Factors for Lymph Node Metastasis Can Improve the Accuracy of Magnetic Resonance Imaging Diagnosis of Rectal Cancer

  • Ogawa, Shimpei;Itabashi, Michio;Hirosawa, Tomoichiro;Hashimoto, Takuzo;Bamba, Yoshiko;Kameoka, Shingo
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.16 no.2
    • /
    • pp.707-712
    • /
    • 2015
  • Background: To evaluate use of magnetic resonance imaging (MRI) and a logistic model including risk factors for lymph node metastasis for improved diagnosis. Materials and Methods: The subjects were 176 patients with rectal cancer who underwent preoperative MRI. The longest lymph node diameter was measured and a cut-off value for positive lymph node metastasis was established based on a receiver operating characteristic (ROC) curve. A logistic model was constructed based on MRI findings and risk factors for lymph node metastasis extracted from logistic-regression analysis. The diagnostic capabilities of MRI alone and those of the logistic model were compared using the area under the curve (AUC) of the ROC curve. Results: The cut-off value was a diameter of 5.47 mm. Diagnosis using MRI had an accuracy of 65.9%, sensitivity 73.5%, specificity 61.3%, positive predictive value (PPV) 62.9%, and negative predictive value (NPV) 72.2% [AUC: 0.6739 (95%CI: 0.6016-0.7388)]. Age (<59) (p=0.0163), pT (T3+T4) (p=0.0001), and BMI (<23.5) (p=0.0003) were extracted as independent risk factors for lymph node metastasis. Diagnosis using MRI with the logistic model had an accuracy of 75.0%, sensitivity 72.3%, specificity 77.4%, PPV 74.1%, and NPV 75.8% [AUC: 0.7853 (95%CI: 0.7098-0.8454)], showing a significantly improved diagnostic capacity using the logistic model (p=0.0002). Conclusions: A logistic model including risk factors for lymph node metastasis can improve the accuracy of MRI diagnosis of rectal cancer.

Model Based on Alkaline Phosphatase and Gamma-Glutamyltransferase for Gallbladder Cancer Prognosis

  • Xu, Xin-Sen;Miao, Run-Chen;Zhang, Ling-Qiang;Wang, Rui-Tao;Qu, Kai;Pang, Qing;Liu, Chang
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.16 no.15
    • /
    • pp.6255-6259
    • /
    • 2015
  • Purpose: To evaluate the prognostic value of alkaline phosphatase (ALP) and gamma-glutamyltransferase (GGT) in gallbladder cancer (GBC). Materials and Methods: Serum ALP and GGT levels and clinicopathological parameters were retrospectively evaluated in 199 GBC patients. Receiver operating characteristic (ROC) curve analysis was performed to determine the cut-off values of ALP and GGT. Then, associations with overall survival were assessed by multivariate analysis. Based on the significant factors, a prognostic score model was established. Results: By ROC curve analysis, $ALP{\geq}210U/L$ and $GGT{\geq}43U/L$ were considered elevated. Overall survival for patients with elevated ALP and GGT was significantly worse than for patients within the normal range. Multivariate analysis showed that the elevated ALP, GGT and tumor stage were independent prognostic factors. Giving each positive factor a score of 1, we established a preoperative prognostic score model. Varied outcomes would be significantly distinguished by the different score groups. By further ROC curve analysis, the simple score showed great superiority compared with the widely used TNM staging, each of the ALP or GGT alone, or traditional tumor markers such as CEA, AFP, CA125 and CA199. Conclusions: Elevated ALP and GGT levels were risk predictors in GBC patients. Our prognostic model provides infomration on varied outcomes of patients from different score groups.