• Title/Summary/Keyword: Area Under the Receiver Operating Characteristic Curve (AUC)

Online anomaly detection algorithm based on deep support vector data description using incremental centroid update (점진적 중심 갱신을 이용한 deep support vector data description 기반의 온라인 비정상 탐지 알고리즘)

  • Lee, Kibae;Ko, Guhn Hyeok;Lee, Chong Hyun
    • The Journal of the Acoustical Society of Korea / v.41 no.2 / pp.199-209 / 2022
  • Typical anomaly detection algorithms are trained on prior data. Batch-learning algorithms therefore suffer inevitable performance degradation when the characteristics of newly incoming normal data change over time. We propose an online anomaly detection algorithm that accounts for gradual changes in the characteristics of incoming normal data. The proposed algorithm, based on a one-class classification model, includes both offline and online learning procedures. In the offline learning procedure, the algorithm learns to place the prior data close to the centroid of the latent space and then updates that centroid incrementally using newly incoming data. In the online learning procedure, the algorithm continues learning using the updated centroid. In experiments on public underwater acoustic data, the proposed algorithm requires only about 2 % additional learning time for the incremental centroid update and learning. Nevertheless, it shows a 19.10 % improvement in area under the receiver operating characteristic curve (AUC) over the offline learning model when new normal data arrive.
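
The abstract above does not state the exact centroid update rule, so the following is only a minimal sketch that assumes a running-mean update over latent vectors and a squared-distance anomaly score. The latent vectors are assumed to come from an encoder network that is not shown, and the names `update_centroid` and `anomaly_score` are hypothetical.

```python
def update_centroid(centroid, count, z):
    """Running-mean update (an assumption): c_new = c + (z - c) / (count + 1)."""
    new_count = count + 1
    new_centroid = [c + (zi - c) / new_count for c, zi in zip(centroid, z)]
    return new_centroid, new_count


def anomaly_score(centroid, z):
    """Squared Euclidean distance from a latent vector to the centroid."""
    return sum((zi - c) ** 2 for c, zi in zip(centroid, z))


# Offline phase: centroid of latent vectors from prior normal data (toy values).
prior = [[0.0, 0.0], [2.0, 0.0], [0.0, 2.0], [2.0, 2.0]]
centroid = [sum(col) / len(prior) for col in zip(*prior)]
count = len(prior)

# Online phase: fold each newly arriving normal latent vector into the centroid.
for z in [[6.0, 1.0]]:
    centroid, count = update_centroid(centroid, count, z)

# Larger distances from the updated centroid indicate more anomalous samples.
print(centroid, anomaly_score(centroid, [2.0, 3.0]))
```

A running mean needs only the current centroid and a sample count, so no past data has to be stored; this is one plausible reason an incremental update can add little learning time.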

Deep Learning in Thyroid Ultrasonography to Predict Tumor Recurrence in Thyroid Cancers (인공지능 딥러닝을 이용한 갑상선 초음파에서의 갑상선암의 재발 예측)

  • Jieun Kil;Kwang Gi Kim;Young Jae Kim;Hye Ryoung Koo;Jeong Seon Park
    • Journal of the Korean Society of Radiology / v.81 no.5 / pp.1164-1174 / 2020
  • Purpose: To evaluate a deep learning model for predicting recurrence of thyroid tumors using preoperative ultrasonography (US). Materials and Methods: We included representative images from 229 patients (male:female = 42:187; mean age, 49.6 years) who had been diagnosed with thyroid cancer on preoperative US and subsequently underwent thyroid surgery. After selecting the representative transverse or longitudinal US images, we created a data set of 898 images after augmentation. Python 2.7.6 and the Keras 2.1.5 neural network framework were used for deep learning with a convolutional neural network. We compared the clinical and histological features between patients with and without recurrence. The predictive performance of the deep learning model between groups was evaluated using receiver operating characteristic (ROC) analysis, and the area under the ROC curve served as a summary of the prognostic performance of the model in predicting recurrent thyroid cancer. Results: Tumor recurrence was noted in 49 (21.4%) of the 229 patients. Tumor size and multifocality differed significantly between the groups with and without recurrence (p < 0.05). The overall mean area under the curve (AUC) of the deep learning model for predicting recurrent thyroid cancer was 0.9 ± 0.06. The mean AUC was 0.87 ± 0.03 in macrocarcinoma and 0.79 ± 0.16 in microcarcinoma. Conclusion: A deep learning model for analyzing US images of thyroid cancer showed the possibility of predicting recurrence of thyroid cancer.

Prediction model of osteoporosis using nutritional components based on association (연관성 규칙 기반 영양소를 이용한 골다공증 예측 모델)

  • Yoo, JungHun;Lee, Bum Ju
    • The Journal of the Convergence on Culture Technology / v.6 no.3 / pp.457-462 / 2020
  • Osteoporosis is a disease that occurs mainly in the elderly and increases the risk of fractures due to structural deterioration of bone mass and tissues. The purposes of this study are to assess the relationship between nutritional components and osteoporosis and to evaluate models for predicting osteoporosis based on nutrient components. In the experiments, association analysis was performed using binary logistic regression, and predictive models were generated using the naive Bayes algorithm and variable subset selection methods. The analysis of single variables indicated that food intake and vitamin B2 showed the highest area under the receiver operating characteristic curve (AUC) for predicting osteoporosis in men. In women, monounsaturated fatty acids showed the highest AUC. Among the prediction models for female osteoporosis, the models generated by the correlation-based feature subset and wrapper-based variable subset methods showed an AUC of 0.662. In men, the model using the full variable set obtained an AUC of 0.626, and the other male models showed very low predictive performance in sensitivity and 1-specificity. The results of this study are expected to serve as basic information for the treatment and prevention of osteoporosis.
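
To make the single-variable AUC comparison concrete, here is a small sketch of the rank-based (Mann-Whitney) form of the AUC: the probability that a randomly chosen positive case has a higher value than a randomly chosen negative case, with ties counted as one half. The data and the variable names `var_a` and `var_b` are hypothetical; nothing here reproduces the study's data or software.

```python
def auc(values_pos, values_neg):
    """AUC as P(positive value > negative value), counting ties as 0.5."""
    wins = 0.0
    for p in values_pos:
        for n in values_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(values_pos) * len(values_neg))


def rank_by_auc(variables):
    """Sort variable names by single-variable AUC, highest first."""
    scored = {name: auc(pos, neg) for name, (pos, neg) in variables.items()}
    return sorted(scored.items(), key=lambda item: item[1], reverse=True)


# Hypothetical values per variable: (cases with the outcome, cases without it).
variables = {
    "var_a": ([0.9, 0.8, 0.4], [0.7, 0.4, 0.2, 0.1]),
    "var_b": ([0.6, 0.5, 0.3], [0.6, 0.4, 0.5, 0.2]),
}
ranking = rank_by_auc(variables)
print(ranking)
```

An AUC of 0.5 corresponds to chance-level ranking, which is why values such as 0.626 or 0.662 are usually read as weak discrimination.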

Multivariate Outlier Removing for the Risk Prediction of Gas Leakage based Methane Gas (메탄 가스 기반 가스 누출 위험 예측을 위한 다변량 특이치 제거)

  • Dashdondov, Khongorzul;Kim, Mi-Hye
    • Journal of the Korea Convergence Society / v.11 no.12 / pp.23-30 / 2020
  • In this study, the relationship between natural gas (NG) data and gas-related environmental elements was analyzed using machine learning algorithms to predict the level of gas leakage risk without directly measuring gas leakage data. The study was based on open data provided by the server using the IoT-based remote control Picarro gas sensor specification. When natural gas leaks into the air, it poses a serious problem for air pollution, the environment, and health. The proposed method is a multivariate outlier removal method based on Random Forest (RF) classification for predicting the risk of NG leaks. After unsupervised k-means clustering, the experimental dataset was imbalanced. Therefore, we focused on how well the proposed models predict the medium- and high-risk classes. We compared the receiver operating characteristic (ROC) curve, accuracy, area under the ROC curve (AUC), and mean standard error (MSE) of each classification model. In our experiments, MOL_RF achieved an accuracy of 99.71%, an AUC of 99.57%, and an MSE of 0.0016.

Exploring the Predictive Factors of Passing the Korean Physical Therapist Licensing Examination (한국 물리치료사 국가 면허시험 합격 여부의 예측요인 탐색)

  • Kim, So-Hyun;Cho, Sung-Hyoun
    • Journal of The Korean Society of Integrative Medicine / v.10 no.3 / pp.107-117 / 2022
  • Purpose: The purpose of this study was to establish a model of the predictive factors for success or failure of examinees undertaking the Korean physical therapist licensing examination (KPTLE). Additionally, we assessed the pass/fail cut-off point. Methods: We analyzed the results of 10,881 examinees who undertook the KPTLE, using data provided by the Korea Health Personnel Licensing Examination Institute. The target variable was the test result (pass or fail), and the input variables were sex, age, test subject, and total score. Frequency analysis, chi-square test, descriptive statistics, independent t-test, correlation analysis, binary logistic regression, and receiver operating characteristic (ROC) curve analyses were performed on the data. Results: Sex and age were not significant predictors of attaining a pass (p>.05). The test subjects with the highest probability of passing were, in order, medical regulation (MR) (odds ratio (OR)=2.91, p<.001), foundations of physical therapy (FPT) (OR=2.86, p<.001), diagnosis and evaluation for physical therapy (DEPT) (OR=2.74, p<.001), physical therapy intervention (PTI) (OR=2.66, p<.001), and practical examination (PE) (OR=1.24, p<.001). The cut-off points for each subject were: FPT, 32.50; DEPT, 29.50; PTI, 44.50; MR, 14.50; and PE, 50.50. The cut-off point for the total score (TS) was 164.50. The sensitivity, specificity, and classification accuracy of the prediction model were 99 %, 98 %, and 99 %, respectively, indicating high accuracy. Area under the curve (AUC) values for each subject were: FPT, .958; DEPT, .968; PTI, .984; MR, .885; PE, .962; and TS, .998, indicating a high degree of fit. Conclusion: In our study, the predictive factors for passing the KPTLE were identified, and the optimal cut-off point was calculated for each subject. Logistic regression was adequate to explain the predictive model. These results will provide universities and examinees with useful information for predicting their success or failure in the KPTLE.
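
The abstract does not say which criterion defined the "optimal" cut-off. A common choice in ROC analysis is Youden's index, J = sensitivity + specificity - 1, and the sketch below assumes that criterion. It also assumes candidate cut-offs placed midway between adjacent observed scores, which would be consistent with the half-point values reported, though the abstract does not confirm this. The scores are invented for illustration and the function name `youden_cutoff` is hypothetical.

```python
def youden_cutoff(scores, passed):
    """Return (cutoff, sensitivity, specificity) maximizing Youden's J.

    A score >= cutoff is treated as a predicted pass. Candidate cut-offs are
    midpoints between adjacent distinct observed scores.
    """
    values = sorted(set(scores))
    candidates = [(a + b) / 2 for a, b in zip(values, values[1:])]
    n_pos = sum(passed)
    n_neg = len(passed) - n_pos
    best = None
    for c in candidates:
        tp = sum(1 for s, y in zip(scores, passed) if y and s >= c)
        tn = sum(1 for s, y in zip(scores, passed) if not y and s < c)
        sens, spec = tp / n_pos, tn / n_neg
        j = sens + spec - 1
        if best is None or j > best[0]:
            best = (j, c, sens, spec)
    return best[1:]


# Invented subject scores and pass (1) / fail (0) outcomes.
scores = [28, 30, 31, 32, 33, 34, 35, 36]
passed = [0, 0, 0, 0, 1, 0, 1, 1]
print(youden_cutoff(scores, passed))
```

With these toy values the selected cut-off is 32.5, where every passing examinee is detected and four of five failing examinees are correctly excluded.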

Comparison of the Pediatric Balance Scale and Fullerton Advanced Balance Scale for Predicting Falls in Children With Cerebral Palsy

  • Kim, Gyoung-mo
    • Physical Therapy Korea / v.23 no.4 / pp.63-70 / 2016
  • Background: The Pediatric Balance Scale (PBS) and the Fullerton Advanced Balance (FAB) scale are used to assess balance function in patients with balance problems. These multidimensional clinical balance scales provide information about potential risk factors for falls. Objects: The purpose of this study was to investigate and compare the predictive properties of the PBS and FAB scales relative to fall risk in children with cerebral palsy (CP) using receiver operating characteristic analysis. Methods: In total, 49 children with CP (21 boys, 28 girls) who were classified as level 1 or 2 according to the Gross Motor Function Classification System participated in this study. The PBS and FAB were administered, and the cut-off score, sensitivity, specificity, and area under the curve (AUC) were determined. Results: In this study, the PBS was a significant predictive measure of fall risk, but the FAB was not significant in children with CP. A cut-off score of 45.5 points provided optimal sensitivity of .90 and specificity of .69 on the PBS, and a cut-off score of 21.5 points provided optimal sensitivity of .90 and specificity of .62 on the FAB. Both scales showed moderate accuracy, with AUCs of .79 and .76, respectively. Conclusion: The PBS is a useful screening tool for predicting fall risk in children with cerebral palsy; a score of 45.5 or lower indicates a high risk for falls and a need for balance intervention.
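
On a balance scale, low scores indicate risk, so the positive test result is "score at or below the cut-off" rather than above it. The sketch below shows how sensitivity and specificity follow from that rule at a fixed cut-off. The scores are invented and do not reproduce the study's data; the function name is hypothetical.

```python
def sens_spec_low_is_risk(scores, fell, cutoff):
    """Flag a child as at risk when score <= cutoff.

    Returns (sensitivity, specificity) against observed fall status.
    """
    tp = sum(1 for s, f in zip(scores, fell) if f and s <= cutoff)
    fn = sum(1 for s, f in zip(scores, fell) if f and s > cutoff)
    tn = sum(1 for s, f in zip(scores, fell) if not f and s > cutoff)
    fp = sum(1 for s, f in zip(scores, fell) if not f and s <= cutoff)
    return tp / (tp + fn), tn / (tn + fp)


# Invented balance scores; fell = 1 marks children with a fall history.
balance_scores = [38, 42, 44, 47, 44, 48, 50, 52]
fell = [1, 1, 1, 1, 0, 0, 0, 0]
print(sens_spec_low_is_risk(balance_scores, fell, cutoff=45.5))
```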

Partial AUC maximization for essential gene prediction using genetic algorithms

  • Hwang, Kyu-Baek;Ha, Beom-Yong;Ju, Sanghun;Kim, Sangsoo
    • BMB Reports / v.46 no.1 / pp.41-46 / 2013
  • Identifying genes indispensable for an organism's life and their characteristics is one of the central questions in current biological research, and hence it would be helpful to develop computational approaches towards the prediction of essential genes. The performance of a predictor is usually measured by the area under the receiver operating characteristic curve (AUC). We propose a novel method by implementing genetic algorithms to maximize the partial AUC that is restricted to a specific interval of lower false positive rate (FPR), the region relevant to follow-up experimental validation. Our predictor uses various features based on sequence information, protein-protein interaction network topology, and gene expression profiles. A feature selection wrapper was developed to alleviate the over-fitting problem and to weigh each feature's relevance to prediction. We evaluated our method using the proteome of budding yeast. Our implementation of genetic algorithms maximizing the partial AUC below 0.05 or 0.10 of FPR outperformed other popular classification methods.
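
The partial AUC mentioned above is the area under the ROC curve restricted to a low-FPR interval. The following sketch computes it with the trapezoidal rule on toy data; it illustrates only the objective being maximized, not the authors' genetic algorithm or features. The function names and data are hypothetical.

```python
def roc_points(scores, labels):
    """ROC points (FPR, TPR), sweeping the threshold from high to low scores."""
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    pairs = sorted(zip(scores, labels), key=lambda p: -p[0])
    points = [(0.0, 0.0)]
    tp = fp = 0
    i = 0
    while i < len(pairs):
        j = i
        # Tied scores move the curve in a single step.
        while j < len(pairs) and pairs[j][0] == pairs[i][0]:
            if pairs[j][1]:
                tp += 1
            else:
                fp += 1
            j += 1
        points.append((fp / n_neg, tp / n_pos))
        i = j
    return points


def partial_auc(scores, labels, max_fpr):
    """Trapezoidal area under the ROC curve for FPR in [0, max_fpr]."""
    area = 0.0
    pts = roc_points(scores, labels)
    for (x0, y0), (x1, y1) in zip(pts, pts[1:]):
        if x0 >= max_fpr:
            break
        if x1 > max_fpr:
            # Interpolate the TPR at the boundary and clip the segment there.
            y1 = y0 + (y1 - y0) * (max_fpr - x0) / (x1 - x0)
            x1 = max_fpr
        area += (x1 - x0) * (y0 + y1) / 2
    return area


# Toy predictions: label 1 marks an essential gene (hypothetical data).
scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2]
labels = [1, 1, 0, 1, 0, 1, 0, 0]
print(partial_auc(scores, labels, max_fpr=0.10))
print(partial_auc(scores, labels, max_fpr=1.0))  # full AUC
```

The unnormalized partial AUC can never exceed `max_fpr`, so values for different intervals are not directly comparable without rescaling.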

In vivo Evaluation of Flow Estimation Methods for 3D Color Doppler Imaging

  • Yoo, Yang-Mo
    • Journal of Biomedical Engineering Research / v.31 no.3 / pp.177-186 / 2010
  • In 3D ultrasound color Doppler imaging (CDI), 8-16 pulse transmissions (ensembles) per scanline are used for effective clutter rejection and flow estimation, which yields a low volume acquisition rate. In this paper, we evaluate three flow estimation methods for a small ensemble size (E=4): autoregression (AR), eigendecomposition (ED), and autocorrelation combined with adaptive clutter rejection (AC-ACR). The performance of the AR, ED, and AC-ACR methods was compared using 2D and 3D in vivo data acquired under different clutter conditions (common carotid artery, kidney, and liver). To evaluate the effectiveness of the three methods, receiver operating characteristic (ROC) curves were generated. For 2D kidney in vivo data, the AC-ACR method outperforms the AR and ED methods in terms of the area under the ROC curve (AUC) (0.852 vs. 0.793 and 0.813, respectively). Similarly, the AC-ACR method shows higher AUC values for 2D liver in vivo data than the AR and ED methods (0.855 vs. 0.807 and 0.823, respectively). For the common carotid artery data, the AR method provides higher AUC values, but it suffers from biased estimates. For 3D in vivo data acquired from a kidney transplant patient, the AC-ACR method with E=4 provides an AUC of 0.799. These in vivo results indicate that the AC-ACR method can provide more robust flow estimates than the AR and ED methods with a small ensemble size.

Applying a modified AUC to gene ranking

  • Yu, Wenbao;Chang, Yuan-Chin Ivan;Park, Eunsik
    • Communications for Statistical Applications and Methods / v.25 no.3 / pp.307-319 / 2018
  • High-throughput technologies enable the simultaneous evaluation of thousands of genes that could discriminate different subclasses of complex diseases. Ranking genes according to differential expression is an important screening step for follow-up analysis. Many statistical measures have been proposed for this purpose. A good ranked list should provide a stable rank (at least for the top-ranked genes), and the top-ranked genes should have high power in differentiating disease statuses. However, the literature places little emphasis on ranking genes based on these two criteria simultaneously. To satisfy both criteria, we propose applying a previously reported metric, the modified area under the receiver operating characteristic curve, to gene ranking. The proposed ranking method is found to be promising in producing a stable ranking list and good prediction performance for the top-ranked genes. The findings are illustrated through studies on both synthesized data and real microarray gene expression data. The proposed method is recommended for ranking genes or other biomarkers in high-dimensional omics studies.

Evaluation of Clinical Usefulness of Gamma Glutamyl Transferase as a Surrogate Marker for Metabolic Syndrome in Non Obese Adult Men (비만하지 않은 성인 남성에서 대사증후군의 대리 표지자로서 감마 글루타밀 전이효소의 임상적 유용성 평가)

  • Shin, Kyung-A;Kim, Eun Jae
    • Journal of Convergence for Information Technology / v.10 no.12 / pp.146-155 / 2020
  • This study evaluated the usefulness of gamma glutamyl transferase (GGT) as a surrogate marker for predicting metabolic syndrome (MetS). The subjects were 7,155 non-obese men over the age of 20. MetS was diagnosed according to the National Cholesterol Education Program - Third Adult Treatment Panel (NCEP-ATP III) criteria. The risk of developing MetS according to GGT was assessed by logistic regression analysis, and the receiver operating characteristic (ROC) curve was obtained to confirm the ability of GGT to predict the risk of MetS. Regardless of age and body mass index, the risk of MetS onset was 7.09 times higher in the fourth quartile than in the first quartile of GGT (p<0.001). The area under the curve (AUC) of GGT for the diagnosis of MetS was 0.715; at a GGT cutoff value of 40.0 U/L, the sensitivity was 65.0% and the specificity was 70.2%. Therefore, GGT is considered a useful diagnostic index for MetS.