• Title/Summary/Keyword: prediction score

Search Result 494, Processing Time 0.033 seconds

Prediction Intervals for Nonlinear Time Series Models Using the Bootstrap Method (붓스트랩을 이용한 비선형 시계열 모형의 예측구간)

  • 이성덕;김주성
    • The Korean Journal of Applied Statistics
    • /
    • v.17 no.2
    • /
    • pp.219-228
    • /
    • 2004
  • In this paper we construct prediction intervals for nonlinear time series models using the bootstrap. We compare these prediction intervals to traditional asymptotic prediction intervals using quasi-score estimation function and M-quasi-score estimating function comprising bounded functions. Simulation results show that the bootstrap method leads to improved accuracy. The accuracy of the bootstrap is empirically demonstrated with the consumer price index.

Analysis of Dietary Factors of Chronic Disease Using a Neural Network (신경망을 이용한 만성질병에 영향을 미치는 식이요인 분석연구)

  • 이심열;백희영;유송민
    • Korean Journal of Community Nutrition
    • /
    • v.4 no.3
    • /
    • pp.421-430
    • /
    • 1999
  • A neural network system was applied in order to analyze the nutritional and other factors influencing chronic diseases. Five different nutrition evaluation methods including SD Score, %RDA, NAR INQ and %RDA-SD Score were utilized to facilitate nutrient data for the system. Observing top three chronic disease prediction ratio, WHR using SD Score was the most frequently quoted factor revealing the highest predication rate as 62.0%. Other high prediction rates using other data processing methods are as follows. Prediction rate with %RDA, NAR, INQ and %RDA-SD Score were 58.5%(diabetes), 53.5%(hyperlipidemia), 51.6%(diabetes), and 58.0%(diabetes)respectively. Higher prediction rate was observed using either NAR or INQ for obesity as 51.7% and 50.9% compared to the previous result using SD Score. After reviewing appearance rate for all chronic disease and for various data processing method used, it was found that iron and vitamin C were the most frequently cited factors resulting in high prediction rate.

  • PDF

The Accuracy of Prediction Models in Burn Patients (화상환자에서 사망예측모델의 성능 평가에 관한 연구)

  • Woo, Jaeyeon;Kym, Dohern
    • Journal of the Korean Burn Society
    • /
    • v.24 no.1
    • /
    • pp.1-6
    • /
    • 2021
  • Purpose: The purpose of this study was to evaluate the accuracy of four prediction models in adult burn patients. Methods: This retrospective study was conducted on 696 adult burn patients who were treated at burn intensive care unit (BICU) of Hallym University Hangang Sacred Heart Hospital from January 2017 to December 2019. The models are ABSI, APACHE IV, rBaux and Hangang score. Results: The discrimination of each prediction model was analyzed as AUC of ROC curve. AUC value was the highest with Hangang score of 0.931 (0.908~0.954), followed by rBaux 0.896 (0.867~0.924), ABSI 0.883 (0.853~0.913) and APACHE IV 0.851 (0.818~0.884). Conclusion: The results of evaluating the accuracy of the four models, Hangang score showed the highest prediction. But it is necessary to apply the appropriate prediction model according to characteristics of the burn center.

Artificial Neural Network Supported Prediction of Magnetic Properties of Bulk Metallic Glasses (인공신경망을 이용한 벌크 비정질 합금 소재의 포화자속밀도 예측 성능평가)

  • Chunghee Nam
    • Korean Journal of Materials Research
    • /
    • v.33 no.7
    • /
    • pp.273-278
    • /
    • 2023
  • In this study, based on the saturation magnetic flux density experimental values (Bs) of 622 Fe-based bulk metallic glasses (BMGs), regression models were applied to predict Bs using artificial neural networks (ANN), and prediction performance was evaluated. Model performance evaluation was investigated by using the F1 score together with the coefficient of determination (R2 score), which is mainly used in regression models. The coefficient of determination can be used as a performance indicator, since it shows the predicted results of the saturation magnetic flux density of full material datasets in a balanced way. However, the BMG alloy contains iron and requires a high saturation magnetic flux density to have excellent applicability as a soft magnetic material, and in this study F1 score was used as a performance indicator to better predict Bs above the threshold value of Bs (1.4 T). After obtaining two ANN models optimized for the R2 and F1 score conditions, respectively, their prediction performance was compared for the test data. As a case study to evaluate the prediction performance, new Fe-based BMG datasets that were not included in the training and test datasets were predicted using the two ANN models. The results showed that the model with an excellent F1 score achieved a more accurate prediction for a material with a high saturation magnetic flux density.

Selecting Optimal Algorithms for Stroke Prediction: Machine Learning-Based Approach

  • Kyung Tae CHOI;Kyung-A KIM;Myung-Ae CHUNG;Min Soo KANG
    • Korean Journal of Artificial Intelligence
    • /
    • v.12 no.2
    • /
    • pp.1-7
    • /
    • 2024
  • In this paper, we compare three models (logistic regression, Random Forest, and XGBoost) for predicting stroke occurrence using data from the Korea National Health and Nutrition Examination Survey (KNHANES). We evaluated these models using various metrics, focusing mainly on recall and F1 score to assess their performance. Initially, the logistic regression model showed a satisfactory recall score among the three models; however, it was excluded from further consideration because it did not meet the F1 score threshold, which was set at a minimum of 0.5. The F1 score is crucial as it considers both precision and recall, providing a balanced measure of a model's accuracy. Among the models that met the criteria, XGBoost showed the highest recall rate and showed excellent performance in stroke prediction. In particular, XGBoost shows strong performance not only in recall, but also in F1 score and AUC, so it should be considered the optimal algorithm for predicting stroke occurrence. This study determines that the performance of XGBoost is optimal in the field of stroke prediction.

A new method to predict the protein sequence alignment quality (단백질 서열정렬 정확도 예측을 위한 새로운 방법)

  • Lee, Min-Ho;Jeong, Chan-Seok;Kim, Dong-Seop
    • Bioinformatics and Biosystems
    • /
    • v.1 no.1
    • /
    • pp.82-87
    • /
    • 2006
  • The most popular protein structure prediction method is comparative modeling. To guarantee accurate comparative modeling, the sequence alignment between a query protein and a template should be accurate. Although choosing the best template based on the protein sequence alignments is most critical to perform more accurate fold-recognition in comparative modeling, even more critical is the sequence alignment quality. Contrast to a lot of attention to developing a method for choosing the best template, prediction of alignment accuracy has not gained much interest. Here, we develop a method for prediction of the shift score, a recently proposed measure for alignment quality. We apply support vector regression (SVR) to predict shift score. The alignment between a query protein and a template protein of length n in our own library is transformed into an input vector of length n +2. Structural alignments are assumed to be the best alignment, and SVR is trained to predict the shift score between structural alignment and profile-profile alignment of a query protein to a template protein. The performance is assessed by Pearson correlation coefficient. The trained SVR predicts shift score with the correlation between observed and predicted shift score of 0.80.

  • PDF

Developing a Pedestrian Satisfaction Prediction Model Based on Machine Learning Algorithms (기계학습 알고리즘을 이용한 보행만족도 예측모형 개발)

  • Lee, Jae Seung;Lee, Hyunhee
    • Journal of Korea Planning Association
    • /
    • v.54 no.3
    • /
    • pp.106-118
    • /
    • 2019
  • In order to develop pedestrian navigation service that provides optimal pedestrian routes based on pedestrian satisfaction levels, it is required to develop a prediction model that can estimate a pedestrian's satisfaction level given a certain condition. Thus, the aim of the present study is to develop a pedestrian satisfaction prediction model based on three machine learning algorithms: Logistic Regression, Random Forest, and Artificial Neural Network models. The 2009, 2012, 2013, 2014, and 2015 Pedestrian Satisfaction Survey Data in Seoul, Korea are used to train and test the machine learning models. As a result, the Random Forest model shows the best prediction performance among the three (Accuracy: 0.798, Recall: 0.906, Precision: 0.842, F1 Score: 0.873, AUC: 0.795). The performance of Artificial Neural Network is the second (Accuracy: 0.773, Recall: 0.917, Precision: 0.811, F1 Score: 0.868, AUC: 0.738) and Logistic Regression model's performance follows the second (Accuracy: 0.764, Recall: 1.000, Precision: 0.764, F1 Score: 0.868, AUC: 0.575). The precision score of the Random Forest model implies that approximately 84.2% of pedestrians may be satisfied if they walk the areas, suggested by the Random Forest model.

Development of the Score Table for Prediction of Landslide Hazard - A Case Study of Gyeongsangbuk-Do Province - (산사태 발생위험 예측을 위한 판정기준표의 작성 -경상북도 지역을 중심으로-)

  • Jung, Kyu-Won;Park, Sang-Jun;Lee, Chang-Woo
    • Journal of Korean Society of Forest Science
    • /
    • v.97 no.3
    • /
    • pp.332-339
    • /
    • 2008
  • This study was carried out to develop the score table for prediction of landslide hazard in Gyeongsangbuk-Do province. It was studied to 172 places landslided in 23 cities and counties of Gyeongsangbuk-Do province. An analyze of the score table for landslide hazard was carried out through the multiple statistics of quantification method (I) by the computer. Factors effected to landslide occurrence quantity were shown in order of slope position, slope length, bedrock, aspect, forest age, slope form and slope. As results of the development of score table for prediction of landslide hazard in Gyeongsangbuk-Do province, total score range was divided that 107 under is stable area (IV class), 107~176 is area with little susceptibility to landslide (III class), 177~246 is area with moderate susceptibility to landslide (II class), above 247 area with severe susceptibility to landslide (I class).

CT Angiography-Derived RECHARGE Score Predicts Successful Percutaneous Coronary Intervention in Patients with Chronic Total Occlusion

  • Jiahui Li;Rui Wang;Christian Tesche;U. Joseph Schoepf;Jonathan T. Pannell;Yi He;Rongchong Huang;Yalei Chen;Jianan Li;Xiantao Song
    • Korean Journal of Radiology
    • /
    • v.22 no.5
    • /
    • pp.697-705
    • /
    • 2021
  • Objective: To investigate the feasibility and the accuracy of the coronary CT angiography (CCTA)-derived Registry of Crossboss and Hybrid procedures in France, the Netherlands, Belgium and United Kingdom (RECHARGE) score (RECHARGECCTA) for the prediction of procedural success and 30-minutes guidewire crossing in percutaneous coronary intervention (PCI) for chronic total occlusion (CTO). Materials and Methods: One hundred and twenty-four consecutive patients (mean age, 54 years; 79% male) with 131 CTO lesions who underwent CCTA before catheter angiography (CA) with CTO-PCI were retrospectively enrolled in this study. The RECHARGECCTA scores were calculated and compared with RECHARGECA and other CTA-based prediction scores, including Multicenter CTO Registry of Japan (J-CTO), CT Registry of CTO Revascularisation (CT-RECTOR), and Korean Multicenter CTO CT Registry (KCCT) scores. Results: The procedural success rate of the CTO-PCI procedures was 72%, and 61% of cases achieved the 30-minutes wire crossing. No significant difference was observed between the RECHARGECCTA score and the RECHARGECA score for procedural success (median 2 vs. median 2, p = 0.084). However, the RECHARGECCTA score was higher than the RECHARGECA score for the 30-minutes wire crossing (median 2 vs. median 1.5, p = 0.001). The areas under the curve (AUCs) of the RECHARGECCTA and RECHARGECA scores for predicting procedural success showed no statistical significance (0.718 vs. 0.757, p = 0.655). The sensitivity, specificity, positive predictive value, and the negative predictive value of the RECHARGECCTA scores of ≤ 2 for predictive procedural success were 78%, 60%, 43%, and 87%, respectively. The RECHARGECCTA score showed a discriminative performance that was comparable to those of the other CTA-based prediction scores (AUC = 0.718 vs. 0.665-0.717, all p > 0.05). Conclusion: The non-invasive RECHARGECCTA score performs better than the invasive determination for the prediction of the 30-minutes wire crossing of CTO-PCI. However, the RECHARGECCTA score may not replace other CTA-based prediction scores for predicting CTO-PCI success.

Evaluation of Predictability of Global/Regional Integrated Model System (GRIMs) for the Winter Precipitation Systems over Korea (한반도 겨울철 강수 유형에 따른 전지구 수치모델(GRIMs) 예측성능 검증)

  • Yeon, Sang-Hoon;Suh, Myoung-Suk;Lee, Juwon;Lee, Eun-Hee
    • Atmosphere
    • /
    • v.32 no.4
    • /
    • pp.353-365
    • /
    • 2022
  • This paper evaluates precipitation forecast skill of Global/Regional Integrated Model system (GRIMs) over South Korea in a boreal winter from December 2013 to February 2014. Three types of precipitation are classified based on development mechanism: 1) convection type (C type), 2) low pressure type (L type), and 3) orographic type (O type), in which their frequencies are 44.4%, 25.0%, and 30.6%, respectively. It appears that the model significantly overestimates precipitation occurrence (0.1 mm d-1) for all types of winter precipitation. Objective measured skill scores of GRIMs are comparably high for L type and O type. Except for precipitation occurrence, the model shows high predictability for L type precipitation with the most unbiased prediction. It is noted that Equitable Threat Score (ETS) is inappropriate for measuring rare events due to its high dependency on the sample size, as in the case of Critical Success Index as well. The Symmetric Extreme Dependency Score (SEDS) demonstrates less sensitivity on the number of samples. Thus, SEDS is used for the evaluation of prediction skill to supplement the limit of ETS. The evaluation via SEDS shows that the prediction skill score for L type is the highest in the range of 5.0, 10.0 mm d-1 and the score for O type is the highest in the range of 1.0, 20.0 mm d-1. C type has the lowest scores in overall range. The difference in precipitation forecast skill by precipitation type can be explained by the spatial distribution and intensity of precipitation in each representative case.