• Title/Summary/Keyword: Model validation

Search Result 3,212, Processing Time 0.032 seconds

Predicting blast-induced ground vibrations at limestone quarry from artificial neural network optimized by randomized and grid search cross-validation, and comparative analyses with blast vibration predictor models

  • Salman Ihsan;Shahab Saqib;Hafiz Muhammad Awais Rashid;Fawad S. Niazi;Mohsin Usman Qureshi
    • Geomechanics and Engineering
    • /
    • v.35 no.2
    • /
    • pp.121-133
    • /
    • 2023
  • The demand for cement and limestone crushed materials has increased many folds due to the tremendous increase in construction activities in Pakistan during the past few decades. The number of cement production industries has increased correspondingly, and so the rock-blasting operations at the limestone quarry sites. However, the safety procedures warranted at these sites for the blast-induced ground vibrations (BIGV) have not been adequately developed and/or implemented. Proper prediction and monitoring of BIGV are necessary to ensure the safety of structures in the vicinity of these quarry sites. In this paper, an attempt has been made to predict BIGV using artificial neural network (ANN) at three selected limestone quarries of Pakistan. The ANN has been developed in Python using Keras with sequential model and dense layers. The hyper parameters and neurons in each of the activation layers has been optimized using randomized and grid search method. The input parameters for the model include distance, a maximum charge per delay (MCPD), depth of hole, burden, spacing, and number of blast holes, whereas, peak particle velocity (PPV) is taken as the only output parameter. A total of 110 blast vibrations datasets were recorded from three different limestone quarries. The dataset has been divided into 85% for neural network training, and 15% for testing of the network. A five-layer ANN is trained with Rectified Linear Unit (ReLU) activation function, Adam optimization algorithm with a learning rate of 0.001, and batch size of 32 with the topology of 6-32-32-256-1. The blast datasets were utilized to compare the performance of ANN, multivariate regression analysis (MVRA), and empirical predictors. The performance was evaluated using the coefficient of determination (R2), mean absolute error (MAE), mean squared error (MSE), mean absolute percentage error (MAPE), and root mean squared error (RMSE)for predicted and measured PPV. To determine the relative influence of each parameter on the PPV, sensitivity analyses were performed for all input parameters. The analyses reveal that ANN performs superior than MVRA and other empirical predictors, andthat83% PPV is affected by distance and MCPD while hole depth, number of blast holes, burden and spacing contribute for the remaining 17%. This research provides valuable insights into improving safety measures and ensuring the structural integrity of buildings near limestone quarry sites.

Development of the Cloud Monitoring Program using Machine Learning-based Python Module from the MAAO All-sky Camera Images (기계학습 기반의 파이썬 모듈을 이용한 밀양아리랑우주천문대 전천 영상의 운량 모니터링 프로그램 개발)

  • Gu Lim;Dohyeong Kim;Donghyun Kim;Keun-Hong Park
    • Journal of the Korean earth science society
    • /
    • v.45 no.2
    • /
    • pp.111-120
    • /
    • 2024
  • Cloud coverage is a key factor in determining whether to proceed with observations. In the past, human judgment played an important role in weather evaluation for observations. However, the development of remote and robotic observation has diminished the role of human judgment. Moreover, it is not easy to evaluate weather conditions automatically because of the diverse cloud shapes and their rapid movement. In this paper, we present the development of a cloud monitoring program by applying a machine learning-based Python module "cloudynight" on all-sky camera images obtained at Miryang Arirang Astronomical Observatory (MAAO). The machine learning model was built by training 39,996 subregions divided from 1,212 images with altitude/azimuth angles and extracting 16 feature spaces. For our training model, the F1-score from the validation samples was 0.97, indicating good performance in identifying clouds in the all-sky image. As a result, this program calculates "Cloudiness" as the ratio of the number of total subregions to the number of subregions predicted to be covered by clouds. In the robotic observation, we set a policy that allows the telescope system to halt the observation when the "Cloudiness" exceeds 0.6 during the last 30 minutes. Following this policy, we found that there were no improper halts in the telescope system due to incorrect program decisions. We expect that robotic observation with the 0.7 m telescope at MAAO can be successfully operated using the cloud monitoring program.

Quality of Radiomics Research on Brain Metastasis: A Roadmap to Promote Clinical Translation

  • Chae Jung Park;Yae Won Park;Sung Soo Ahn;Dain Kim;Eui Hyun Kim;Seok-Gu Kang;Jong Hee Chang;Se Hoon Kim;Seung-Koo Lee
    • Korean Journal of Radiology
    • /
    • v.23 no.1
    • /
    • pp.77-88
    • /
    • 2022
  • Objective: Our study aimed to evaluate the quality of radiomics studies on brain metastases based on the radiomics quality score (RQS), Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) checklist, and the Image Biomarker Standardization Initiative (IBSI) guidelines. Materials and Methods: PubMed MEDLINE, and EMBASE were searched for articles on radiomics for evaluating brain metastases, published until February 2021. Of the 572 articles, 29 relevant original research articles were included and evaluated according to the RQS, TRIPOD checklist, and IBSI guidelines. Results: External validation was performed in only three studies (10.3%). The median RQS was 3.0 (range, -6 to 12), with a low basic adherence rate of 50.0%. The adherence rate was low in comparison to the "gold standard" (10.3%), stating the potential clinical utility (10.3%), performing the cut-off analysis (3.4%), reporting calibration statistics (6.9%), and providing open science and data (3.4%). None of the studies involved test-retest or phantom studies, prospective studies, or cost-effectiveness analyses. The overall rate of adherence to the TRIPOD checklist was 60.3% and low for reporting title (3.4%), blind assessment of outcome (0%), description of the handling of missing data (0%), and presentation of the full prediction model (0%). The majority of studies lacked pre-processing steps, with bias-field correction, isovoxel resampling, skull stripping, and gray-level discretization performed in only six (20.7%), nine (31.0%), four (3.8%), and four (13.8%) studies, respectively. Conclusion: The overall scientific and reporting quality of radiomics studies on brain metastases published during the study period was insufficient. Radiomics studies should adhere to the RQS, TRIPOD, and IBSI guidelines to facilitate the translation of radiomics into the clinical field.

Development and Validation of MRI-Based Radiomics Models for Diagnosing Juvenile Myoclonic Epilepsy

  • Kyung Min Kim;Heewon Hwang;Beomseok Sohn;Kisung Park;Kyunghwa Han;Sung Soo Ahn;Wonwoo Lee;Min Kyung Chu;Kyoung Heo;Seung-Koo Lee
    • Korean Journal of Radiology
    • /
    • v.23 no.12
    • /
    • pp.1281-1289
    • /
    • 2022
  • Objective: Radiomic modeling using multiple regions of interest in MRI of the brain to diagnose juvenile myoclonic epilepsy (JME) has not yet been investigated. This study aimed to develop and validate radiomics prediction models to distinguish patients with JME from healthy controls (HCs), and to evaluate the feasibility of a radiomics approach using MRI for diagnosing JME. Materials and Methods: A total of 97 JME patients (25.6 ± 8.5 years; female, 45.5%) and 32 HCs (28.9 ± 11.4 years; female, 50.0%) were randomly split (7:3 ratio) into a training (n = 90) and a test set (n = 39) group. Radiomic features were extracted from 22 regions of interest in the brain using the T1-weighted MRI based on clinical evidence. Predictive models were trained using seven modeling methods, including a light gradient boosting machine, support vector classifier, random forest, logistic regression, extreme gradient boosting, gradient boosting machine, and decision tree, with radiomics features in the training set. The performance of the models was validated and compared to the test set. The model with the highest area under the receiver operating curve (AUROC) was chosen, and important features in the model were identified. Results: The seven tested radiomics models, including light gradient boosting machine, support vector classifier, random forest, logistic regression, extreme gradient boosting, gradient boosting machine, and decision tree, showed AUROC values of 0.817, 0.807, 0.783, 0.779, 0.767, 0.762, and 0.672, respectively. The light gradient boosting machine with the highest AUROC, albeit without statistically significant differences from the other models in pairwise comparisons, had accuracy, precision, recall, and F1 scores of 0.795, 0.818, 0.931, and 0.871, respectively. Radiomic features, including the putamen and ventral diencephalon, were ranked as the most important for suggesting JME. Conclusion: Radiomic models using MRI were able to differentiate JME from HCs.

Study of the Derive of Core Habitats for Kirengeshoma koreana Nakai Using HSI and MaxEnt (HSI와 MaxEnt를 통한 나도승마 핵심서식지 발굴 연구)

  • Sun-Ryoung Kim;Rae-Ha Jang;Jae-Hwa Tho;Min-Han Kim;Seung-Woon Choi;Young-Jun Yoon
    • Korean Journal of Environment and Ecology
    • /
    • v.37 no.6
    • /
    • pp.450-463
    • /
    • 2023
  • The objective of this study is to derive the core habitat of the Kirengeshoma koreana Nakai utilizing Habitat Suitability Index (HSI) and Maximum Entropy (MaxEnt) models. Expert-based models have been criticized for their subjective criteria, while statistical models face difficulties in on-site validation and integration of expert opinions. To address these limitations, both models were employed, and their outcomes were overlaid to derive the core habitat. Five variables were identified through a comprehensive literature review and spatial analysis based on appearance coordinates. The environmental variables encompass vegetation zone, forest type, crown density, annual precipitation, and effective soil depth. Through surveys involving six experts, importance rankings and SI (Suitability Index) scores were established for each variable, subsequently facilitating the creation of an HSI map. Using the same variables, the MaxEnt model was also executed, resulting in a corresponding map, which was merged to construct the definitive core habitat map. Out of 16 observed locations of K. koreana, 15 were situated within the identified core habitat. Furthermore, an area historically known to host K. koreana but not verified in the present, Mt. Yeongchwi, was found to lack a core habitat. These findings suggest that the developed models exhibit a high degree of accuracy and effectively reflect the current ecological landscape.

Reactive and Proactive Aggression, the Validation of the Reactive-Proactive Questionnaire (RPQ): Focusing on ESEM and Rasch (반응적 공격성과 주도적 공격성, Reactive-Proactive Questionnaire(RPQ) 타당화 연구: ESEM과 Rasch를 중심으로)

  • Seonyoung Park;Jonghan Sea
    • Korean Journal of Culture and Social Issue
    • /
    • v.30 no.2
    • /
    • pp.159-192
    • /
    • 2024
  • The purpose of this study is to validate the Reactive-Proactive Aggression Questionnaire (RPQ), a tool for measuring reactive-proactive aggression, in the context of South Korea. A thorough translation was conducted in collaboration with the original author. An exploratory factor analysis (EFA), exploratory structural equation modeling (ESEM), rating scale model (Rasch), differential item functioning (DIF), and convergent validity were performed on a sample of 510 South Korean individuals. The results revealed a two-factor structure of reactive and proactive aggression after removing one item showing dual loading. Rating scale analysis based on the Rasch model indicated the appropriateness of the 3-point Likert scale, with all items meeting fit criteria. Although the separation index and separation reliability of proactive aggression was marginally lower, the overall discrimination between participants and items was satisfactory. Examination of participant-item distribution indicated a suitable alignment between reactive aggression and participant ability levels, whereas proactive aggression exhibited slightly elevated item difficulty. Furthermore, three items were found to function differently based on gender. A moderate but statistically significant positive correlation was found between the Barratt Impulsiveness Scale-11-R (Korean version) and RPQ from the results of the convergent validity evaluation. Overall, this study employed rigorous statistical methods to validate the suitability of the RPQ for use in Korea, taking cultural nuances into account, and introduced the concepts of reactive and proactive aggression to the Korean general population.

Investigating the Performance of Bayesian-based Feature Selection and Classification Approach to Social Media Sentiment Analysis (소셜미디어 감성분석을 위한 베이지안 속성 선택과 분류에 대한 연구)

  • Chang Min Kang;Kyun Sun Eo;Kun Chang Lee
    • Information Systems Review
    • /
    • v.24 no.1
    • /
    • pp.1-19
    • /
    • 2022
  • Social media-based communication has become crucial part of our personal and official lives. Therefore, it is no surprise that social media sentiment analysis has emerged an important way of detecting potential customers' sentiment trends for all kinds of companies. However, social media sentiment analysis suffers from huge number of sentiment features obtained in the process of conducting the sentiment analysis. In this sense, this study proposes a novel method by using Bayesian Network. In this model MBFS (Markov Blanket-based Feature Selection) is used to reduce the number of sentiment features. To show the validity of our proposed model, we utilized online review data from Yelp, a famous social media about restaurant, bars, beauty salons evaluation and recommendation. We used a number of benchmarking feature selection methods like correlation-based feature selection, information gain, and gain ratio. A number of machine learning classifiers were also used for our validation tasks, like TAN, NBN, Sons & Spouses BN (Bayesian Network), Augmented Markov Blanket. Furthermore, we conducted Bayesian Network-based what-if analysis to see how the knowledge map between target node and related explanatory nodes could yield meaningful glimpse into what is going on in sentiments underlying the target dataset.

Impacts assessment of Climate changes in North Korea based on RCP climate change scenarios II. Impacts assessment of hydrologic cycle changes in Yalu River (RCP 기후변화시나리오를 이용한 미래 북한지역의 수문순환 변화 영향 평가 II. 압록강유역의 미래 수문순환 변화 영향 평가)

  • Jeung, Se Jin;Kang, Dong Ho;Kim, Byung Sik
    • Journal of Wetlands Research
    • /
    • v.21 no.spc
    • /
    • pp.39-50
    • /
    • 2019
  • This study aims to assess the influence of climate change on the hydrological cycle at a basin level in North Korea. The selected model for this study is MRI-CGCM 3, the one used for the Coupled Model Intercomparison Project Phase 5 (CMIP5). Moreover, this study adopted the Spatial Disaggregation-Quantile Delta Mapping (SDQDM), which is one of the stochastic downscaling techniques, to conduct the bias correction for climate change scenarios. The comparison between the preapplication and postapplication of the SDQDM supported the study's review on the technique's validity. In addition, as this study determined the influence of climate change on the hydrological cycle, it also observed the runoff in North Korea. In predicting such influence, parameters of a runoff model used for the analysis should be optimized. However, North Korea is classified as an ungauged region for its political characteristics, and it was difficult to collect the country's runoff observation data. Hence, the study selected 16 basins with secured high-quality runoff data, and the M-RAT model's optimized parameters were calculated. The study also analyzed the correlation among variables for basin characteristics to consider multicollinearity. Then, based on a phased regression analysis, the study developed an equation to calculate parameters for ungauged basin areas. To verify the equation, the study assumed the Osipcheon River, Namdaecheon Stream, Yongdang Reservoir, and Yonggang Stream as ungauged basin areas and conducted cross-validation. As a result, for all the four basin areas, high efficiency was confirmed with the efficiency coefficients of 0.8 or higher. The study used climate change scenarios and parameters of the estimated runoff model to assess the changes in hydrological cycle processes at a basin level from climate change in the Amnokgang River of North Korea. The results showed that climate change would lead to an increase in precipitation, and the corresponding rise in temperature is predicted to cause elevating evapotranspiration. However, it was found that the storage capacity in the basin decreased. The result of the analysis on flow duration indicated a decrease in flow on the 95th day; an increase in the drought flow during the periods of Future 1 and Future 2; and an increase in both flows for the period of Future 3.

Temperature-dependent Development of Pseudococcus comstocki(Homoptera: Pseudococcidae) and Its Stage Transition Models (가루깍지벌레(Pseudococcus comstocki Kuwana)의 온도별 발육기간 및 발육단계 전이 모형)

  • 전흥용;김동순;조명래;장영덕;임명순
    • Korean journal of applied entomology
    • /
    • v.42 no.1
    • /
    • pp.43-51
    • /
    • 2003
  • This study was carried out to develop the forecasting model of Pseudococcus comtocki Kuwana for timing spray. Field phonology and temperature-dependent development of p. comstocki were studied, and its stage transition models were developed. p comstocki occurred three generations a year in Suwon. The 1 st adults occurred during mid to late June, and the 2nd adults were abundant during mid to late August. The 3rd adults were observed after late October. The development times of each instar of p. comstocki decreased with increasing temperature up to 25$^{\circ}C$, and thereafter the development times increased. The estimated low-threshold temperatures were 14.5, 8.4, 10.2, 11.8, and 10.1$^{\circ}C$ for eggs, 1st+2nd nymphs, 3rd nymphs, preoviposition, and 1st nymphs to preoviposition, respectively. The degree-days (thermal constants) for completion of each instar development were 105 DD for egg,315 DD for 1st+2nd nymph, 143 DD for 3rd nymph, 143 DD for preoviposition, and 599 DD for 1 st nymph to preoviposition. The stage transition models of p. comstocki, which simulate the proportion of individuals shifted from a stage to the next stage, were constructed using the modified Sharpe and DeMichele model and the Weibull function. In field validation, degree-day models using mean-minus-base, sine wave, and rectangle method showed 2-3d, 1-7d, and 0-6 d deviation with actual data in predicting the peak oviposition time of the 1st and 2nd generation adults, respectively. The rate summation model, in which daily development rates estimated by biophysical model of Sharpe and DeMichele were accumulated, showed 1-2 d deviation with actual data at the same phonology predictions.

Development of Deep-Learning-Based Models for Predicting Groundwater Levels in the Middle-Jeju Watershed, Jeju Island (딥러닝 기법을 이용한 제주도 중제주수역 지하수위 예측 모델개발)

  • Park, Jaesung;Jeong, Jiho;Jeong, Jina;Kim, Ki-Hong;Shin, Jaehyeon;Lee, Dongyeop;Jeong, Saebom
    • The Journal of Engineering Geology
    • /
    • v.32 no.4
    • /
    • pp.697-723
    • /
    • 2022
  • Data-driven models to predict groundwater levels 30 days in advance were developed for 12 groundwater monitoring stations in the middle-Jeju watershed, Jeju Island. Stacked long short-term memory (stacked-LSTM), a deep learning technique suitable for time series forecasting, was used for model development. Daily time series data from 2001 to 2022 for precipitation, groundwater usage amount, and groundwater level were considered. Various models were proposed that used different combinations of the input data types and varying lengths of previous time series data for each input variable. A general procedure for deep-learning-based model development is suggested based on consideration of the comparative validation results of the tested models. A model using precipitation, groundwater usage amount, and previous groundwater level data as input variables outperformed any model neglecting one or more of these data categories. Using extended sequences of these past data improved the predictions, possibly owing to the long delay time between precipitation and groundwater recharge, which results from the deep groundwater level in Jeju Island. However, limiting the range of considered groundwater usage data that significantly affected the groundwater level fluctuation (rather than using all the groundwater usage data) improved the performance of the predictive model. The developed models can predict the future groundwater level based on the current amount of precipitation and groundwater use. Therefore, the models provide information on the soundness of the aquifer system, which will help to prepare management plans to maintain appropriate groundwater quantities.