• Title/Summary/Keyword: Validation Set

Search Result 679, Processing Time 0.024 seconds

QSPR Study of the Absorption Maxima of Azobenzene Dyes

  • Xu, Jie;Wang, Lei;Liu, Li;Bai, Zikui;Wang, Luoxin
    • Bulletin of the Korean Chemical Society
    • /
    • v.32 no.11
    • /
    • pp.3865-3872
    • /
    • 2011
  • A quantitative structure-property relationship (QSPR) study was performed for the prediction of the absorption maxima of azobenzene dyes. The entire set of 191 azobenzenes was divided into a training set of 150 azobenzenes and a test set of 41 azobenzenes according to Kennard and Stones algorithm. A seven-descriptor model, with squared correlation coefficient ($R^2$) of 0.8755 and standard error of estimation (s) of 14.476, was developed by applying stepwise multiple linear regression (MLR) analysis on the training set. The reliability of the proposed model was further illustrated using various evaluation techniques: leave-many-out crossvalidation procedure, randomization tests, and validation through the test set.

Comparison of EKF and UKF on Training the Artificial Neural Network

  • Kim, Dae-Hak
    • Journal of the Korean Data and Information Science Society
    • /
    • v.15 no.2
    • /
    • pp.499-506
    • /
    • 2004
  • The Unscented Kalman Filter is known to outperform the Extended Kalman Filter for the nonlinear state estimation with a significance advantage that it does not require the computation of Jacobian but EKF has a competitive advantage to the UKF on the performance time. We compare both algorithms on training the artificial neural network. The validation data set is used to estimate parameters which are supposed to result in better fitting for the test data set. Experimental results are presented which indicate the performance of both algorithms.

  • PDF

Development and Validation of a Model Using Radiomics Features from an Apparent Diffusion Coefficient Map to Diagnose Local Tumor Recurrence in Patients Treated for Head and Neck Squamous Cell Carcinoma

  • Minjae Kim;Jeong Hyun Lee;Leehi Joo;Boryeong Jeong;Seonok Kim;Sungwon Ham;Jihye Yun;NamKug Kim;Sae Rom Chung;Young Jun Choi;Jung Hwan Baek;Ji Ye Lee;Ji-hoon Kim
    • Korean Journal of Radiology
    • /
    • v.23 no.11
    • /
    • pp.1078-1088
    • /
    • 2022
  • Objective: To develop and validate a model using radiomics features from apparent diffusion coefficient (ADC) map to diagnose local tumor recurrence in head and neck squamous cell carcinoma (HNSCC). Materials and Methods: This retrospective study included 285 patients (mean age ± standard deviation, 62 ± 12 years; 220 male, 77.2%), including 215 for training (n = 161) and internal validation (n = 54) and 70 others for external validation, with newly developed contrast-enhancing lesions at the primary cancer site on the surveillance MRI following definitive treatment of HNSCC between January 2014 and October 2019. Of the 215 and 70 patients, 127 and 34, respectively, had local tumor recurrence. Radiomics models using radiomics scores were created separately for T2-weighted imaging (T2WI), contrast-enhanced T1-weighted imaging (CE-T1WI), and ADC maps using non-zero coefficients from the least absolute shrinkage and selection operator in the training set. Receiver operating characteristic (ROC) analysis was used to evaluate the diagnostic performance of each radiomics score and known clinical parameter (age, sex, and clinical stage) in the internal and external validation sets. Results: Five radiomics features from T2WI, six from CE-T1WI, and nine from ADC maps were selected and used to develop the respective radiomics models. The area under ROC curve (AUROC) of ADC radiomics score was 0.76 (95% confidence interval [CI], 0.62-0.89) and 0.77 (95% CI, 0.65-0.88) in the internal and external validation sets, respectively. These were significantly higher than the AUROC values of T2WI (0.53 [95% CI, 0.40-0.67], p = 0.006), CE-T1WI (0.53 [95% CI, 0.40-0.67], p = 0.012), and clinical parameters (0.53 [95% CI, 0.39-0.67], p = 0.021) in the external validation set. Conclusion: The radiomics model using ADC maps exhibited higher diagnostic performance than those of the radiomics models using T2WI or CE-T1WI and clinical parameters in the diagnosis of local tumor recurrence in HNSCC following definitive treatment.

The Simulation and Research of Information for Space Craft(Autonomous Spacecraft Health Monitoring/Data Validation Control Systems)

  • Kim, H;Jhonson, R.;Zalewski, D.;Qu, Z.;Durrance, S.T.;Ham, C.
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.2 no.2
    • /
    • pp.81-89
    • /
    • 2001
  • Space systems are operating in a changing and uncertain space environment and are desired to have autonomous capability for long periods of time without frequent telecommunications from the ground station At the same time. requirements for new set of projects/systems calling for ""autonomous"" operations for long unattended periods of time are emerging. Since, by the nature of space systems, it is desired that they perform their mission flawlessly and also it is of extreme importance to have fault-tolerant sensor/actuator sub-systems for the purpose of validating science measurement data for the mission success. Technology innovations attendant on autonomous data validation and health monitoring are articulated for a growing class of autonomous operations of space systems. The greatest need is on focus research effort to the development of a new class of fault-tolerant space systems such as attitude actuators and sensors as well as validation of measurement data from scientific instruments. The characterization for the next step in evolving the existing control processes to an autonomous posture is to embed intelligence into actively control. modify parameters and select sensor/actuator subsystems based on statistical parameters of the measurement errors in real-time. This research focuses on the identification/demonstration of critical technology innovations that will be applied to Autonomous Spacecraft Health Monitoring/Data Validation Control Systems (ASHMDVCS). Systems (ASHMDVCS).

  • PDF

Prediction of Non-Genotoxic Carcinogenicity Based on Genetic Profiles of Short Term Exposure Assays

  • Perez, Luis Orlando;Gonzalez-Jose, Rolando;Garcia, Pilar Peral
    • Toxicological Research
    • /
    • v.32 no.4
    • /
    • pp.289-300
    • /
    • 2016
  • Non-genotoxic carcinogens are substances that induce tumorigenesis by non-mutagenic mechanisms and long term rodent bioassays are required to identify them. Recent studies have shown that transcription profiling can be applied to develop early identifiers for long term phenotypes. In this study, we used rat liver expression profiles from the NTP (National Toxicology Program, Research Triangle Park, USA) DrugMatrix Database to construct a gene classifier that can distinguish between non-genotoxic carcinogens and other chemicals. The model was based on short term exposure assays (3 days) and the training was limited to oxidative stressors, peroxisome proliferators and hormone modulators. Validation of the predictor was performed on independent toxicogenomic data (TG-GATEs, Toxicogenomics Project-Genomics Assisted Toxicity Evaluation System, Osaka, Japan). To build our model we performed Random Forests together with a recursive elimination algorithm (VarSelRF). Gene set enrichment analysis was employed for functional interpretation. A total of 770 microarrays comprising 96 different compounds were analyzed and a predictor of 54 genes was built. Prediction accuracy was 0.85 in the training set, 0.87 in the test set and increased with increasing concentration in the validation set: 0.6 at low dose, 0.7 at medium doses and 0.81 at high doses. Pathway analysis revealed gene prominence of cellular respiration, energy production and lipoprotein metabolism. The biggest target of toxicogenomics is accurately predict the toxicity of unknown drugs. In this analysis, we presented a classifier that can predict non-genotoxic carcinogenicity by using short term exposure assays. In this approach, dose level is critical when evaluating chemicals at early time points.

Accuracy Evaluation of Brain Parenchymal MRI Image Classification Using Inception V3 (Inception V3를 이용한 뇌 실질 MRI 영상 분류의 정확도 평가)

  • Kim, Ji-Yul;Ye, Soo-Young
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.20 no.3
    • /
    • pp.132-137
    • /
    • 2019
  • The amount of data generated from medical images is increasingly exceeding the limits of professional visual analysis, and the need for automated medical image analysis is increasing. For this reason, this study evaluated the classification and accuracy according to the presence or absence of tumor using Inception V3 deep learning model, using MRI medical images showing normal and tumor findings. As a result, the accuracy of the deep learning model was 90% for the training data set and 86% for the validation data set. The loss rate was 0.56 for the training data set and 1.28 for the validation data set. In future studies, it is necessary to secure the data of publicly available medical images to improve the performance of the deep learning model and to ensure the reliability of the evaluation, and to implement modeling by improving the accuracy of labeling through labeling classification.

Predicting the mortality of pneumonia patients visiting the emergency department through machine learning (기계학습모델을 통한 응급실 폐렴환자의 사망예측 모델과 기존 예측 모델의 비교)

  • Bae, Yeol;Moon, Hyung Ki;Kim, Soo Hyun
    • Journal of The Korean Society of Emergency Medicine
    • /
    • v.29 no.5
    • /
    • pp.455-464
    • /
    • 2018
  • Objective: Machine learning is not yet widely used in the medical field. Therefore, this study was conducted to compare the performance of preexisting severity prediction models and machine learning based models (random forest [RF], gradient boosting [GB]) for mortality prediction in pneumonia patients. Methods: We retrospectively collected data from patients who visited the emergency department of a tertiary training hospital in Seoul, Korea from January to March of 2015. The Pneumonia Severity Index (PSI) and Sequential Organ Failure Assessment (SOFA) scores were calculated for both groups and the area under the curve (AUC) for mortality prediction was computed. For the RF and GB models, data were divided into a test set and a validation set by the random split method. The training set was learned in RF and GB models and the AUC was obtained from the validation set. The mean AUC was compared with the other two AUCs. Results: Of the 536 investigated patients, 395 were enrolled and 41 of them died. The AUC values of PSI and SOFA scores were 0.799 (0.737-0.862) and 0.865 (0.811-0.918), respectively. The mean AUC values obtained by the RF and GB models were 0.928 (0.899-0.957) and 0.919 (0.886-0.952), respectively. There were significant differences between preexisting severity prediction models and machine learning based models (P<0.001). Conclusion: Classification through machine learning may help predict the mortality of pneumonia patients visiting the emergency department.

Incremental Model-based Test Suite Reduction with Formal Concept Analysis

  • Ng, Pin;Fung, Richard Y.K.;Kong, Ray W.M.
    • Journal of Information Processing Systems
    • /
    • v.6 no.2
    • /
    • pp.197-208
    • /
    • 2010
  • Test scenarios can be derived based on some system models for requirements validation purposes. Model-based test suite reduction aims to provide a smaller set of test scenarios which can preserve the original test coverage with respect to some testing criteria. We are proposing to apply Formal Concept Analysis (FCA) in analyzing the association between a set of test scenarios and a set of transitions specified in a state machine model. By utilizing the properties of concept lattice, we are able to determine incrementally a minimal set of test scenarios with adequate test coverage.

Analysis of Viscous Free Surface Flow around a Ship by a Level-set Method

  • Park, Il-Ryong;Chun, Ho-Hwan
    • Journal of Ship and Ocean Technology
    • /
    • v.6 no.2
    • /
    • pp.37-50
    • /
    • 2002
  • In the present numerical simulation of viscous free surface flow around a ship, two-fluids in-compressible Reynolds-averaged Navier-Stokes equations with the standard $\textsc{k}-\varepsilon$turbulence model are discretized on a regular grid by using a finite volume method. A local level-set method is introduced for capturing the free surface movement and the influence of the viscous layer and dynamic boundary condition of the free surface are implicitly considered. Partial differential equations in the level-set method are discretized with second order ENO scheme and explicit Euler scheme in the space and time integration, respectively. The computational results for the Series-60 model with $C_B=0.6$ show a good agreement with the experimental data, but more validation studies for commercial complicated hull forms are necessary.

Classification and Regression Tree Analysis for Molecular Descriptor Selection and Binding Affinities Prediction of Imidazobenzodiazepines in Quantitative Structure-Activity Relationship Studies

  • Atabati, Morteza;Zarei, Kobra;Abdinasab, Esmaeil
    • Bulletin of the Korean Chemical Society
    • /
    • v.30 no.11
    • /
    • pp.2717-2722
    • /
    • 2009
  • The use of the classification and regression tree (CART) methodology was studied in a quantitative structure-activity relationship (QSAR) context on a data set consisting of the binding affinities of 39 imidazobenzodiazepines for the α1 benzodiazepine receptor. The 3-D structures of these compounds were optimized using HyperChem software with semiempirical AM1 optimization method. After optimization a set of 1481 zero-to three-dimentional descriptors was calculated for each molecule in the data set. The response (dependent variable) in the tree model consisted of the binding affinities of drugs. Three descriptors (two topological and one 3D-Morse descriptors) were applied in the final tree structure to describe the binding affinities. The mean relative error percent for the data set is 3.20%, compared with a previous model with mean relative error percent of 6.63%. To evaluate the predictive power of CART cross validation method was also performed.