• 제목/요약/키워드: Performance confidence

검색결과 1,081건 처리시간 0.028초

Validation of CT-Based Risk Stratification System for Lymph Node Metastasis in Patients With Thyroid Cancer

  • Yun Hwa Roh;Sae Rom Chung;Jung Hwan Baek;Young Jun Choi;Tae-Yon Sung;Dong Eun Song;Tae Yong Kim;Jeong Hyun Lee
    • Korean Journal of Radiology
    • /
    • 제24권10호
    • /
    • pp.1028-1037
    • /
    • 2023
  • Objective: To evaluate the computed tomography (CT) features for diagnosing metastatic cervical lymph nodes (LNs) in patients with differentiated thyroid cancer (DTC) and validate the CT-based risk stratification system suggested by the Korean Thyroid Imaging Reporting and Data System (K-TIRADS) guidelines. Materials and Methods: A total of 463 LNs from 399 patients with DTC who underwent preoperative CT staging and ultrasound-guided fine-needle aspiration were included. The following CT features for each LN were evaluated: absence of hilum, cystic changes, calcification, strong enhancement, and heterogeneous enhancement. Multivariable logistic regression analysis was performed to identify independent CT features associated with metastatic LNs, and their diagnostic performances were evaluated. LNs were classified into probably benign, indeterminate, and suspicious categories according to the K-TIRADS and the modified LN classification proposed in our study. The diagnostic performance of both classification systems was compared using the exact McNemar and Kosinski tests. Results: The absence of hilum (odds ratio [OR], 4.859; 95% confidence interval [CI], 1.593-14.823; P = 0.005), strong enhancement (OR, 28.755; 95% CI, 12.719-65.007; P < 0.001), and cystic changes (OR, 46.157; 95% CI, 5.07-420.234; P = 0.001) were independently associated with metastatic LNs. All LNs showing calcification were diagnosed as metastases. Heterogeneous enhancement did not show a significant independent association with metastatic LNs. Strong enhancement, calcification, and cystic changes showed moderate to high specificity (70.1%-100%) and positive predictive value (PPV) (91.8%-100%). The absence of the hilum showed high sensitivity (97.8%) but low specificity (34.0%). The modified LN classification, which excluded heterogeneous enhancement from the K-TIRADS, demonstrated higher specificity (70.1% vs. 62.9%, P = 0.016) and PPV (92.5% vs. 90.9%, P = 0.011) than the K-TIRADS. Conclusion: Excluding heterogeneous enhancement as a suspicious feature resulted in a higher specificity and PPV for diagnosing metastatic LNs than the K-TIRADS. Our research results may provide a basis for revising the LN classification in future guidelines.

Development and Validation of 18F-FDG PET/CT-Based Multivariable Clinical Prediction Models for the Identification of Malignancy-Associated Hemophagocytic Lymphohistiocytosis

  • Xu Yang;Xia Lu;Jun Liu;Ying Kan;Wei Wang;Shuxin Zhang;Lei Liu;Jixia Li;Jigang Yang
    • Korean Journal of Radiology
    • /
    • 제23권4호
    • /
    • pp.466-478
    • /
    • 2022
  • Objective: 18F-fluorodeoxyglucose (FDG) PET/CT is often used for detecting malignancy in patients with newly diagnosed hemophagocytic lymphohistiocytosis (HLH), with acceptable sensitivity but relatively low specificity. The aim of this study was to improve the diagnostic ability of 18F-FDG PET/CT in identifying malignancy in patients with HLH by combining 18F-FDG PET/CT and clinical parameters. Materials and Methods: Ninety-seven patients (age ≥ 14 years) with secondary HLH were retrospectively reviewed and divided into the derivation (n = 71) and validation (n = 26) cohorts according to admission time. In the derivation cohort, 22 patients had malignancy-associated HLH (M-HLH) and 49 patients had non-malignancy-associated HLH (NM-HLH). Data on pretreatment 18F-FDG PET/CT and laboratory results were collected. The variables were analyzed using the Mann-Whitney U test or Pearson's chi-square test, and a nomogram for predicting M-HLH was constructed using multivariable binary logistic regression. The predictors were also ranked using decision-tree analysis. The nomogram and decision tree were validated in the validation cohort (10 patients with M-HLH and 16 patients with NM-HLH). Results: The ratio of the maximal standardized uptake value (SUVmax) of the lymph nodes to that of the mediastinum, the ratio of the SUVmax of bone lesions or bone marrow to that of the mediastinum, and age were selected for constructing the model. The nomogram showed good performance in predicting M-HLH in the validation cohort, with an area under the receiver operating characteristic curve of 0.875 (95% confidence interval, 0.686-0.971). At an appropriate cutoff value, the sensitivity and specificity for identifying M-HLH were 90% (9/10) and 68.8% (11/16), respectively. The decision tree integrating the same variables showed 70% (7/10) sensitivity and 93.8% (15/16) specificity for identifying M-HLH. In comparison, visual analysis of 18F-FDG PET/CT images demonstrated 100% (10/10) sensitivity and 12.5% (2/16) specificity. Conclusion: 18F-FDG PET/CT may be a practical technique for identifying M-HLH. The model constructed using 18F-FDG PET/CT features and age was able to detect malignancy with better accuracy than visual analysis of 18F-FDG PET/CT images.

Deep Learning-Assisted Diagnosis of Pediatric Skull Fractures on Plain Radiographs

  • Jae Won Choi;Yeon Jin Cho;Ji Young Ha;Yun Young Lee;Seok Young Koh;June Young Seo;Young Hun Choi;Jung-Eun Cheon;Ji Hoon Phi;Injoon Kim;Jaekwang Yang;Woo Sun Kim
    • Korean Journal of Radiology
    • /
    • 제23권3호
    • /
    • pp.343-354
    • /
    • 2022
  • Objective: To develop and evaluate a deep learning-based artificial intelligence (AI) model for detecting skull fractures on plain radiographs in children. Materials and Methods: This retrospective multi-center study consisted of a development dataset acquired from two hospitals (n = 149 and 264) and an external test set (n = 95) from a third hospital. Datasets included children with head trauma who underwent both skull radiography and cranial computed tomography (CT). The development dataset was split into training, tuning, and internal test sets in a ratio of 7:1:2. The reference standard for skull fracture was cranial CT. Two radiology residents, a pediatric radiologist, and two emergency physicians participated in a two-session observer study on an external test set with and without AI assistance. We obtained the area under the receiver operating characteristic curve (AUROC), sensitivity, and specificity along with their 95% confidence intervals (CIs). Results: The AI model showed an AUROC of 0.922 (95% CI, 0.842-0.969) in the internal test set and 0.870 (95% CI, 0.785-0.930) in the external test set. The model had a sensitivity of 81.1% (95% CI, 64.8%-92.0%) and specificity of 91.3% (95% CI, 79.2%-97.6%) for the internal test set and 78.9% (95% CI, 54.4%-93.9%) and 88.2% (95% CI, 78.7%-94.4%), respectively, for the external test set. With the model's assistance, significant AUROC improvement was observed in radiology residents (pooled results) and emergency physicians (pooled results) with the difference from reading without AI assistance of 0.094 (95% CI, 0.020-0.168; p = 0.012) and 0.069 (95% CI, 0.002-0.136; p = 0.043), respectively, but not in the pediatric radiologist with the difference of 0.008 (95% CI, -0.074-0.090; p = 0.850). Conclusion: A deep learning-based AI model improved the performance of inexperienced radiologists and emergency physicians in diagnosing pediatric skull fractures on plain radiographs.

Use of Artificial Intelligence for Reducing Unnecessary Recalls at Screening Mammography: A Simulation Study

  • Yeon Soo Kim;Myoung-jin Jang;Su Hyun Lee;Soo-Yeon Kim;Su Min Ha;Bo Ra Kwon;Woo Kyung Moon;Jung Min Chang
    • Korean Journal of Radiology
    • /
    • 제23권12호
    • /
    • pp.1241-1250
    • /
    • 2022
  • Objective: To conduct a simulation study to determine whether artificial intelligence (AI)-aided mammography reading can reduce unnecessary recalls while maintaining cancer detection ability in women recalled after mammography screening. Materials and Methods: A retrospective reader study was performed by screening mammographies of 793 women (mean age ± standard deviation, 50 ± 9 years) recalled to obtain supplemental mammographic views regarding screening mammography-detected abnormalities between January 2016 and December 2019 at two screening centers. Initial screening mammography examinations were interpreted by three dedicated breast radiologists sequentially, case by case, with and without AI aid, in a single session. The area under the receiver operating characteristic curve (AUC), sensitivity, specificity, and recall rate for breast cancer diagnosis were obtained and compared between the two reading modes. Results: Fifty-four mammograms with cancer (35 invasive cancers and 19 ductal carcinomas in situ) and 739 mammograms with benign or negative findings were included. The reader-averaged AUC improved after AI aid, from 0.79 (95% confidence interval [CI], 0.74-0.85) to 0.89 (95% CI, 0.85-0.94) (p < 0.001). The reader-averaged specificities before and after AI aid were 41.9% (95% CI, 39.3%-44.5%) and 53.9% (95% CI, 50.9%-56.9%), respectively (p < 0.001). The reader-averaged sensitivity was not statistically different between AI-unaided and AI-aided readings: 89.5% (95% CI, 83.1%-95.9%) vs. 92.6% (95% CI, 86.2%-99.0%) (p = 0.053), although the sensitivities of the least experienced radiologists before and after AI aid were 79.6% (43 of 54 [95% CI, 66.5%-89.4%]) and 90.7% (49 of 54 [95% CI, 79.7%-96.9%]), respectively (p = 0.031). With AI aid, the reader-averaged recall rate decreased by from 60.4% (95% CI, 57.8%-62.9%) to 49.5% (95% CI, 46.5%-52.4%) (p < 0.001). Conclusion: AI-aided reading reduced the number of recalls and improved the diagnostic performance in our simulation using women initially recalled for supplemental mammographic views after mammography screening.

Automated Measurement of Native T1 and Extracellular Volume Fraction in Cardiac Magnetic Resonance Imaging Using a Commercially Available Deep Learning Algorithm

  • Suyon Chang;Kyunghwa Han;Suji Lee;Young Joong Yang;Pan Ki Kim;Byoung Wook Choi;Young Joo Suh
    • Korean Journal of Radiology
    • /
    • 제23권12호
    • /
    • pp.1251-1259
    • /
    • 2022
  • Objective: T1 mapping provides valuable information regarding cardiomyopathies. Manual drawing is time consuming and prone to subjective errors. Therefore, this study aimed to test a DL algorithm for the automated measurement of native T1 and extracellular volume (ECV) fractions in cardiac magnetic resonance (CMR) imaging with a temporally separated dataset. Materials and Methods: CMR images obtained for 95 participants (mean age ± standard deviation, 54.5 ± 15.2 years), including 36 left ventricular hypertrophy (12 hypertrophic cardiomyopathy, 12 Fabry disease, and 12 amyloidosis), 32 dilated cardiomyopathy, and 27 healthy volunteers, were included. A commercial deep learning (DL) algorithm based on 2D U-net (Myomics-T1 software, version 1.0.0) was used for the automated analysis of T1 maps. Four radiologists, as study readers, performed manual analysis. The reference standard was the consensus result of the manual analysis by two additional expert readers. The segmentation performance of the DL algorithm and the correlation and agreement between the automated measurement and the reference standard were assessed. Interobserver agreement among the four radiologists was analyzed. Results: DL successfully segmented the myocardium in 99.3% of slices in the native T1 map and 89.8% of slices in the post-T1 map with Dice similarity coefficients of 0.86 ± 0.05 and 0.74 ± 0.17, respectively. Native T1 and ECV showed strong correlation and agreement between DL and the reference: for T1, r = 0.967 (95% confidence interval [CI], 0.951-0.978) and bias of 9.5 msec (95% limits of agreement [LOA], -23.6-42.6 msec); for ECV, r = 0.987 (95% CI, 0.980-0.991) and bias of 0.7% (95% LOA, -2.8%-4.2%) on per-subject basis. Agreements between DL and each of the four radiologists were excellent (intraclass correlation coefficient [ICC] of 0.98-0.99 for both native T1 and ECV), comparable to the pairwise agreement between the radiologists (ICC of 0.97-1.00 and 0.99-1.00 for native T1 and ECV, respectively). Conclusion: The DL algorithm allowed automated T1 and ECV measurements comparable to those of radiologists.

Sonographic Diagnosis of Cervical Lymph Node Metastasis in Patients with Thyroid Cancer and Comparison of European and Korean Guidelines for Stratifying the Risk of Malignant Lymph Node

  • Sae Rom Chung;Jung Hwan Baek;Yun Hwa Rho;Young Jun Choi;Tae-Yon Sung;Dong Eun Song;Tae Yong Kim;Jeong Hyun Lee
    • Korean Journal of Radiology
    • /
    • 제23권11호
    • /
    • pp.1102-1111
    • /
    • 2022
  • Objective: To evaluate the ultrasonography (US) features for diagnosing metastasis in cervical lymph nodes (LNs) in patients with thyroid cancer and compare the US classification of risk of LN metastasis between European and Korean guidelines. Materials and Methods: From January 2014 to December 2018, US-guided fine-needle aspiration was performed on 836 LNs from 714 patients for the preoperative nodal staging of thyroid cancer. The US features of LNs were retrospectively reviewed for the following features: size, presence of hilum, margin, orientation, cystic change, punctate echogenic foci (PEF), large echogenic foci, eccentric cortical thickening, abnormal vascularity, and cortical hyperechogenicity. A multiple logistic regression analysis was performed to identify the independent US features for the diagnosis of metastatic LNs. The diagnostic performance of independent US features was subsequently evaluated. LNs were categorized according to the Korean Thyroid Imaging Reporting and Data System (K-TIRADS) and European Thyroid Association (ETA) guidelines, and the correlation between the two sets of classifications was assessed. Results: Absence of the hilum, presence of cystic changes, PEF, abnormal vascularity, and cortical hyperechogenicity were independent US features of metastatic LNs. Cystic changes, PEF, abnormal vascularity, and cortical hyperechogenicity showed high specificity (86.8%-99.6%). The absence of the hilum had the highest sensitivity yet low specificity (66.4%). When LNs were classified according to the ETA guidelines and K-TIRADS, they yielded similar categorizations of malignancy risks and were strongly correlated (Spearman coefficient, 0.9766 [95% confidence interval, 0.973-0.979]). According to the ETA guidelines, 9.8% (82/836) of LNs were classified as "not specified." Conclusion: Absence of hilum, cystic changes, PEF, abnormal vascularity, and cortical hyperechogenicity were independent US features suggestive of metastatic LNs in thyroid cancer. Both K-TIRADS and the ETA guidelines provided similar risk stratification for metastatic LNs with a high correlation; however, the ETA guidelines failed to classify 9.8% of LNs into a specific risk stratum. These results may provide a basis for revising LN classification in future guidelines.

99mTc-3PRGD2 SPECT/CT Imaging for Diagnosing Lymph Node Metastasis of Primary Malignant Lung Tumors

  • Liming Xiao;Shupeng Yu;Weina Xu;Yishan Sun;Jun Xin
    • Korean Journal of Radiology
    • /
    • 제24권11호
    • /
    • pp.1142-1150
    • /
    • 2023
  • Objective: To evaluate 99mtechnetium-three polyethylene glycol spacers-arginine-glycine-aspartic acid (99mTc-3PRGD2) single-photon emission computed tomography (SPECT)/computed tomography (CT) imaging for diagnosing lymph node metastasis of primary malignant lung neoplasms. Materials and Methods: We prospectively enrolled 26 patients with primary malignant lung tumors who underwent 99mTc-3PRGD2 SPECT/CT and 18F-fluorodeoxyglucose (18F-FDG) positron emission tomography (PET)/CT imaging. Both imaging methods were analyzed in qualitative (visual dichotomous and 5-point grades for lymph nodes and lung tumors, respectively) and semiquantitative (maximum tissue-to-background radioactive count) manners for the lymph nodes and lung tumors. The performance of the differentiation of lymph nodes with and without metastasis was determined at the per-lymph node station and per-patient levels using histopathological results as the reference standard. Results: Total 42 stations had metastatic lymph nodes and 136 stations had benign lymph nodes. The differences between metastatic and benign lymph nodes in the visual qualitative and semiquantitative analyses of 99mTc-3PRGD2 SPECT/CT and 18F-FDG PET/CT were statistically significant (all P < 0.001). The area under the receiver operating characteristic curve (AUC) in the semi-quantitative analysis of 99mTc-3PRGD2 SPECT/CT was 0.908 (95% confidence interval [CI], 0.851-0.966), and the sensitivity, specificity, positive predictive value, and negative predictive value were 0.86 (36/42), 0.88 (120/136), 0.69 (36/52), and 0.95 (120/126), respectively. Among the 26 patients (including two patients each with two lung tumors), 15 had pathologically confirmed lymph node metastasis. The difference between primary lung lesions in patients with and without lymph node metastasis was statistically significant only in the semi-quantitative analysis of 99mTc-3PRGD2 SPECT/CT (P = 0.007), with an AUC of 0.807 (95% CI, 0.641-0.974). Conclusion: 99mTc-3PRGD2 SPECT/CT imaging may notably perform in the direct diagnosis of lymph node metastasis of primary malignant lung tumors and indirectly predict the presence of lymph node metastasis through uptake in the primary lesions.

의식이 명료한 글루포시네이트 중독환자의 신경학적 예후인자로서 APACHE II의 유용성 (Utility of the APACHE II score as a neurological prognostic factor for glufosinate-intoxicated patients with alert mental status)

  • 이록;신태용;문형준;이현정;정동길;이동욱;홍선인;김현준
    • 대한임상독성학회지
    • /
    • 제21권2호
    • /
    • pp.135-142
    • /
    • 2023
  • Purpose: In patients with glufosinate poisoning, severe neurological symptoms may be closely related to a poor prognosis, but their appearance may be delayed. Therefore, this study aimed to determine whether the Acute Physiology and Chronic Health Evaluation II (APACHE II) score could predict the neurological prognosis in patients with glufosinate poisoning who present to the emergency room with alert mental status. Methods: This study was conducted retrospectively through a chart review for patients over 18 years who presented to a single emergency medical center from January 2018 to December 2022 due to glufosinate poisoning. Patients were divided into groups with a good neurological prognosis (Cerebral Performance Category [CPC] Scale 1 or 2) and a poor prognosis (CPC Scale 3, 4, or 5) to identify whether any variables showed significant differences between the two groups. Results: There were 66 patients (67.3%) with good neurological prognoses and 32 (32.8%) with poor prognoses. In the multivariate logistic analysis, the APACHE II score, serum amylase, and co-ingestion of alcohol showed significant results, with odds ratios of 1.387 (95% confidence interval [CI], 1.027-1.844), 1.017 (95% CI, 1.002-1.032), and 0.196 (95% CI, 0.040-0.948), respectively. With an APACHE II score cutoff of 6.5, the AUC was 0.826 (95% CI, 0.746-0.912). The cutoff of serum amylase was 75.5 U/L, with an AUC was 0.761 (95% CI, 0.652-0.844), and the AUC of no co-ingestion with alcohol was 0.629 (95% CI, 0.527-0.722). Conclusion: The APACHE II score could be a useful indicator for predicting the neurological prognosis of patients with glufosinate poisoning who have alert mental status.

퍼포먼스 이론의 관점으로 바라본 대학생들의 찾아가는 교육연극 공연 경험에 관한 사례연구 (A case Study on the Experiences of College Students Participating in the Career Exploration credit System)

  • 신민주;곽비주
    • 국제교류와 융합교육
    • /
    • 제4권1호
    • /
    • pp.1-18
    • /
    • 2024
  • 본 연구는 4명의 연극전공 대학생들이 초등학교 입학을 앞둔 7살의 유치원생들에게 '훈이와 초록이'라는 제목으로 진행한 찾아가는 관객 참여형 교육연극 경험에 관한 질적 사례연구이다. 연극의 핵심 주제는 초등학교 입학 전 학교생활에 대한 불안감 해소를 돕고 또래 친구들과 원만한 의사소통을 할 수 있도록 돕기 위한 내용이며 이를 위하여 참여자인 대학생들이 시나리오 기획부터 유치원 섭외 그리고 세 곳의 유치원에서 40분간에 걸친 연극공연까지 직접 진행하였다. 연구 결과 '내 인생의 또 다른 성장'과 '유아들과의 만남을 통한 행복감 향상', '꿈을 향한 새로운 도전'이라는 핵심 구성요소가 도출되었다. 본 연구의 가장 큰 의의는 대학생들에게 관객 참여형 교육연극 경험이 그동안 자신들이 쌓아왔던 배움의 결과물을 누군가에게 나눔으로 실천하게 하였다는 점과 이러한 배움의 나눔 실천을 통해 조금은 막연했던 자신들의 진로나 꿈의 방향성을 설정할 수 있도록 '자신감 향상'과 '마음의 울림'을 경험하게 해 준 계기가 되었다는 점이다. 추후 찾아가는 관객참여형 교육연극이 다양한 측면에서 폭넓게 시행되기를 기대한다.

Prediction of Patient Management in COVID-19 Using Deep Learning-Based Fully Automated Extraction of Cardiothoracic CT Metrics and Laboratory Findings

  • Thomas Weikert;Saikiran Rapaka;Sasa Grbic;Thomas Re;Shikha Chaganti;David J. Winkel;Constantin Anastasopoulos;Tilo Niemann;Benedikt J. Wiggli;Jens Bremerich;Raphael Twerenbold;Gregor Sommer;Dorin Comaniciu;Alexander W. Sauter
    • Korean Journal of Radiology
    • /
    • 제22권6호
    • /
    • pp.994-1004
    • /
    • 2021
  • Objective: To extract pulmonary and cardiovascular metrics from chest CTs of patients with coronavirus disease 2019 (COVID-19) using a fully automated deep learning-based approach and assess their potential to predict patient management. Materials and Methods: All initial chest CTs of patients who tested positive for severe acute respiratory syndrome coronavirus 2 at our emergency department between March 25 and April 25, 2020, were identified (n = 120). Three patient management groups were defined: group 1 (outpatient), group 2 (general ward), and group 3 (intensive care unit [ICU]). Multiple pulmonary and cardiovascular metrics were extracted from the chest CT images using deep learning. Additionally, six laboratory findings indicating inflammation and cellular damage were considered. Differences in CT metrics, laboratory findings, and demographics between the patient management groups were assessed. The potential of these parameters to predict patients' needs for intensive care (yes/no) was analyzed using logistic regression and receiver operating characteristic curves. Internal and external validity were assessed using 109 independent chest CT scans. Results: While demographic parameters alone (sex and age) were not sufficient to predict ICU management status, both CT metrics alone (including both pulmonary and cardiovascular metrics; area under the curve [AUC] = 0.88; 95% confidence interval [CI] = 0.79-0.97) and laboratory findings alone (C-reactive protein, lactate dehydrogenase, white blood cell count, and albumin; AUC = 0.86; 95% CI = 0.77-0.94) were good classifiers. Excellent performance was achieved by a combination of demographic parameters, CT metrics, and laboratory findings (AUC = 0.91; 95% CI = 0.85-0.98). Application of a model that combined both pulmonary CT metrics and demographic parameters on a dataset from another hospital indicated its external validity (AUC = 0.77; 95% CI = 0.66-0.88). Conclusion: Chest CT of patients with COVID-19 contains valuable information that can be accessed using automated image analysis. These metrics are useful for the prediction of patient management.