• Title/Abstract/Keywords: Inter-observer variation

Search results: 13 items; processing time 0.019 s

Large Variation in Clinical Practice amongst Pediatricians in Treating Children with Recurrent Abdominal Pain

  • van Kalleveen, Michael W.;Noordhuis, Elise J.;Lasham, Carole;Plotz, Frans B.
    • Pediatric Gastroenterology, Hepatology & Nutrition / Vol. 22, No. 3 / pp. 225-232 / 2019
  • Purpose: To evaluate intra- and inter-observer variability and guideline adherence amongst pediatricians in treating children aged between 4 and 18 years referred with recurrent abdominal pain (RAP) without red flags. Methods: The first part of the study is a retrospective single-center cohort study. The diagnostic work-ups of eight pediatricians were compared to the national guidelines. Intra- and inter-observer variability were examined using Cramér's V. Intra-observer variability was defined as the amount of variation within a pediatrician and inter-observer variability as the amount of variation between pediatricians in the application of diagnostic work-up in children with RAP. Prospectively, the same pediatricians were asked to report their management strategy for a fictitious case, to check for similarities with their retrospective diagnostic work-up. Results: A total of 10 patients per pediatrician were analyzed. Retrospectively, a (very) weak association between pediatricians' diagnostic work-ups was found (Cramér's V, 0.22), which implies high inter-observer variability. The intra-observer association in diagnostic work-up was moderate (range, 0.35-0.46). The Cramér's V of 0.60 in diagnostic work-up between pediatricians in the fictitious case implied the presence of a moderately strong association and lower inter-observer variability than in the retrospective study. Adherence to the guideline was 66.8%. Conclusion: We found high intra- and inter-observer variability and moderate guideline adherence in daily clinical practice amongst pediatricians in treating children with RAP in a teaching hospital.
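The abstract above summarizes agreement with Cramér's V. As a minimal sketch of how that statistic is obtained from a contingency table (the counts below are hypothetical and are not taken from the study; the function name `cramers_v` is an illustrative choice, and all row and column totals are assumed to be non-zero):

```python
import math

def cramers_v(table):
    """Cramér's V for an r x c contingency table given as a list of rows.

    Assumes every row total and column total is non-zero.
    """
    n = sum(sum(row) for row in table)
    row_tot = [sum(row) for row in table]
    col_tot = [sum(col) for col in zip(*table)]
    # Pearson chi-square statistic from observed and expected counts.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, obs in enumerate(row):
            exp = row_tot[i] * col_tot[j] / n
            chi2 += (obs - exp) ** 2 / exp
    # Normalize by n * (min(rows, columns) - 1) so the result lies in [0, 1].
    k = min(len(table), len(table[0])) - 1
    return math.sqrt(chi2 / (n * k))

# Hypothetical counts: rows = two pediatricians, columns = test ordered (yes, no).
table = [[8, 2], [3, 7]]
v = cramers_v(table)
print(round(v, 3))
```

Values near 0 indicate that the work-up chosen is unrelated to who chose it; values near 1 indicate that it depends strongly on the individual.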

Gleason's Grading of Prostatic Adenocarcinoma: Inter-Observer Variation Among Seven Pathologists at a Tertiary Care Center in Oman

  • Qureshi, Asim;Lakhtakia, Ritu;Bahri, Maiya AL;Haddabi, Ibrahim Al;Saparamadu, Anna;Shalaby, Asem;Riyami, Marwa Al;Rizvi, Gauhar
    • Asian Pacific Journal of Cancer Prevention / Vol. 17, No. 11 / pp. 4867-4868 / 2016
  • Prostatic adenocarcinoma is the commonest solid malignancy seen in elderly Omani males aged 60-80 years. The Gleason grade is the most widely used grading system for prostatic carcinoma and is recommended by the World Health Organization. A peer review was carried out at the Pathology Department of Sultan Qaboos University Hospital (SQUH), Oman, to assess the quality of reporting at the center. The aim of this study was to determine inter-observer variation among 7 pathologists working at a tertiary care center in Oman. A total of 47 consecutive prostatic biopsies were independently reviewed by seven pathologists and the results obtained were compared with each other and with the original diagnosis. This peer review indicated a fair inter-observer agreement (0.482) among 7 pathologists in the department, with fair to moderate agreement when the results were compared to the reported diagnosis, comparable to the published literature. Dual and sub-specialty reporting are being instituted to improve the performance in this vital aspect of pathology.

Echocardiography Core Laboratory Validation of a Novel Vendor-Independent Web-Based Software for the Assessment of Left Ventricular Global Longitudinal Strain

  • Ernest Spitzer;Benjamin Camacho;Blaz Mrevlje;Hans-Jelle Brandendburg;Claire B. Ren
    • Journal of Cardiovascular Imaging / Vol. 31, No. 3 / pp. 135-141 / 2023
  • BACKGROUND: Global longitudinal strain (GLS) is an accurate and reproducible parameter of left ventricular (LV) systolic function which has shown meaningful prognostic value. Fast, user-friendly, and accurate tools are required for its widespread implementation. We aim to compare a novel web-based tool with two established algorithms for strain analysis and test its reproducibility. METHODS: Thirty echocardiographic datasets with focused LV acquisitions were analyzed using three different semi-automated endocardial GLS algorithms by two readers. Analyses were repeated by one reader to assess intra-observer variability. CAAS Qardia (Pie Medical Imaging) was compared with 2DCPA and AutoLV (TomTec). RESULTS: Mean GLS values were -15.0 ± 3.5% from Qardia, -15.3 ± 4.0% from 2DCPA, and -15.2 ± 3.8% from AutoLV. Mean GLS did not differ significantly between Qardia and 2DCPA (p = 0.359), with a bias of -0.3%, limits of agreement (LOA) of 3.7%, and an intraclass correlation coefficient (ICC) of 0.88. Mean GLS did not differ significantly between Qardia and AutoLV (p = 0.637), with a bias of -0.2%, LOA of 3.4%, and an ICC of 0.89. The coefficient of variation (CV) for intra-observer variability was 4.4% for Qardia, 8.4% for 2DCPA, and 7.7% for AutoLV. The CV for inter-observer variability was 4.5%, 8.1%, and 8.0%, respectively. CONCLUSIONS: In echocardiographic datasets of good image quality analyzed at an independent core laboratory using a standardized annotation method, a novel web-based tool for GLS analysis showed consistent results when compared with two algorithms of an established platform. Moreover, inter- and intra-observer reproducibility results were excellent.
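The abstract above reports bias, limits of agreement, and a coefficient of variation for paired readings. The sketch below shows one common way to compute these quantities. It is an illustration under stated assumptions, not the study's method: the GLS values are hypothetical, the limits of agreement are taken as bias ± 1.96 SD of the paired differences, and the CV is taken as the SD of the paired differences divided by the absolute grand mean (definitions of observer CV vary between papers).

```python
import statistics

def bland_altman(a, b):
    """Return (bias, (lower, upper)) for paired measurements a and b."""
    diffs = [x - y for x, y in zip(a, b)]
    bias = statistics.mean(diffs)
    # Assumed convention: 95% limits of agreement = bias +/- 1.96 * SD of differences.
    half_width = 1.96 * statistics.stdev(diffs)
    return bias, (bias - half_width, bias + half_width)

def coefficient_of_variation(a, b):
    """SD of paired differences over |grand mean|, in percent (assumed definition)."""
    diffs = [x - y for x, y in zip(a, b)]
    grand_mean = statistics.mean(a + b)
    return 100 * statistics.stdev(diffs) / abs(grand_mean)

# Hypothetical GLS readings (%) from two readers on the same four datasets.
reader_1 = [-15.0, -18.0, -12.0, -16.0]
reader_2 = [-15.5, -17.0, -12.5, -16.0]

bias, loa = bland_altman(reader_1, reader_2)
cv = coefficient_of_variation(reader_1, reader_2)
print(bias, loa, cv)
```

Taking the absolute value of the grand mean keeps the CV positive even though GLS values are negative.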

Spinal Enumeration by Morphologic Analysis of Spinal Variants: Comparison to Counting in a Cranial-To-Caudal Manner

  • Yun, Sam;Park, Sekyoung;Park, Jung Gu;Huh, Jin Do;Shin, Young Gyung;Yun, Jong Hyouk
    • Korean Journal of Radiology / Vol. 19, No. 6 / pp. 1140-1146 / 2018
  • Objective: To compare the spinal enumeration methods that establish the first lumbar vertebra in patients with spinal variants. Materials and Methods: Of the 1446 consecutive patients who had undergone computed tomography of the spine from March 2012 to July 2016, 100 patients (62 men, 38 women; mean age, 47.9 years; age range, 19-88 years) with spinal variants were included. Two radiologists (readers 1 and 2) established the first lumbar vertebra through morphologic analysis of the thoracolumbar junction, and labeled the vertebra by counting in a cranial-to-caudal manner. Inter-observer agreement was established. Additionally, reader 1 detected the 20th vertebra under the assumption that there are 12 thoracic vertebrae, and then classified it as a thoracic vertebra, lumbar vertebra, or thoracolumbar transitional vertebra (TLTV), on the basis of morphologic analysis. Results: The first lumbar vertebra, as established by morphologic analysis, was labeled by each reader as the 21st segment in 65.0% of the patients, as the 20th segment in 31.0%, and as the 19th segment in 4.0%. Inter-observer agreement between the two readers in determining the first lumbar vertebra, based on morphologic analysis, was nearly perfect (κ value: 1.00). The 20th vertebra was morphologically classified as a TLTV in 60.0% of the patients, as the first lumbar segment in 31.0%, as the second lumbar segment in 4.0%, and as a thoracic segment in 5.0%. Conclusion: The establishment of the first lumbar vertebra using morphologic characteristics of the thoracolumbar junction in patients with spinal variants was consistent with the morphologic traits of vertebral segmentation.

Real Time Versus Photographic Assessment of Stool Consistency Using the Brussels Infant and Toddler Stool Scale: Are They Telling Us the Same?

  • Aman, Berthold Albert;Levy, Elvira Ingrid;Hofman, Benjamine;Vandenplas, Yvan;Huysentruyt, Koen
    • Pediatric Gastroenterology, Hepatology & Nutrition / Vol. 24, No. 1 / pp. 38-44 / 2021
  • Purpose: Digital communication is becoming increasingly important in clinical practice and research. The finding that stool consistency can be evaluated similarly using either "in vivo" or photographic material by health care professionals will decrease subjective interpretation by parents. The primary outcome of this study was the reliability of stool consistency scoring using the Brussels Infant and Toddler Stool Scale (BITSS) between fresh stools and their photos; the secondary outcome was the inter-rater reliability based on the fresh stools. Methods: Fresh stool samples from healthy children were collected in a day care center. These stools, and one month later the corresponding photos presented in a random order, were presented to 14 observers. Reliabilities were analyzed using absolute agreements and weighted and unweighted Cohen's κ. Results: In total, 202 samples were rated 576 times. Absolute agreement between photographic and real time assessment ranged between 71.1% and 83.3% among observers. This corresponded with substantial agreement (unweighted κ=0.70 [95% CI, 0.61-0.78]; weighted κ=0.86 [95% CI, 0.78-0.88]). The inter-observer agreement showed similar percentages of absolute agreement (81.4-82.0%) and κ-values corresponding with fair-to-moderate agreement. Conclusion: Our findings suggest that the assessment of fresh stool consistency can also reliably be done on photographic material when using the BITSS. This opens opportunities in scientific surroundings and in our daily life communication with parents and caretakers.
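The abstract above reports both unweighted and weighted Cohen's κ. As a rough sketch of the difference between the two, the function below computes κ for two raters from disagreement weights. The ratings are hypothetical ordinal scores, and linear weights are an assumption here, since the abstract does not state which weighting scheme was used. The sketch also assumes the two raters do not both place every item in one and the same category (otherwise the expected disagreement is zero).

```python
def cohen_kappa(r1, r2, categories, weighted=False):
    """Cohen's kappa for two raters; linear disagreement weights if weighted=True."""
    k = len(categories)
    idx = {c: i for i, c in enumerate(categories)}
    n = len(r1)
    # Joint proportion table: obs[i][j] = share of items rated i by rater 1 and j by rater 2.
    obs = [[0.0] * k for _ in range(k)]
    for a, b in zip(r1, r2):
        obs[idx[a]][idx[b]] += 1 / n
    p1 = [sum(row) for row in obs]        # marginal proportions, rater 1
    p2 = [sum(col) for col in zip(*obs)]  # marginal proportions, rater 2

    def weight(i, j):
        # Disagreement weight: 0 on the diagonal, larger for more distant categories.
        if weighted:
            return abs(i - j) / (k - 1)
        return 0.0 if i == j else 1.0

    d_obs = sum(weight(i, j) * obs[i][j] for i in range(k) for j in range(k))
    d_exp = sum(weight(i, j) * p1[i] * p2[j] for i in range(k) for j in range(k))
    return 1 - d_obs / d_exp

# Hypothetical ordinal scores (1-4) for eight samples: live rating vs photo rating.
live = [1, 2, 2, 3, 3, 4, 4, 2]
photo = [1, 2, 3, 3, 3, 4, 3, 2]

kappa_u = cohen_kappa(live, photo, [1, 2, 3, 4])
kappa_w = cohen_kappa(live, photo, [1, 2, 3, 4], weighted=True)
print(round(kappa_u, 3), round(kappa_w, 3))
```

With ordinal scales, the weighted form penalizes a one-category disagreement less than a three-category one, which is why it can exceed the unweighted value when most disagreements are between adjacent categories.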

Evaluation of the Impact of Iterative Reconstruction Algorithms on Computed Tomography Texture Features of the Liver Parenchyma Using the Filtration-Histogram Method

  • Pamela Sung;Jeong Min Lee;Ijin Joo;Sanghyup Lee;Tae-Hyung Kim;Balaji Ganeshan
    • Korean Journal of Radiology / Vol. 20, No. 4 / pp. 558-568 / 2019
  • Objective: To evaluate whether computed tomography (CT) reconstruction algorithms affect the CT texture features of the liver parenchyma. Materials and Methods: This retrospective study comprised 58 patients (normal liver, n = 34; chronic liver disease [CLD], n = 24) who underwent liver CT scans using a single CT scanner. All CT images were reconstructed using filtered back projection (FBP), hybrid iterative reconstruction (IR) (iDOSE4), and model-based IR (IMR). On arterial phase (AP) and portal venous phase (PVP) CT imaging, quantitative texture analysis of the liver parenchyma using a single-slice region of interest was performed at the level of the hepatic hilum using a filtration-histogram statistic-based method with different filter values. Texture features were compared among the three reconstruction methods and between normal livers and those from CLD patients. Additionally, we evaluated the inter- and intra-observer reliability of the CT texture analysis by calculating intraclass correlation coefficients (ICCs). Results: IR techniques affect various CT texture features of the liver parenchyma. In particular, model-based IR frequently showed significant differences compared to FBP or hybrid IR on both AP and PVP CT imaging. Significant variation in entropy was observed between the three reconstruction algorithms on PVP imaging (p < 0.05). Comparison between normal livers and those from CLD patients revealed that AP images depend more strongly on the reconstruction method used than PVP images. For both inter- and intra-observer reliability, ICCs were acceptable (> 0.75) for CT imaging without filtration. Conclusion: CT texture features of the liver parenchyma evaluated using the filtration-histogram method were significantly affected by the CT reconstruction algorithm used.

Prevalence of pineal gland calcification as an incidental finding in patients referred for implant dental therapy

  • Mutalik, Sunil;Tadinada, Aditya
    • Imaging Science in Dentistry / Vol. 47, No. 3 / pp. 175-180 / 2017
  • Purpose: Pineal gland calcification has been proposed to play a role in the pathogenesis of Alzheimer disease. This study evaluated the prevalence and extent of pineal gland calcification in cone-beam computed tomography (CBCT) scans of patients referred for dental implant therapy who could possibly be a vulnerable group for this condition. Materials and Methods: A retrospective evaluation of 500 CBCT scans was conducted. Scans that showed the area where the pineal gland was located were included. The scans were initially screened by a single observer to record the prevalence and extent of calcification. Six weeks following the completion of the study, another investigator randomly reviewed and selected 50 scans to investigate inter-observer variation, which was evaluated using reliability analysis statistics. The prevalence and measurements of the calcifications were reported using descriptive statistics. The chi-square test was used to compare the prevalence between males and females. Results: The prevalence of pineal gland calcification was 58.8%. There was no statistically significant correlation between age and the extent of the calcification. The prevalence of calcification was 58.6% in females and 59.0% in males. The average anteroposterior measurement was 3.73 ± 1.63 mm, while the average mediolateral measurement was 3.47 ± 1.31 mm. The average total calcified area was 9.79 ± 7.59 mm². Conclusion: The prevalence of pineal gland calcification was high in patients undergoing implant therapy. While not all pineal gland calcifications lead to neurodegenerative disorders, they should be strongly considered in the presence of any symptoms as a reason to initiate further investigations.

Observer Variation Factor on Advanced Method for Accurate, Robust, and Efficient Spectral Fitting of Java-Based Magnetic Resonance User Interface for MRS Data Analysis

  • 이석준;유승만
    • Journal of Radiological Science and Technology (대한방사선기술학회지:방사선기술과학) / Vol. 39, No. 2 / pp. 143-148 / 2016
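  • This study aimed to identify the observer-dependent factors of the jMRUI AMARES method in quantitative MRS analysis through measurements by experienced and inexperienced observers. Magnetic resonance spectroscopy data were acquired from the liver of 10-week-old male laboratory rodents on a 3T MRI scanner using a point resolved spectroscopy pulse sequence. To obtain a reference value, the ratio of the methylene protons at 1.3 ppm to the water protons at 4.7 ppm was quantified from the acquired data using LCModel software. Seven inexperienced observers measured total lipid with the jMRUI AMARES method at 1- and 2-week intervals, and the measured values were analyzed in SPSS using the intraclass correlation coefficient. Cronbach's alpha, which expresses the reliability of the consistency of measurements between observers, was below 0.1. The mean of the week-1 and week-2 values was 0.096 ± 0.038, which was observed to be 0.048 (50%) higher than the value from the LCModel analysis. For the jMRUI AMARES method to yield the same results as LCModel, the observer must have an accurate overview of the metabolites and carefully assess the shape of the acquired spectrum so that residual metabolites are minimized.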

Comparison of Standardized Patient and Faculty Agreement in Evaluating Nursing Students' Assessment and Communication Skills

  • 김영주
    • Journal of Korean Academy of Fundamentals of Nursing (기본간호학회지) / Vol. 24, No. 3 / pp. 189-199 / 2017
  • Purpose: This study was conducted to examine the level of agreement between a standardized patient (SP) and a faculty member in the evaluation of nursing students' assessment and communication skills. Methods: Participants were 51 third-year nursing students in a simulation practice of 'nursing care for a patient admitted with chest pain'. Using a 30-item checklist and a 16-item communication tool, an SP and a faculty member evaluated the students' assessment and communication skills during the simulation. Results: The average values for percent agreement and kappa statistic for nursing assessment between the two evaluators were 85.3% and .48, respectively. Twenty of thirty items evaluating assessment skill had above moderate agreement (≥ .41) by kappa between the evaluators. Seven of sixteen items evaluating communication and interpersonal skills showed above fair agreement (≥ .40) between the two evaluators, as measured by the intraclass correlation coefficient. Conclusion: The findings show that the SP's evaluations were consistent with those of the faculty member to a moderate degree. Clear guidelines for evaluation criteria and optimal time and effort for SP training are necessary to increase the reliability of standardized patients as evaluators in simulation-based nursing education.

Interpretation of Complete Tumor Response on MRI Following Chemoradiotherapy of Rectal Cancer: Inter-Reader Agreement and Associated Factors in Multi-Center Clinical Practice

  • Hae Young Kim;Seung Hyun Cho;Jong Keon Jang;Bohyun Kim;Chul-min Lee;Joon Seok Lim;Sung Kyoung Moon;Soon Nam Oh;Nieun Seo;Seong Ho Park
    • Korean Journal of Radiology / Vol. 25, No. 4 / pp. 351-362 / 2024
  • Objective: To measure inter-reader agreement and identify associated factors in interpreting complete response (CR) on magnetic resonance imaging (MRI) following chemoradiotherapy (CRT) for rectal cancer. Materials and Methods: This retrospective study involved 10 readers from seven hospitals with experience of 80-10210 cases, and 149 patients who underwent surgery after CRT for rectal cancer. Using MRI-based tumor regression grading (mrTRG) and methods employed in daily practice, the readers independently assessed mrTRG, CR on T2-weighted images (T2WI) denoted as mrCRT2W, and CR on all images including diffusion-weighted images (DWI) denoted as mrCRoverall. The readers described their interpretation patterns and how they utilized DWI. Inter-reader agreement was measured using multi-rater kappa, and associated factors were analyzed using multivariable regression. Correlation between sensitivity and specificity of each reader was analyzed using Spearman coefficient. Results: The mrCRT2W and mrCRoverall rates varied widely among the readers, ranging from 18.8% to 40.3% and from 18.1% to 34.9%, respectively. Nine readers used DWI as a supplementary sequence, which modified interpretations on T2WI in 2.7% of cases (36/1341 [149 patients × 9 readers]) and mostly (33/36) changed mrCRT2W to non-mrCRoverall. The kappa values for mrTRG, mrCRT2W, and mrCRoverall were 0.56 (95% confidence interval: 0.49, 0.62), 0.55 (0.52, 0.57), and 0.54 (0.51, 0.57), respectively. No use of rectal gel, larger initial tumor size, and higher initial cT stage exhibited significant association with a higher inter-reader agreement for assessing mrCRoverall (P ≤ 0.042). Strong negative correlations were observed between the sensitivity and specificity of individual readers (coefficient, -0.718 to -0.963; P ≤ 0.019). Conclusion: Inter-reader agreement was moderate for assessing CR on post-CRT MRI. Readers' varying standards on MRI interpretation (i.e., threshold effect), along with the use of rectal gel, initial tumor size, and initial cT stage, were significant factors associated with inter-reader agreement.
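The abstract above measures agreement with a multi-rater kappa but does not name the variant. As an illustration only, the sketch below implements Fleiss' kappa, which is one widely used multi-rater form; whether the study used this variant is an assumption. The counts are hypothetical, and the function assumes every subject is rated by the same number of raters and that more than one category is actually used.

```python
def fleiss_kappa(counts):
    """Fleiss' kappa; counts[i][j] = number of raters assigning subject i to category j."""
    n_subjects = len(counts)
    n_raters = sum(counts[0])  # assumed identical for every subject
    # Overall share of all ratings that fall in each category.
    p_cat = [sum(col) / (n_subjects * n_raters) for col in zip(*counts)]
    # Per-subject agreement: proportion of rater pairs that agree on that subject.
    p_subj = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_bar = sum(p_subj) / n_subjects   # mean observed agreement
    p_exp = sum(p * p for p in p_cat)  # agreement expected by chance
    return (p_bar - p_exp) / (1 - p_exp)

# Hypothetical data: 4 patients, 3 readers, categories = (CR, non-CR).
counts = [[3, 0], [0, 3], [2, 1], [1, 2]]
kappa = fleiss_kappa(counts)
print(round(kappa, 3))
```

Because chance agreement is derived from the pooled category shares, readers who apply different thresholds for calling CR lower the observed agreement without changing the chance term much, which is consistent with the threshold effect the abstract describes.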