• Title/Summary/Keyword: Kappa coefficient

A Study on the Morphological Analysis of Sperm (정자의 형태학적 특성 분석에 관한 연구)

  • Paick, Jae-Seung;Jeon, Seong-Soo;Kim, Soo-Woong;Yi, Won-Jin;Park, Kwang-Suk
    • Clinical and Experimental Reproductive Medicine
    • /
    • v.24 no.2
    • /
    • pp.153-165
    • /
    • 1997
  • In male reproductive health, fertility, and IVF (in-vitro fertilization), semen analysis has been of primary importance. Semen analysis can be divided into analysis of sperm concentration, motility, and morphology. Existing methods of semen analysis have concentrated on sperm motility. To provide more useful and precise solutions for clinical problems such as infertility, semen analysis must include sperm morphological analysis. However, the traditional tools for semen analysis are subjective, imprecise, inaccurate, difficult to standardize, and difficult to reproduce. Therefore, drawing on advances in microcomputers and image processing techniques, we developed a new sperm morphology analyzer to overcome these problems. In this study, the agreement on percent normal morphology was studied between different observers and a computerized sperm morphology analyzer on a slide-by-slide basis using strict criteria. Slides from 30 different patients from the SNUH andrology laboratory were selected randomly. Microscopic fields and sperm cells were chosen randomly and percent normal morphology was recorded. The ability of the sperm morphology analyzer to repeat the same reading for normal and abnormal cells was studied. The results showed that there was no significant bias between two experienced observers. The limits of agreement were 4.1% to -3.8%. The Pearson correlation coefficient between readers was 0.79. The same findings were obtained between the manual readings and the sperm morphology analyzer. In these experiments the slides were stained by two different methods, PAP and Diff-Quik. The limits of agreement were 7.2% to -5.7% and 6.0% to -6.3%, respectively. The Pearson correlation coefficients were 0.76 and 0.91, respectively. The limits of agreement were tighter below 20% normal forms. 
In the repeatability experiments, 52 cells stained by the PAP and Diff-Quik methods were analyzed three times in succession. In the pairwise agreement estimates, the kappa statistics for the pairs were 0.76, 0.81, and 0.86, and 0.75, 0.88, and 0.88, respectively. This study showed that there was good agreement between manual and computerized assessment of normal and abnormal cells. The repeatability and per-slide agreement of the computerized sperm morphology analyzer were excellent. The computer's ability to classify normal morphology per slide is promising. Based on the results obtained, this system can be of clinical value both in andrology laboratories and IVF units.

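
The two agreement measures reported in the abstract above can be sketched as follows. This is a minimal illustration, not the authors' procedure: it assumes the limits of agreement follow the usual Bland-Altman definition (mean difference ± 1.96 standard deviations), which the abstract does not state, and all readings and labels below are hypothetical.

```python
from statistics import mean, stdev


def limits_of_agreement(readings_a, readings_b):
    """Return (bias, lower, upper) for paired readings.

    Assumes the Bland-Altman definition: bias +/- 1.96 * SD of the
    paired differences (an assumption, not taken from the abstract).
    """
    diffs = [a - b for a, b in zip(readings_a, readings_b)]
    bias = mean(diffs)
    spread = 1.96 * stdev(diffs)
    return bias, bias - spread, bias + spread


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two equally long sequences of class labels."""
    n = len(labels_a)
    classes = set(labels_a) | set(labels_b)
    # Observed proportion of items on which both readings agree.
    p_observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Agreement expected by chance from each reading's marginal rates.
    p_expected = sum(
        (labels_a.count(c) / n) * (labels_b.count(c) / n) for c in classes
    )
    return (p_observed - p_expected) / (1 - p_expected)


if __name__ == "__main__":
    # Hypothetical percent-normal readings for five slides.
    observer_1 = [12.0, 8.0, 15.0, 20.0, 5.0]
    observer_2 = [11.0, 9.0, 13.0, 21.0, 6.0]
    print(limits_of_agreement(observer_1, observer_2))

    # Hypothetical normal/abnormal calls for six cells on two runs.
    run_1 = ["normal", "normal", "abnormal", "abnormal", "normal", "abnormal"]
    run_2 = ["normal", "abnormal", "abnormal", "abnormal", "normal", "abnormal"]
    print(cohen_kappa(run_1, run_2))
```

With three repeated runs, as in the repeatability experiment, `cohen_kappa` would be applied to each of the three run pairs in turn.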

[Retracted] Assessing Nutritional Status in Outpatients after Gastric Cancer Surgery: A Comparative Study of Five Nutritional Screening Tools ([논문철회] 위암 수술 후 외래환자의 영양상태 평가: 5가지 영양검색도구의 비교연구)

  • Cho, Jae Won;Youn, Jiyoung;Choi, Min-Gew;Rha, Mi Young;Lee, Jung Eun
    • Korean Journal of Community Nutrition
    • /
    • v.26 no.4
    • /
    • pp.280-295
    • /
    • 2021
  • Objectives: This study aimed to examine the characteristics of patients according to their nutritional status as assessed by five nutritional screening tools: Patient-Generated Subjective Global Assessment (PG-SGA), NUTRISCORE, Nutritional Risk Index (NRI), Prognostic Nutritional Index (PNI), and Controlling Nutritional Status (CONUT) and to compare the agreement, sensitivity, and specificity of these tools. Methods: A total of 952 gastric cancer patients who underwent gastrectomy and chemotherapy from January 2009 to December 2012 at the Samsung Medical Center were included. We categorized patients into malnourished and normal according to the five nutritional screening tools 1 month after surgery and compared their characteristics. We also calculated the Spearman partial correlation, Cohen's Kappa coefficient, the area under the curve (AUC), sensitivity, and specificity of each pair of screening tools. Results: We observed 86.24% malnutrition based on the PG-SGA and 85.82% based on the NUTRISCORE among gastric cancer patients in our study. When we applied NRI or CONUT, however, the malnutrition levels were less than 30%. Patients with malnutrition as assessed by the PG-SGA, NUTRISCORE, or NRI had lower intakes of energy and protein compared to normal patients. When NRI, PNI, or CONUT were used to identify malnutrition, lower levels of albumin, hemoglobin, total lymphocyte count, total cholesterol, and longer postoperative hospital stays were observed among patients with malnutrition compared to those without malnutrition. We found relatively high agreement between PG-SGA and NUTRISCORE; sensitivity was 90.86% and AUC was 0.78. When we compared NRI and PNI, sensitivity was 99.64% and AUC was 0.97. AUC ranged from 0.50 to 0.67 for comparisons between CONUT and each of the other nutritional screening tools. Conclusions: Our study suggests that PG-SGA and NRI have a relatively high agreement with the NUTRISCORE and PNI, respectively. 
Further cohort studies are needed to examine whether the nutritional status assessed by PG-SGA, NUTRISCORE, NRI, PNI, and CONUT predicts the gastric cancer prognosis.

Early Estimation of Rice Cultivation in Gimje-si Using Sentinel-1 and UAV Imagery (Sentinel-1 및 UAV 영상을 활용한 김제시 벼 재배 조기 추정)

  • Lee, Kyung-do;Kim, Sook-gyeong;Ahn, Ho-yong;So, Kyu-ho;Na, Sang-il
    • Korean Journal of Remote Sensing
    • /
    • v.37 no.3
    • /
    • pp.503-514
    • /
    • 2021
  • Rice production from an adequate cultivation area is important for decision making on rice supply and demand policy. It is essential to identify rice cultivation areas in advance in order to estimate the year's rice production. This study was carried out to classify paddy rice cultivation in Gimje-si using Sentinel-1 SAR (synthetic aperture radar) and UAV imagery in early July. Time-series Sentinel-1A and 1B images acquired from early May to early July were converted into sigma naught (dB) images using the SNAP (Sentinel Application Platform, Version 8.0) toolbox provided by the European Space Agency. The farm map and parcel map, which are vector polygon spatial data, were used to stratify the paddy field population for classifying rice paddy cultivation. To distinguish paddy rice from other crops grown in the paddy fields, we used a decision tree method with threshold levels and a random forest model. The random forest model, trained on the mainly rice cultivation area and the rice and soybean cultivation area in the UAV image area, showed the best performance, with an overall accuracy of 89.9% and a kappa coefficient of 0.774. These results confirm the possibility of early estimation of the rice cultivation area in Gimje-si using UAV imagery.
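
Overall accuracy and the kappa coefficient, the two figures quoted for the random forest model above, are both derived from a classification confusion matrix. The sketch below shows that arithmetic under stated assumptions: the matrix is hypothetical (rows are reference classes, columns are predicted classes) and does not reproduce the study's data.

```python
def accuracy_and_kappa(confusion):
    """Return (overall accuracy, kappa) for a square confusion matrix.

    confusion[i][j] is the number of samples whose reference class is i
    and whose predicted class is j.
    """
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    p_observed = correct / total

    row_totals = [sum(row) for row in confusion]
    col_totals = [sum(col) for col in zip(*confusion)]
    # Chance agreement from the products of matching row/column totals.
    p_expected = sum(r * c for r, c in zip(row_totals, col_totals)) / total**2

    kappa = (p_observed - p_expected) / (1 - p_expected)
    return p_observed, kappa


if __name__ == "__main__":
    # Hypothetical counts for two classes: paddy rice vs. other crops.
    example = [
        [420, 30],
        [25, 125],
    ]
    accuracy, kappa = accuracy_and_kappa(example)
    print(f"overall accuracy = {accuracy:.3f}, kappa = {kappa:.3f}")
```

Kappa is lower than overall accuracy whenever one class dominates, because part of the raw agreement is then attributable to chance.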

[Republished study] Assessing Nutritional Status in Outpatients after Gastric Cancer Surgery: A Comparative Study of Five Nutritional Screening Tools ([재출판] 위암 수술 후 외래환자의 영양상태 평가: 5가지 영양검색도구의 비교연구)

  • Cho, Jae Won;Youn, Jiyoung;Choi, Min-Gew;Rha, Mi Young;Lee, Jung Eun
    • Korean Journal of Community Nutrition
    • /
    • v.27 no.3
    • /
    • pp.205-222
    • /
    • 2022
  • Objectives: This study examined the characteristics of patients according to nutritional status assessed by five nutritional screening tools: Patient-Generated Subjective Global Assessment (PG-SGA), NUTRISCORE, Nutritional Risk Index (NRI), Prognostic Nutritional Index (PNI), and Controlling Nutritional Status (CONUT), and compared the agreement, sensitivity, and specificity of these tools. Methods: A total of 952 gastric cancer patients who underwent gastrectomy and chemotherapy from January 2009 to December 2012 were included. The patients were categorized into malnutrition and normal status according to the five nutritional screening tools one month after surgery. The Spearman partial correlation, Cohen's Kappa coefficient, the area under the curve (AUC), sensitivity, and specificity of each pair of screening tools were calculated. Results: Malnutrition was observed in 86.24% of patients based on the PG-SGA and 85.82% based on the NUTRISCORE. When NRI or CONUT was applied, the proportions of malnutrition were < 30%. Patients with malnutrition had lower intakes of energy and protein than normal patients when assessed using the PG-SGA, NUTRISCORE, or NRI. Lower levels of albumin, hemoglobin, total lymphocyte count, and total cholesterol and longer postoperative hospital stays were observed among patients with malnutrition compared to normal patients when NRI, PNI, or CONUT was applied. Relatively high agreement for NUTRISCORE relative to PG-SGA was found; the sensitivity was 90.86%, and the AUC was 0.78. When NRI, PNI, and CONUT were compared, the sensitivities were 23.72% for PNI relative to NRI, 44.53% for CONUT relative to NRI, and 90.91% for CONUT relative to PNI. The AUCs were 0.95 for NRI relative to PNI and 0.91 for CONUT relative to PNI. Conclusions: NUTRISCORE had a high sensitivity compared to PG-SGA, and CONUT had a high sensitivity compared to PNI. NRI had a high specificity compared to PNI. 
This relatively high sensitivity and specificity resulted in 77.00% agreement between PNI and CONUT and 77.94% agreement between NRI and PNI. Further cohort studies will be needed to determine if the nutritional status assessed by PG-SGA, NUTRISCORE, NRI, PNI, and CONUT predicts the gastric cancer prognosis.

Assessing the Impact of Sampling Intensity on Land Use and Land Cover Estimation Using High-Resolution Aerial Images and Deep Learning Algorithms (고해상도 항공 영상과 딥러닝 알고리즘을 이용한 표본강도에 따른 토지이용 및 토지피복 면적 추정)

  • Yong-Kyu Lee;Woo-Dam Sim;Jung-Soo Lee
    • Journal of Korean Society of Forest Science
    • /
    • v.112 no.3
    • /
    • pp.267-279
    • /
    • 2023
  • This research assessed the feasibility of using high-resolution aerial images and deep learning algorithms for estimating the land-use and land-cover areas at the Approach 3 level, as outlined by the Intergovernmental Panel on Climate Change. The results from different sampling densities of high-resolution (51 cm) aerial images were compared with the land-cover map, provided by the Ministry of Environment, and analyzed to estimate the accuracy of the land-use and land-cover areas. Transfer learning was applied to the VGG16 architecture for the deep learning model, and sampling densities of 4 × 4 km, 2 × 4 km, 2 × 2 km, 1 × 2 km, 1 × 1 km, 500 × 500 m, and 250 × 250 m were used for estimating and evaluating the areas. The overall accuracy and kappa coefficient of the deep learning model were 91.1% and 88.8%, respectively. The F-scores, except for the pasture category, were >90% for all categories, indicating superior accuracy of the model. Chi-square tests of the sampling densities showed no significant difference in the area ratios of the land-cover map provided by the Ministry of Environment among all sampling densities except for 4 × 4 km at a significance level of p = 0.1. As the sampling density increased, the standard error and relative efficiency decreased. The relative standard error decreased to ≤15% for all land-cover categories at 1 × 1 km sampling density. These results indicated that a sampling density more detailed than 1 × 1 km is appropriate for estimating land-cover area at the local level.

A Short form of the Gray-Wheelwright Test (단축형 그레이-휠라이트 검사)

  • Ju-Kab Lee;Sung-Hyun Kim;Yong-Wook Shin
    • Sim-seong Yeon-gu
    • /
    • v.33 no.1
    • /
    • pp.61-80
    • /
    • 2018
  • We investigated whether the 81 items of the Gray-Wheelwright test correctly measure the concepts of Jung's typology and aimed to refine the test. Participants (n=431) completed the Gray-Wheelwright test, and the results were analyzed using factor analysis with varimax rotation and the maximum likelihood extraction method. Each extracted factor was labeled with a pair of opposing attitudes, introversion/extroversion, or one of the two pairs of opposing functional types, thinking/feeling or intuition/sensation, according to the majority type of the items in the factor. The minority items and the items not included in any factor were excluded, yielding a short form of the Gray-Wheelwright test with 45 items. We used the intraclass correlation coefficient (ICC) and Cronbach's alpha to assess the test-retest reliability and internal consistency of the test, respectively. The newly developed short form of the Gray-Wheelwright test measured Jung's personality types well, with performance comparable to the original, while reducing the time and effort required for testing.

Comparative Performance of Susceptibility Map-Weighted MRI According to the Acquisition Planes in the Diagnosis of Neurodegenerative Parkinsonism

  • Suiji Lee;Chong Hyun Suh;Sungyang Jo;Sun Ju Chung;Hwon Heo;Woo Hyun Shim;Jongho Lee;Ho Sung Kim;Sang Joon Kim;Eung Yeop Kim
    • Korean Journal of Radiology
    • /
    • v.25 no.3
    • /
    • pp.267-276
    • /
    • 2024
  • Objective: To evaluate the diagnostic performance of susceptibility map-weighted imaging (SMwI) taken in different acquisition planes for discriminating patients with neurodegenerative parkinsonism from those without. Materials and Methods: This retrospective, observational, single-institution study enrolled consecutive patients who visited movement disorder clinics and underwent brain MRI and 18F-FP-CIT PET between September 2021 and December 2021. SMwI images were acquired in both the oblique (perpendicular to the midbrain) and the anterior commissure-posterior commissure (AC-PC) planes. Hyperintensity in the substantia nigra was determined by two neuroradiologists. 18F-FP-CIT PET was used as the reference standard. Inter-rater agreement was assessed using Cohen's kappa coefficient. The diagnostic performance of SMwI in the two planes was analyzed separately for the right and left substantia nigra. Multivariable logistic regression analysis with generalized estimating equations was applied to compare the diagnostic performance of the two planes. Results: In total, 194 patients were included, of whom 105 and 103 had positive results on 18F-FP-CIT PET in the left and right substantia nigra, respectively. Good inter-rater agreement in the oblique (κ = 0.772/0.658 for left/right) and AC-PC planes (0.730/0.741 for left/right) was confirmed. The pooled sensitivities for two readers were 86.4% (178/206, left) and 83.3% (175/210, right) in the oblique plane and 87.4% (180/206, left) and 87.6% (184/210, right) in the AC-PC plane. The pooled specificities for two readers were 83.5% (152/182, left) and 82.0% (146/178, right) in the oblique plane, and 83.5% (152/182, left) and 86.0% (153/178, right) in the AC-PC plane. There were no significant differences in the diagnostic performance between the two planes (P > 0.05). 
Conclusion: There was no significant difference in the diagnostic performance of SMwI performed in the oblique and AC-PC planes in discriminating patients with parkinsonism from those without. This finding affirms that each institution may choose the imaging plane for SMwI according to its clinical setting.
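
The pooled sensitivities and specificities quoted above can be recomputed directly from the fractions printed in the abstract. The short sketch below does only that arithmetic; the dictionary layout and function names are illustrative choices, and the counts are copied as printed without any further interpretation.

```python
# Pooled counts (correct calls, total cases) as printed in the abstract.
REPORTED = {
    "sensitivity": {
        ("oblique", "left"): (178, 206),
        ("oblique", "right"): (175, 210),
        ("AC-PC", "left"): (180, 206),
        ("AC-PC", "right"): (184, 210),
    },
    "specificity": {
        ("oblique", "left"): (152, 182),
        ("oblique", "right"): (146, 178),
        ("AC-PC", "left"): (152, 182),
        ("AC-PC", "right"): (153, 178),
    },
}


def percent(numerator, denominator):
    """Express a fraction as a percentage rounded to one decimal place."""
    return round(100 * numerator / denominator, 1)


def summarize(reported=REPORTED):
    """Map each measure and (plane, side) pair to its percentage."""
    return {
        measure: {key: percent(*counts) for key, counts in cells.items()}
        for measure, cells in reported.items()
    }


if __name__ == "__main__":
    for measure, cells in summarize().items():
        for (plane, side), value in cells.items():
            print(f"{measure:11s} {plane:7s} {side:5s} {value:.1f}%")
```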

Coronal Three-Dimensional Magnetic Resonance Imaging for Improving Diagnostic Accuracy for Posterior Ligamentous Complex Disruption In a Goat Spine Injury Model

  • Xuee Zhu;Jichen Wang;Dan Zhou;Chong Feng;Zhiwen Dong;Hanxiao Yu
    • Korean Journal of Radiology
    • /
    • v.20 no.4
    • /
    • pp.641-648
    • /
    • 2019
  • Objective: The purpose of this study was to investigate whether three-dimensional (3D) magnetic resonance imaging could improve diagnostic accuracy for suspected posterior ligamentous complex (PLC) disruption. Materials and Methods: We used 20 freshly harvested goat spine samples with 60 segments and intact surrounding soft tissue. The animals were aged 1-1.5 years and consisted of 8 males and 12 females, which were sexually mature but had not reached adult weights. We created a paraspinal contusion model by percutaneously injecting 10 mL saline into each side of the interspinous ligament (ISL). All segments underwent T2-weighted sagittal and coronal short inversion time inversion recovery (STIR) scans as well as coronal and sagittal 3D proton density-weighted spectrally selective inversion recovery (3D-PDW-SPIR) scans acquired at 1.5T. Following scanning, some ISLs were cut and then the segments were rescanned using the same magnetic resonance (MR) techniques. Two radiologists independently assessed the MR images, and the reliability of ISL tear interpretation was assessed using the kappa coefficient. The chi-square test was used to compare the diagnostic accuracy of images obtained using the different MR techniques. Results: The interobserver reliability for detecting ISL disruption was high for all imaging techniques (0.776-0.949). The sensitivity, specificity, and diagnostic accuracy of the coronal 3D-PDW-SPIR technique for detecting ISL tears were 100, 96.9, and 97.9%, respectively, which were significantly higher than those of the sagittal STIR (p = 0.000), coronal STIR (p = 0.000), and sagittal 3D-PDW-SPIR (p = 0.001) techniques. Conclusion: Compared to other MR methods, coronal 3D-PDW-SPIR provides a more accurate diagnosis of ISL disruption. Adding coronal 3D-PDW-SPIR to a routine MR protocol may help to identify PLC disruptions in cases with nearby contusion.

A novel brief questionnaire using a face rating scale to assess dental anxiety and fear

  • Takuya Mino;Aya Kimura-Ono;Hikaru Arakawa;Kana Tokumoto;Yoko Kurosaki;Yoshizo Matsuka;Kenji Maekawa;Takuo Kuboki
    • The Journal of Advanced Prosthodontics
    • /
    • v.16 no.4
    • /
    • pp.244-254
    • /
    • 2024
  • PURPOSE. This study aimed to evaluate the reliability and validity of a four-item questionnaire using a face rating scale to measure dental trait anxiety (DTA), dental trait fear (DTF), dental state anxiety (DSA), and dental state fear (DSF). MATERIALS AND METHODS. Participants were consecutively selected from patients undergoing scaling (S-group; n = 47) and implant placement (I-group; n = 25). The S-group completed the questionnaire before both the initial and second scaling, whereas the I-group responded on the pre-surgery day (Pre-day), the day of implant placement (Imp-day), and the day of suture removal (Post-day). RESULTS. The reliability in the S-group was evaluated using the test-retest method, yielding weighted kappa values of 0.61 for DTA, 0.46 for DTF, 0.67 for DSA, and 0.52 for DSF. Criterion-related validity, assessed using the State-Trait Anxiety Inventory's trait anxiety and state anxiety, revealed positive correlations between trait anxiety and DTA/DTF (DTA, ρ = 0.30; DTF, ρ = 0.27, ρ: correlation coefficient) and between state anxiety and all four items (DTA, ρ = 0.41; DTF, ρ = 0.32; DSA, ρ = 0.25; DSF, ρ = 0.25). Known-group validity was assessed using the initial data and Imp-day data from the S-group and I-group, respectively, revealing significantly higher DSA and DSF scores in the I-group than in the S-group. Responsiveness was gauged using I-group data, showing significantly lower DSA and DSF scores on Post-day than on the other days. CONCLUSION. The newly developed questionnaire has acceptable reliability and validity for clinical use, suggesting its usefulness for research on dental anxiety and fear and for providing patient-specific dental care.
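
For ordinal ratings such as a face rating scale, test-retest reliability is often summarized with a weighted kappa, which penalizes large disagreements more than small ones. The sketch below is an illustration under explicit assumptions: it uses linear weights and a five-level scale coded 0 to 4, neither of which is stated in the abstract, and the ratings are hypothetical.

```python
def weighted_kappa(first, second, n_levels):
    """Linearly weighted kappa for two sets of ordinal ratings.

    Ratings are integers in range(n_levels). Linear disagreement
    weights |i - j| / (n_levels - 1) are an assumption made here.
    """
    n = len(first)
    # Joint proportions of (first rating, second rating) pairs.
    observed = [[0.0] * n_levels for _ in range(n_levels)]
    for a, b in zip(first, second):
        observed[a][b] += 1 / n

    row_marginals = [sum(row) for row in observed]
    col_marginals = [sum(col) for col in zip(*observed)]

    observed_disagreement = 0.0
    expected_disagreement = 0.0
    for i in range(n_levels):
        for j in range(n_levels):
            weight = abs(i - j) / (n_levels - 1)
            observed_disagreement += weight * observed[i][j]
            expected_disagreement += weight * row_marginals[i] * col_marginals[j]

    return 1 - observed_disagreement / expected_disagreement


if __name__ == "__main__":
    # Hypothetical test and retest ratings from eight respondents.
    test_ratings = [0, 1, 2, 2, 3, 4, 1, 0]
    retest_ratings = [0, 2, 2, 1, 3, 4, 1, 1]
    print(f"{weighted_kappa(test_ratings, retest_ratings, n_levels=5):.3f}")
```

With only two levels the linear weights reduce to 0 and 1, so the result coincides with the unweighted Cohen's kappa.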

Reliability and validity study of a life style questionnaire for elderly people (노인 생활습관 설문서의 신뢰도 및 타당도 평가 연구)

  • Park, Byung-Joo;Kim, Dae-Sung;Koo, Hye-Won;Bae, Jong-Myon
    • Journal of Preventive Medicine and Public Health
    • /
    • v.31 no.1 s.60
    • /
    • pp.49-58
    • /
    • 1998
  • This study was done to determine the reliability and validity of a life style questionnaire for the elderly. The questionnaires were sent to 16,524 elderly people who were beneficiaries of the Korean Medical Insurance Corporation in Pusan. Among the 9,139 completed questionnaires, 200 were randomly sampled and retested. Finally, 110 duplicates were collected. Weighted kappa values and Pearson correlation coefficients were estimated to measure the reliability. The validity coefficient was estimated using the reliability coefficient. In self-self responses, the reliability coefficients of most items were over 0.6, except for some physical activity related items. Relatively high reliability was observed in smoking and alcohol related items and anthropometric items. In self-proxy responses, most of the physical activity related items were found to be less reliable than in self-self responses. Smoking and alcohol related items were consistently reliable. Males showed higher validity in food related items than females. On the other hand, some of the physical activity related items and the smoking and alcohol related items were less valid in males than in females. With regard to the bias of proxy respondents, offspring tended to underestimate the frequency of 'house cleaning' and 'kitchen work' and to overestimate the respondents' height. In conclusion, the life style questionnaire was found to be reliable for most items, but some items related to physical activity were somewhat less reliable. Sex differences in validity were identified for some items. With regard to the bias of proxy respondents, offspring tended to show bias in some of the housework and anthropometry items.
