• Title/Summary/Keyword: Item difficulty

Rasch Analysis of the Korean Version of the Fullerton Advanced Balance Scale

  • Jeon, Yong-jin; Kim, Gyoung-mo
    • Physical Therapy Korea / v.24 no.4 / pp.20-28 / 2017
  • Background: Rasch analysis has the advantage of placing both items and persons along a single common scale, calibrating person ability and item difficulty onto an interval scale in logits. Therefore, Rasch analysis has been recommended as a better method for evaluating functional outcome questionnaires than traditional analyses. Objectives: The aim of the current study was to investigate the item fit, item difficulty, rating scale, and separation index of the Korean version of the Fullerton Advanced Balance (KFAB) scale using Rasch analysis. Methods: In total, 93 patients with stroke (male=58, female=35) participated in this study. To investigate the item fit, difficulty, rating scale, and separation index of the KFAB scale, Rasch analysis was performed with the Winsteps software program. Results: In this study, all items of the KFAB scale were included in the Rasch model. The most difficult item was 'standing with feet together and eyes closed', and the easiest item was 'two-footed jump'. The rating scale was a 4-point scale instead of the original 5-point scale. Person and item separation indices showed high values, indicating that the scale can distinguish persons across a wide range of balance ability. Conclusion: The KFAB scale appears to be a reliable and valid tool to assess balance function in patients with stroke. Furthermore, the scale was found to discriminate among stroke patients of varying balance abilities.
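
The logit scale mentioned in this abstract can be illustrated with a minimal sketch of the dichotomous Rasch model, in which the probability of success depends only on the difference between person ability and item difficulty. This is a simplification: the KFAB uses a polytomous rating scale, and the function name and numeric values below are hypothetical, not taken from the study.

```python
import math

def rasch_probability(ability: float, difficulty: float) -> float:
    """Probability of success under the dichotomous Rasch model.

    Both arguments are in logits; only their difference matters.
    """
    return 1.0 / (1.0 + math.exp(-(ability - difficulty)))

# Hypothetical person ability and item difficulties (logits).
p_equal = rasch_probability(ability=0.5, difficulty=0.5)   # item matches the person
p_easy = rasch_probability(ability=0.5, difficulty=-1.0)   # item easier than the person
p_hard = rasch_probability(ability=0.5, difficulty=2.0)    # item harder than the person
```

When ability equals difficulty, the probability of success is exactly one half, which is why items and persons can be read off the same scale.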

A study on the improvement of the test items in Korean scholastic ability test (English test) (대학수학능력시험(영어시험)의 문항개선에 대한 연구)

  • Jeon, Sung-Ae
    • English Language & Literature Teaching / v.18 no.2 / pp.189-211 / 2012
  • The purpose of the study was to explore ways to improve the test items on the Korean scholastic ability test. More specifically, the researchers investigated whether use of the target language in test items would make a difference in total scores, discriminatory power, and item difficulty. A total of 288 high school seniors participated in the study. The subjects were divided into the experimental group (N=145) and the control group (N=143). A 25-item test resembling the Korean scholastic ability test was administered to both groups. The experimental group was given items whose questions and alternatives were all presented in English, whereas the control group was given items whose questions and alternatives were presented in Korean only. Statistical analyses revealed that use of English vs. Korean in the questions and alternatives made a significant difference in total scores, item discrimination, and item difficulty level. The findings strongly suggest that use of English is one way to improve the quality of the Korean scholastic ability test by enhancing item discrimination and face validity. Considering that the test in question is a high-stakes exam in Korea, further research on how to improve the Korean scholastic ability test is urgently called for.

A Comparative Study of Item Difficulty Hierarchy of Self-Reported Activity Measure Versus Metabolic Equivalent of Tasks

  • Choi, Bong-Sam
    • Physical Therapy Korea / v.20 no.3 / pp.89-99 / 2013
  • The purposes of this study were: 1) to show the item difficulty hierarchy of the walking/moving construct of the International Classification of Functioning, Disability and Health-Activity Measure (ICF-AM), 2) to evaluate the item-level psychometrics for model fit, 3) to describe the relevant physical activity defined by level of activity intensity expressed as Metabolic Equivalent of Tasks (MET), and 4) to explore to what extent the empirical activity hierarchy of the ICF-AM is linked to the conceptual model based on the level of energy expenditure described as MET. One hundred and eight participants with lower extremity impairments were examined for the present study. The ICF-AM, a newly created activity measure that uses an item response theory (IRT) model and a computer adaptive testing (CAT) method, includes a walking/moving construct. Based on the ICF category of walking and moving, the instrument comprised items corresponding to: walking short distances, walking long distances, walking on different surfaces, walking around objects, climbing, and running. The item difficulty hierarchy was created using Winsteps software for 20 items. The Rasch analyses (1-parameter IRT model) were performed on participants with lower extremity injuries who completed the paper-and-pencil version of the walking/moving construct of the ICF-AM. Physical activity can also be classified using METs, which are often preferred for determining the level of physical activity. The empirical item hierarchy of the walking, climbing, and running activities of the ICF-AM instrument was similar to the conceptual activity hierarchy based on METs. The empirically derived item difficulty hierarchy of the ICF-AM may be useful in developing MET-based activity measure questionnaires. In addition to the convenience of applying items to questionnaires, the findings could lead to the use of the CAT method without sacrificing the objectivity of physiologic measures.

Item Analysis using Classical Test Theory and Item Response Theory, Validity and Reliability of the Korean version of a Pressure Ulcer Prevention Knowledge (한국어판 욕창예방지식도구의 고전검사이론과 문항반응이론을 적용한 문항분석, 타당도와 신뢰도)

  • Kang, Myung Ja; Kim, Myoung Soo
    • Journal of Korean Biological Nursing Science / v.20 no.1 / pp.11-19 / 2018
  • Purpose: The purposes of this study were to perform item analysis using classical test theory (CTT) and item response theory (IRT), and to establish the validity and reliability of the Korean version of the pressure ulcer prevention knowledge instrument. Methods: The 26-item pressure ulcer prevention knowledge instrument was translated into Korean, and item analysis was conducted on the 22 items that had an adequate content validity index (CVI). A total of 240 registered nurses in 2 university hospitals completed the questionnaire. Each item was analyzed by applying CTT and IRT according to a 2-parameter logistic model. The quality of response alternatives, item difficulty, and item discrimination were evaluated. For testing validity and reliability, the Pearson correlation coefficient and Kuder-Richardson 20 (KR-20) were used. Results: The scale CVI was .90 (item-CVI range= .75-1.00). The total correct answer rate for this study population was relatively low at 52.5%. The quality of response alternatives was found to be relatively good (range= .02-.83). The item difficulty of the questions ranged from .10 to .86 according to CTT and from -12.19 to 29.92 according to IRT. Applying IRT, the instrument had 12 low-, 2 medium-, and 8 high-difficulty items. The values for item discrimination ranged from .04 to .57 applying CTT and from .00 to 1.47 applying IRT. Overall internal consistency (KR-20) was .62 and stability (test-retest) was .82. Conclusion: The instrument had relatively weak construct validity and item discrimination according to IRT. Therefore, cautious use of the Korean version of this instrument is recommended for discrimination, because it has many attractive response alternatives and low internal consistency.
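
As a rough illustration of two statistics named in this abstract, the sketch below computes CTT item difficulty (proportion correct per item) and KR-20 for a small 0/1 response matrix. The data are invented for illustration, and the sketch assumes the population variance of total scores; some texts use the sample variance instead, which changes the result slightly.

```python
from statistics import pvariance

def item_difficulty(responses):
    """CTT difficulty per item: proportion of respondents answering correctly."""
    n_people = len(responses)
    n_items = len(responses[0])
    return [sum(row[j] for row in responses) / n_people for j in range(n_items)]

def kr20(responses):
    """Kuder-Richardson 20 for dichotomously scored (0/1) items."""
    k = len(responses[0])
    p = item_difficulty(responses)
    pq_sum = sum(pj * (1 - pj) for pj in p)                  # sum of item variances
    total_var = pvariance([sum(row) for row in responses])   # variance of total scores
    return (k / (k - 1)) * (1 - pq_sum / total_var)

# Hypothetical data: 5 respondents (rows) by 4 items (columns), 1 = correct.
responses = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]
difficulty = item_difficulty(responses)
reliability = kr20(responses)
```

Note that in CTT a higher "difficulty" value means an easier item, since it is the proportion of correct answers.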

Factors of Predicting Difficulty of Mathematics Test Items in College Scholastic Ability Test (고등학교 수리영역 시험의 난이도 예측 요인 분석)

  • Ko, Ho-Kyoung; Yi, Hyun-Sook
    • Journal of the Korean School Mathematics Society / v.10 no.1 / pp.113-127 / 2007
  • This study explored the possibility of building a statistical model to predict the difficulty of mathematics test items through the analysis of nationwide scholastic ability test results for the past 5 years. Multiple linear regression analysis was conducted to predict the difficulty of mathematics test items. We adopted three major areas for independent variables: the content area, the behavior area, and the test item format area, each of which was categorized into more detailed sub-areas. For the dependent variable, the proportion of correct answers was used to represent item difficulty. Statistically significant independent variables were included in the regression model based on the stepwise selection method. Several important factors affecting the difficulty of mathematics test items in each area were identified. R-squared values for the final regression model were fairly high, implying that the regression equation can be used to predict the difficulty of test items at an acceptable level. Lastly, the regression model was cross-validated using independently collected data. We believe that this study will provide basic but critical information for predicting the proportion of correct answers by showing the factors that should be considered when developing mathematics test items for the college entrance examination or high school classroom tests.
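
The regression approach described here can be sketched in a few lines: dummy-coded item features predict the proportion of correct answers by ordinary least squares. This is only an illustration under stated assumptions. The two features (`is_calculus`, `is_short_answer`) and all data are hypothetical, the stepwise selection and cross-validation steps from the abstract are not implemented, and the solver assumes the features are not collinear.

```python
def fit_ols(X, y):
    """Ordinary least squares via the normal equations (X'X) b = X'y.

    X: list of feature rows (no intercept column); y: list of targets.
    Returns [intercept, b1, b2, ...]. Assumes X'X is invertible.
    """
    rows = [[1.0] + [float(v) for v in r] for r in X]
    k = len(rows[0])
    # Augmented matrix [X'X | X'y].
    A = [[sum(r[i] * r[j] for r in rows) for j in range(k)]
         + [sum(r[i] * t for r, t in zip(rows, y))] for i in range(k)]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        pv = A[col][col]
        A[col] = [v / pv for v in A[col]]
        for r in range(k):
            if r != col:
                f = A[r][col]
                A[r] = [a - f * b for a, b in zip(A[r], A[col])]
    return [A[i][k] for i in range(k)]

def predict(coef, features):
    """Predicted proportion correct for one item's feature row."""
    return coef[0] + sum(b * x for b, x in zip(coef[1:], features))

# Hypothetical items: (is_calculus, is_short_answer) -> proportion correct.
X = [(0, 0), (1, 0), (0, 1), (1, 1)]
y = [0.7, 0.5, 0.4, 0.2]
coef = fit_ols(X, y)
predicted = predict(coef, (1, 1))
```

A negative coefficient means that items with that feature tend to have a lower proportion of correct answers, that is, they are harder.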

Item analysis of the Korean version of the Intensive Care Experience Questionnaire: Using the Rasch Model based on Item Response Theory (Rasch 모형을 이용한 한국어판 중환자실경험 측정도구의 문항 분석)

  • Kang, Jiyeon; Kim, Minhui
    • Journal of Korean Critical Care Nursing / v.15 no.3 / pp.37-50 / 2022
  • Purpose: This study aimed to examine the item characteristics of the Korean version of the intensive care experience questionnaire (K-ICEQ) using the Rasch model of item response theory. Methods: In this methodological study, the validity of the scale was examined, and a secondary analysis was conducted using cohort data of patients who were discharged from intensive care units (ICUs). Data from 891 patients who responded to the K-ICEQ upon ICU discharge were analyzed. The WINSTEPS program was used to analyze item characteristics, including item difficulty, fit indices, rating scale appropriateness, and separation reliability. Results: The difficulty level of all 26 items of the K-ICEQ was appropriate, and the fit indices of 25 items, all except item 18, were good. The 5-point scale of the K-ICEQ was not appropriate in three subscales. The item separation reliability was good in all subscales, but the separation reliability for respondents did not meet the criteria. Conclusion: The examination of the item characteristics of the K-ICEQ revealed a good degree of difficulty, fit, and item separation reliability. To increase the validity of the K-ICEQ, we suggest rearranging the overall item order, modifying the item descriptions of the "recall of experience" subscale, and reducing the number of response levels.

Study on the herbology test items in Korean medicine education using Item Response Theory (문항반응이론을 활용한 한의학 교육에서 본초학 시험문항에 대한 연구)

  • Chae, Han; Han, Sang Yun; Yang, GiYoung; Kim, Hyungwoo
    • The Korea Journal of Herbology / v.37 no.2 / pp.13-21 / 2022
  • Objectives: The evaluation of academic achievement is pivotal for establishing an accurate direction and adequate level of medical education. The purpose of this study was to establish, for the first time, an item analysis technique based on Item Response Theory (IRT) for multiple-choice herbology tests in traditional Korean medicine education, where such analysis has not been available because of the difficulty of test theory and statistical calculation. Methods: The answers of 390 students (2012-2018) to the 14-item herbology test in a college of Korean medicine were used for the item analysis. For the multidimensional analysis of item characteristics, difficulty, discrimination, and guessing parameters, along with item-total correlation and percentage of correct answers, were calculated using Classical Test Theory (CTT) and IRT. Results: The validity parameters of strong and weak items were illustrated from multiple perspectives. There were 4 items with six acceptable index scores, and 5 items with only one acceptable index score. The item discrimination of IRT was found to have no significant correlation with the difficulty and discrimination indices of CTT, which calls for attention from medical education professionals regarding test credibility. Conclusion: Critical suggestions for the development, utilization, and revision of test items in the era of e-learning and evidence-based teaching were made based on the results of item analysis using IRT. The current study provides an initial foundation for upgrading the quality of Korean medicine education using test theory.
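
One common IRT model with difficulty, discrimination, and guessing parameters is the three-parameter logistic (3PL) model. The abstract does not state which model was fitted, so the sketch below is an assumption offered only to show how the three parameters interact; the parameter values are hypothetical.

```python
import math

def three_pl(theta: float, a: float, b: float, c: float) -> float:
    """Probability of a correct answer under the 3PL model.

    theta: person ability, a: discrimination, b: difficulty,
    c: guessing (lower asymptote, e.g. near 0.2 for five options).
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))

# Hypothetical item: moderately discriminating, average difficulty.
p_at_difficulty = three_pl(theta=0.0, a=1.2, b=0.0, c=0.2)
p_low_ability = three_pl(theta=-50.0, a=1.2, b=0.0, c=0.2)
p_high_ability = three_pl(theta=3.0, a=1.2, b=0.0, c=0.2)
```

At `theta == b` the probability sits halfway between the guessing floor and 1, and for very low ability it approaches the guessing parameter rather than zero.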

Analysis of the difficulty and discrimination of paper-based tests and computer-based tests according to item response theory: focusing on the National Dental Technician Examination (문항반응이론에 따른 지필 시험과 컴퓨터적용 시험의 난이도와 변별도 분석: 치과기공사 국가시험을 중심으로)

  • Hwang, Kyung-Sook
    • Journal of Technologic Dentistry / v.44 no.3 / pp.104-110 / 2022
  • Purpose: This study analyzes the difficulty and discrimination of the paper-based test (PBT) and the computer-based test (CBT) according to item response theory, focusing on the National Dental Technician Examination. Methods: A mock test was conducted from September 15 to 23, 2020, and the final sample comprised 179 people (1 of the 180 was absent). Both frequency analysis and factor analysis were performed. The collected data were analyzed using IBM SPSS Statistics ver. 18.0 (IBM) and jMetrik. The significance level was set to 0.05. Results: The mock test was easier for test takers in the CBT format. It was also predicted that the CBT could measure the ability of test takers better than the PBT could. Conclusion: The mock test showed that the difficulty, discrimination, and reliability of the questions were not affected by the examination method. The feasibility of a future change to the CBT was confirmed for the National Dental Technician Examination.

A Preliminary Study for Development of the Aphasia Screening Test (실어증 선별검사 도구개발을 위한 예비연구)

  • Kim, Hyang-Hee; Lee, Hyun-Joung; Kim, Deog-Yong; Heo, Ji-Hoe; Kim, Yong-Wook
    • Speech Sciences / v.13 no.2 / pp.7-18 / 2006
  • An aphasia screening test serves the main purpose of differentiating aphasic from non-aphasic patients quickly and efficiently. As a preliminary study for developing a standardized aphasia screening test for Korean patients, we constructed an aphasia screening test consisting of items from the Paradise Korean version of the Western Aphasia Battery (P K-WAB). All test items were analyzed in order to extract items with optimal item discrimination and adequate item difficulty indices. From the results, we were able to select some items from each subtest that gave optimal results in discriminant function analysis for the aphasic and normal control groups. It is thus expected that the information from the item analysis can be used in developing a Korean aphasia screening test.

Psychometric Properties of the Vocational Ability Scale in Individuals with Intellectual Disabilities

  • Park, Eun-Young
    • International Journal of Contents / v.15 no.3 / pp.1-6 / 2019
  • The purpose of this study was to identify the psychometric properties of the vocational ability scale used in the 8th Panel Survey of Employment for the Disabled in Korea by using the Rasch model. The sample data were collected from 398 individuals with intellectual disabilities. Item fit, item difficulty, the appropriateness of the rating scale, and the separation index of the vocational ability scale were evaluated. All 15 items showed an appropriate level of fit. The analysis of item difficulty indicates that modifications are required. Specifically, the need to add less difficult items is identified. The use of a 5-point rating scale is shown to decrease the test difficulty in terms of clarity and readability when appropriate, and a 4-point modification is also determined to be appropriate. With respect to the outcomes of the analysis, the person separation reliability value and separation index are high, and the reliability of the items is also high.
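
Separation reliability and the separation index reported in Rasch studies such as this one are two views of the same quantity. A minimal sketch of the standard conversion follows, with separation G = sqrt(R / (1 - R)) and the number of statistically distinct strata H = (4G + 1) / 3. The reliability value used is hypothetical, not a result from the study.

```python
import math

def separation_from_reliability(reliability: float) -> float:
    """Separation index G implied by a separation reliability R (0 <= R < 1)."""
    return math.sqrt(reliability / (1.0 - reliability))

def strata(separation: float) -> float:
    """Number of statistically distinct levels, H = (4G + 1) / 3."""
    return (4.0 * separation + 1.0) / 3.0

# Hypothetical person separation reliability.
g = separation_from_reliability(0.80)
h = strata(g)
```

Because G grows without bound as R approaches 1, the separation index spreads out differences among highly reliable measures that the reliability coefficient compresses.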