• 제목/요약/키워드: test item

Search Result 1,564, Processing Time 0.032 seconds

A Psychometric Item Goodness-of-Fit of the Test of Performance Strategies for Athletes with Physical Disabilities Applying Rasch Model (Rasch 모형을 적용한 지체장애 엘리트선수의 스포츠수행전략(TOPS) 척도 타당화)

  • Seo, Eunchul;Baek, Jae keun
    • 재활복지
    • /
    • v.21 no.2
    • /
    • pp.169-190
    • /
    • 2017
  • The purpose of this study was to investigate item goodness-of-fit of Scale, Rasch rating scale model was applied to 5 dimensions 24 items of the Test of Performance Strategies (TOPS) in a sample of athletes with physical disabilities (n=215). An assumption to test Rasch Model, which is satisfaction of unidimensionality, is regarded through PCAR test, and WINSTEPS 3.65 program is used to test the goodness-of-fit of items. The results of this study were: First, 3-point rating category was appropriate for the TOPS instead of the existing 5-point rating category. Second, as a result of analyzing the goodness-of-fit of the items, 21 items of the TOPS were suitable, but 3 items were not. Third, the item reliability of person separation of the TOPS was acceptable, but the person reliability of item separation was not suitable and it was necessary to adjust the item order considering the difficulty level of the items. Fourth, as a result of comparing the individual attribute score and the difficulty level through the Item-Person Map, the distribution of the item difficulty distribution was shown to be biased in some factors compared to the personal attribute score distribution.

Analysis of the difficulty and discrimination of paper-based tests and computer-based tests according to item response theory: focusing on the National Dental Technician Examination (문항반응이론에 따른 지필 시험과 컴퓨터적용 시험의 난이도와 변별도 분석: 치과기공사 국가시험을 중심으로)

  • Hwang, Kyung-Sook
    • Journal of Technologic Dentistry
    • /
    • v.44 no.3
    • /
    • pp.104-110
    • /
    • 2022
  • Purpose: This study analyzes the difficulty and discrimination of the paper-based test (PBT) and the computer-based test (CBT) according to item response theory, focusing on the National Dental Technician Examination. Methods: A mock test was conducted from September 15 to 23, 2020, and the final 179 (1 out of 180 absentees)people were the subjects of this study. Both frequency analysis and factor analysis were performed. The collected data were analyzed using IBM SPSS Statistics ver. 18.0 (IBM) and jMetrik programs. The significance level was set to 0.05. Results: The difficulty of the mock test was more easily responded to in CBT. It was also predicted that the CBT could better measure the ability of test takers than the PBT could. Conclusion: The difficulty, discrimination, and reliability of the questions were not affected by the examination method through the mock test. The feasibility of a future change to the CBT was confirmed by the National Dental Technician Examination.

The Effects of Item Parceling on Causal Parameter Testing and Goodness-of-Fit Indices in Structural Equation Modeling (구조방정식 모델에서 항목묶음이 인과 모수의 검정과 적합도 평가에 미치는 영향)

  • Cho, Hyun-Chul;Kang, Suk-Hou
    • Journal of Global Scholars of Marketing Science
    • /
    • v.17 no.3
    • /
    • pp.133-151
    • /
    • 2007
  • The purpose of this article is to examine the effects of item parceling on the consistency of significance testing of the causal parameters with regard to the relationship between the relevant constructs, as well as the effects of the item parceling on the goodness-of-fit indices of LISREL's general models. Most of the researchers' major purpose of using structural equation modeling (SEM) is to test their research hypotheses associated with the causal parameters. Therefore, we investigated three general models of LISREL, rather than the frequently used confirmatory factor analytic (CFA) models by many other researchers. The results of the study showed that there was a high level of consistency in the calculated test statics of causal parameters between the item-parceled solutions and the item-level solutions, and that the item-parceled solutions had better goodness-of-fit indices, such as GFI, AGFI, CFI, and NFI, than the solutions at the item level. However, in terms of RMSEA, there was no such tendency.

  • PDF

A Dimensionality Assessment for Polytomously Scored Items Using DETECT

  • Kim, Hae-Rim
    • Communications for Statistical Applications and Methods
    • /
    • v.7 no.2
    • /
    • pp.597-603
    • /
    • 2000
  • A versatile dimensionality assessment index DETECT has been developed for binary item response data by Kim (1994). The present paper extends the use of DETECT to the polytomously scored item data. A simulation study shows DETECT performs well in differentiating multidimensional data from unidimensional one by yielding a greater value of DETECT in the case of multidimensionality. An additional investigation is necessary for the dimensionally meaningful clustering methods, such as HAC for binary data, particularly sensitive to the polytomous data.

  • PDF

Reliability and Validity of The Korean Version Scale of Impact of Weight on Quality of Life in $Kids^{(C)}$ (한국어 버전 청소년의 체중 관련 삶의 질 측정도구의 신뢰도와 타당도 검증)

  • Kim, Jeoung-Hyun;Chun, Sungsoo;Choi, Han-Sik
    • The Journal of Korean Society for School & Community Health Education
    • /
    • v.15 no.3
    • /
    • pp.105-125
    • /
    • 2014
  • Background: The purpose of this study was to evaluate reliability and validity of a 27-item Korean Version of the Impact of Weight on Quality of Life in adolescents ($IWQOL-Kids^{(C)}$: Korean Version). Methods: This instrument was administered to 872 adolescents (mean z-BMI: 2.61, mean $age{\pm}SD$: $13.9{\pm}1.2$, male: 51.9%). Reliability was tested by internal consistency method and item analysis, validity test was performed by index of content validity, exploratory factor analysis, confirmatory factor analysis and concurrent validity. Sensitivity was tested by ANOVA and t-test. Analyses were performed using SPSS and Amos 18.0. Results: By an exploratory factor analysis, 4 factors were extracted; 'Body esteem' consisted of 9 items with 35.9% of variance (social life: 6 items, 10.23%, physical comfort: 6 items, 8.21%, family relations: 6 items, 7.0%). Four factors explained 61.34% of total variance. Internal consistency coefficients ranged from .766 to .929 for scales on 27 items and equal to .920 for total score for both the 26-item and 27-item tools. A confirmatory factor analysis was conducted for the convergent validity and discriminant validity. The standardized factor loadings to test the convergent validity showed more than .5(C.R<1.965) on all paths after deletion of item PC1 (avoid stairs). The average variances extracted were more than .50 and the construct reliabilities were more than .70. The average variances extracted were stronger than the squares of correlation coefficient of inter-latent variables. Conclusions: These results support that the $IWQOL-Kids^{(C)}$: Korean Version with a 26-item is a reliable and valid tool in Korean obese adolescents.

  • PDF

Psychometric Properties of the Alzheimer's Disease Knowledge Scale-Korean Version (한국어판 알츠하이머병 지식 측정도구의 신뢰도와 타당도)

  • Kim, Eun Joo;Jung, Ji-Young
    • Journal of Korean Academy of Nursing
    • /
    • v.45 no.1
    • /
    • pp.107-117
    • /
    • 2015
  • Purpose: The purpose of this study was to evaluate the psychometric properties of the Korean version of the Alzheimer's Disease Knowledge Scale (ADKS-K) to determine its applicability to Korean adults. Methods: Cross-cultural validity was performed according to Consensus-based Standards for the Selection of Health Measurement Instruments (COSMIN). The Kuder-Richardson Formula 20 for internal consistency and Intraclass Correlation Coefficient (ICC) for test-retest reliability were conducted. Content validity, criterion related validity and construct validity were evaluated. The Classical Test Theory (CTT) model and the Item Response Theory (IRT) model were applied in performing the item analysis. Results: The KR 20 was .71, and the ICC was .90, indicating that the ADKS-K has internal consistency and stability reliability. Thirty items of the ADKS-K had significant Content Validity Ratio (CVR) values, i.e., mean of 0.82 and range of 0.60~1.00. Mean item difficulty and discrimination indices calculated by TestAn program were 0.63 and 0.23, respectively. Mean item difficulty and discrimination indices calculated by BayesiAn program were -0.60 and 0.77, respectively. These tests indicate that ADKS-K has an acceptable level of difficulty and discriminating efficiency. Conclusion: Results suggest that ADKS-K has the potential to be a proper instrument for assessing AD knowledge in Korean adults.

Qualitative and Quantitative Analysis of Paper-Pencil Test Items for Exploring its Appropriateness as a Selection Tool of the Gifted in Science (과학 영재 선발 도구로서 지필 검사의 적합성 탐색을 위한 질적 및 양적 문항 분석)

  • Lee, Ki-Young;Dong, Hyo-Kwan;Hong, Jun-Eui;Kim, Hyun-Kyung;Jo, Bong-Jae
    • Journal of The Korean Association For Science Education
    • /
    • v.28 no.1
    • /
    • pp.32-46
    • /
    • 2008
  • The purpose of this study was to analyse the qualitative and quantitative characteristics of paper-pencil tests for exploring its appropriateness as a selection tool of the gifted in science. For this purpose, we developed two (internal and external) item analysis frameworks, and applied these frameworks to analyse qualitative characteristics. Also, we analysed the relationship between two characteristics. The results of analysing qualitative characteristics revealed that the portion of items with acceleration context exceeding middle school curriculum level was relatively large, which caused low content validity. Furthermore, there was considerable deviation in content and context by subject matter and year, which caused test unstability. Items measuring knowledge domain was the most prevalent, and too much weight on data interpretation & analysis domain in inquiry process skills. In case of creativity test, the portion of items measuring convergent thinking was much larger than that of divergent or associative thinking. Most of these items were represented by using pictures and tables rather than using graphs. Item types of multiple-choice and short answers were superior to essay types. Discrimination index, on the whole, was appropriate (above 0.3), but item difficulty showed a vast deviation ($0.01{\sim}0.90$). Correlation coefficients among subject matters and test tools were very low, and test reliabilities were also low. Low item difficulty & high discrimination index item types were distinguishable. Items with acceleration context were more discriminating than enrichment context. Implications of developing quality paper-pencil test items in the selection of gifted students are discussed.

A study on the perception of occupational therapy majors on Cognitive Impairment Screening Test (CIST)

  • Lee, Sun-myung;Chae, Joo-hyun;Sung, I-sul;Lee, Soo-jin;Moon, Soo-bin;Park, Da-hee;Park, So-hyun
    • Journal of Korean Clinical Health Science
    • /
    • v.9 no.2
    • /
    • pp.1493-1501
    • /
    • 2021
  • Purpose: The purpose of this study is to classify the characteristics of each item of CIST evaluation and to find out the degree of recognition of the characteristics of the cognitive tool. Methods: This study was conducted for occupational therapy majors at M University located in Gyeongsangnam-do. The data collection from May to June 2021. Total of 25 copies of the data were finally analyzed, SPSS Statistics 26 was used for data analysis. Results: As a result of the study, the significance level was visual reasoning 1 test strip and the visual reasoning 1 tool. In the relationship between the correspondence 1 figure simulation sheet and the figure simulation tool for each item and statistically significant, and the correspondence 2 visual reasoning 2 sheet. Visual reasoning 2 sheet and visual reasoning tool also showed that was found to be statistically significant. The correlation for visual reasoning 1 sheet and the visual reasoning 1 tool, reasoning 2 tool and visual reasoning 1 sheet, and the visual reasoning 2 tool and the verbal reasoning sheet. Conclusion: In this study, in the CIST items that may be difficult, it is better to attach the actual tool rather than the verbal explanation of the test paper to increase the efficiency of the test and the understanding of subjects with mild cognitive impairment. It was implemented by applying the tool, and it was found that the use of the tool in the visual reasoning item showed a high correlation by item. Furthermore, based on this study, it will be possible to suggest a method to control the difficulty of each subject of the cognitive evaluation tool, and to prepare a standard for future research.

STUDY ON PREDICTION OF THE INDUCED TEMPERATURE IN ENVIRONMENTAL TEST (얇은 평판의 환경시험에서 유도온도 예측에 대한 연구)

  • Lee, J.Y.;Baek, S.H.;Park, S.J.
    • Journal of computational fluids engineering
    • /
    • v.13 no.4
    • /
    • pp.24-32
    • /
    • 2008
  • Environmental test is divided into operation test and storage test. The temperature of storage test is induced temperature which is considered with all sort of the heat source. Induced temperature is the temperature to be adapted to each item and platform and can be induced by computer simulation, laboratory, and real field test. We considered the induced temperature to be associated with solar heat source. In this research. First, we compared the induced temperature which be occurred by one experiment for thin plate in solar test chamber with the other one which be occurred by computer simulation to be SolidWorks 2007 COSMOS FloWorks. After this verification, we showed induced temperature which can be occurred when the test item is stored. Especially, we bring out the induced temperature by applying the ambient temperatures which is presented by MIL-STD-810F and brought out in preceding research.

Detecting Differential Item Functioning based on Gender: Field of Mathematics in the TIMSS 2007 (초등학생의 성별에 따른 차별기능문항 분석: 수학 과학 성취도 국제비교연구(TIMSS) 2007 수학영역을 중심으로)

  • LEE, Seungbae;KIM, Sukwoo
    • Journal of Fisheries and Marine Sciences Education
    • /
    • v.29 no.3
    • /
    • pp.757-766
    • /
    • 2017
  • This study investigated not only the existence of differently functioned item due to gender but also domain. In this study, the randomly selected data of TIMSS 2007, which consist of 681 male and 646 women, were analyzed. To detect differently functioned items, this study employed Raju method. For Raju method, three-parameter logistic model was selected. Signed and unsigned area between two item characteristic curve were measured within the real ability range. An item which was detected commonly SA and UA area in Raju method was defined as a differently functioned item. As a result of this study, six items among twenty seven items of mathematics in the TIMSS 2007 were differently functioned item. Five items among those six items, were in favor of boys and one item was in favor of girls. Number, Geometric Shapes and Measures, and Applying were in favor of boys. but Data Display, Reasoning were in favor of girls. The conclusion of this study was summarized as existing differently functioned items in TIMSS 2007 and difference between favorable domain based gender. Finally, it is desirable to consider the differently functioned items by relating those item content for improving the test reliability of TIMSS 2007.