• Title/Summary/Keyword: item discrimination

Search Result 127, Processing Time 0.022 seconds

Item Difference Difficulty & Item Discrimination based Item Suitability Verification for Test Bank System (문제은행 시스템의 문항 차분 난이도 및 변별도를 기반으로 한 문항 적합성 검증)

  • 전병호
    • Proceedings of the Korea Multimedia Society Conference
    • /
    • 2001.06a
    • /
    • pp.403-406
    • /
    • 2001
  • 문제은행 시스템은 피험자의 조건에 따라 데이터베이스에서 문항을 추출하여 가상공간에서 평가를 수행한다. 가상 공간에서의 평가는 피험자에게 적용하는 경우 출제 빈도에 따라 운항의 난이도 및 변별력에 영향을 주게 된다 출제 빈도에 따라 난이도나 변별력이 낮아지는 문항은 출제를 제한하는 기준이 필요하다. 본 논문에서는 문항 사후 난이도와 문항 변별도를 기반으로 하여, 문항 차분 난이도를 주기적으로 측정하고 난이도 차이가 일정 수준 이상이 되는 문항에 대해 출제를 제한하는 방안과 전체 피험자에 대한 운항의 변별력을 측정하러 변별력이 떨어지는 문항을 출제자에게 문항을 수정하게 하거나 삭제하도록 하는 방안들 제안한다.

  • PDF

How to develop tiered tests: A developmental framework using statistical indexes and four tier types in secondary physics

  • Kim, Min-Kee;Jung, Jin-Sun;Pak, Sung-Jae
    • Journal of The Korean Association For Science Education
    • /
    • v.29 no.3
    • /
    • pp.277-290
    • /
    • 2009
  • In the era of the outcome-based education, multiple-choice test has been widely employed owing to its efficiency that enables educators to evaluate a quantity of students with much objectiveness. However, the prevalent test has not been reconsidered enough to overcome its apparent shortcomings: examiners' effort for developing plausible and faultless distracters defending from every falsification, and students' random guessing on key choices. For alleviating such defects, tiered test as an experimental format of multiple-choice tests has been suggested in science education. Since there has not accumulated much study on the implementation of tiered tests, our research aim is set to construct a framework suggesting statistical indexes for rationally discerning tiered units that develop an effective tiered test. Graded both by our tiered-scoring and by the conventional partial-scoring, the preliminary tiered test in secondary physics attests the improvement in its discrimination and difficulty distribution. The findings reveal that the two indexes discern effective tiered items: discrimination increase (Ct-p) and difficulty decrease (Dp-t). Based on the index information, 4 heterogeneous tier types are recommended in the content of secondary physics: directional manipulation, repeated calculation, diverse explanation, and plural variables.

The Development and Validity of the Home Literacy Environment Rating Scale (유아를 위한 가정문해환경 평정척도 개발 및 타당화 연구)

  • Park, Chan-Hwa;Kim, Gil-Sook
    • Journal of the Korean Home Economics Association
    • /
    • v.46 no.9
    • /
    • pp.87-97
    • /
    • 2008
  • This study was conducted to develop the Home Literacy Environment Rating Scale(HLERS) and to analyze its item discrimination, reliability, and validity. The participants of this study were 438 parents whose children were three to five years old. The item discrimination, determined by comparing the highest and lowest group using Chi-square($x^2$) and Cramer's V, was found to be satisfactory. The Cronbach's $\alpha$ for internal consistency reliability was .78. Factor analysis revealed that the structure of the HLERS consisted of three factors: 'reading books,' 'reading behavior and modeling of parents' and 'literacy learning.' The concurrent validity was also identified by correlation between the HLERS and two sub-tests of EC-HOME. In conclusion, these results demonstrated that the Home Literacy Environment Rating Scale is reliable and valid to examine the home literacy environment for Korean families.

Item Response Analysis on Items Related to Statistical Unit in the National Academic Aptitude Test -Empirical Study for Jellabuk-do Preliminary Testee- (대학수학능력시험의 통계단원 문제에 대한 문항반응분석 - 전북지역 예비 수험생을 대상으로 한 탐색연구 -)

  • Choi, Kyoung-Ho
    • Communications for Statistical Applications and Methods
    • /
    • v.17 no.3
    • /
    • pp.327-335
    • /
    • 2010
  • Item response theory provides a fixed results about students, regardless of the item difficulty and discrimina-tion and it is also a kind of item analysis methods which provides the same proper competence scores to students in spite of them taking different test repeatedly. In this paper, we researched item difficulty and item discrimina-tion and analyzed items in the national academic aptitude test which were given from 2000 to 2009 in the past 10 years through item response theory, especially, in connection with given items about statistical unit. As a result, we found that about 60 percents of the items were too difficult for high school students to solve, however, item discrimination proved to be great.

Psychometric Properties and Item Evaluation of Korean Version of Night Eating Questionnaire (KNEQ) (한국어판 야식증후군 측정도구의 신뢰도, 타당도 및 문항반응이론에 의한 문항분석)

  • Kim, Beomjong;Kim, Inja;Choi, Heejung
    • Journal of Korean Academy of Nursing
    • /
    • v.46 no.1
    • /
    • pp.109-117
    • /
    • 2016
  • Purpose: The aim of this study was to develop a Korean version of Night Eating Questionnaire (KNEQ) and test its psychometric properties and evaluate items according to item response theory. Methods: The 14-item NEQ as a measure of severity of the night eating syndrome was translated into Korean, and then this KNEQ was evaluated. A total of 1171 participants aged 20 to 50 completed the KNEQ on the Internet. To test reliability and validity, Cronbach's alpha, correlation, simple regression, and factor analysis were used. Each item was analyzed according to Rasch-Andrich rating scale model and item difficulty, discrimination, infit/outfit, and point measure correlation were evaluated. Results: Construct validity was evident. Cronbach's alpha was .78. The items of evening hyperphagia and nocturnal ingestion showed high ability in discriminating people with night eating syndrome, while items of morning anorexia and mood/sleep provided relatively little information. The results of item analysis showed that item2 and item7 needed to be revised to improve the reliability of KNEQ. Conclusion: KNEQ is an appropriate instrument to measure severity of night eating syndrome with good validity and reliability. However, further studies are needed to find cut-off scores to screen persons with night eating syndrome.

Verification of the Usefulness of the Mock TOEIC Test using Corpus Indices : Focusing on the Analysis of Difficulty and Discrimination (코퍼스 지표를 활용한 모의 토익시험의 유용성 검증 : 난이도와 변별도 분석을 중심으로)

  • Lee, Yena
    • The Journal of the Korea Contents Association
    • /
    • v.21 no.10
    • /
    • pp.576-593
    • /
    • 2021
  • In this study, in order to investigate the factors that affect the percentage of correct answers and the degree of discrimination of the TOEIC test, a regression analysis was performed using corpus indicators that influence correct answer rate and the degree of discrimination for each part derived from the item analysis. The basic calculation word_length, consistency index LSA_overlap_adjacent_sentences, lexical diversity MTLD_VOCD, conjunction All_logical_causal_connectives_incidence, situational model casual_particles_causal_verbs_Ratio, syntactic complexity Left_embeddedness, and syntactic pattern density Infinitive_density were found to have negative effects. These factors that lower the correct answer rate can be utilized when setting learning goals. Vocabulary diversity index MTLD_VOCD, conjunction Additive_connectives_incidence, syntactic pattern density Infinitive_density, and lexical information person1_2_pronoun_incidence were found to have a positive effect. Factors influencing the increase in discrimination may provide important information for developing a learning program.

Item Analysis for Selecting Science Gifted Middle School Students at Physics Class (과학영재교육원 중학교 물리 전공 선발 문항 분석)

  • Lim, Chun-Woo;Park, Yune-Bae
    • Journal of Gifted/Talented Education
    • /
    • v.20 no.1
    • /
    • pp.61-77
    • /
    • 2010
  • The purpose of this study was to analyze the items that were used in entrance examination for science gifted education center for middle school students by using content analysis and classical item analysis. In content analysis, objective type items exhibited mathematics and physics were dominant. Science giftedness & creativity items were dominant. And essay type items consisted of physics items, have evaluated creative problem solving ability. Item difficulty and discrimination index, on the whole, were appropriate. Comparing with objective type, essay type has higher discrimination index. In correlation analysis between total score and score of each type of items, total score has the highest correlation with essay type items and science giftedness & creativity. It was recommended that mathematics, physics and chemistry items with focusing giftedness & creativity could give some implications for future selection methods of science gifted education center.

Development of Parallel Short Forms of the Convergent Thinking and Problem Solving Inventory Utilizing Item Response Theory : A Case Study of Students in H University (문항반응이론을 적용한 융합적 사고 및 문제해결 역량진단 도구의 병렬 단축형 개발 : H 대학교를 중심으로)

  • You, Hyunjoo;Nam, Na-Ra
    • Journal of Engineering Education Research
    • /
    • v.26 no.3
    • /
    • pp.35-41
    • /
    • 2023
  • The study was conducted to develop two parallel short forms for the Convergent thinking and Problem solving questionnaires which are part of H University's core competency diagnostic tools, based on Multi-Item Response Theory. Item responses of 2,580 students were analyzed using Graded Response Model(GRM) to determine item difficulty and discrimination of each item. The research results are as follows. Two parrallel short tests were developed for the Convergent thinking questionnaire consisting of 12 items which were originally 17 items. Likewise, the Problem solving questionnaire, which originally consisted of 15 questions, was divided into two parallel short forms, each consisting of 9 items. The reliability of the shortened parallel tests was confirmed through internal consistency analysis, and their similarity to the original tests was established through correlation analysis. This study contributed to quality management of competency-based education and programs at H University by developing shortened tests. Based on the results, implications were presented as well as limitations and discussions.

A Method for Developing Items to Assess Earth Science Creativity (지구과학 창의력 평가 문항 개발 방법에 관한 연구)

  • Lee, Hang-Ro
    • Journal of the Korean earth science society
    • /
    • v.24 no.3
    • /
    • pp.150-159
    • /
    • 2003
  • This study suggests methods of assessing scientific creativity and developing items, which can be achieved when both earth science knowledge and general creativity are applied at the same time. According to the results of this study, the cognitive ability gaps between creativity and scientific creativity were clearly defined by the terms' operational definition. Four factors in the Subcategory Of Scientific Creativity-fluency, flexibility, elaboration, and originality-were selected, and the possibility of developing items out of these factors was discovered. The operational definitions of the four factors were given and the criteria for assessment and scoring were set. The validity, reliability, discrimination, and difficulty, which were the conditions required for the assessment instruments, were verified through three field trials of inputting the assessment instruments for scientific creativity. The assessment instruments were composed of 8 items with 2items for each factor. The average item fitness index obtained was 0.99, Cronbach , the item inter-consistency was 0.79,the inter-rater reliability of each item was 0.78, the inter-rater reliability of each factor was 0.75, the item discrimination power was 0.19, and the item difficulty was 0.00. Because the results were within the permitted limit of the conditions required for assessment instruments, the assessment instruments developed for scientific creativity in this study can be said to be very favorable.

Analysis of the Characteristics of Multiple-Choice Test Items Used in Integrated Science Assessment: Focused on the Case of Four High School (융합형 '과학' 평가에 사용된 선다형 문항의 특성 분석 : 4개 고등학교의 사례)

  • Lee, Ki-Young;Cho, Hee-Hyung;Kwon, Suk-Min;Kim, Hee-Kyong;Yoon, Heesook
    • Journal of Science Education
    • /
    • v.37 no.2
    • /
    • pp.278-293
    • /
    • 2013
  • The purpose of this study was to analyze the characteristics of multiple-choice test items used in assessment of high school integrated science according to 2009 revised curriculum. For the analysis of the tendency of item setting, we devised an analytic framework specific to integrated science, and analyzed the characteristics of items by applying the devised framework and item response theory. The results of the tendency of item setting revealed that most of items run counter to the intent of integrated science in terms of item resource, integration extent, and cognitive level, which means teachers are stick to separative method in teaching-learning and assessment of integrated science. The results of the analysis applying item response theory showed that item difficulty was appropriate and item discrimination was considerably high. However, there was no relevance between the tendency of item setting and qualitative characteristics of the items. We also discussed some agendas to improve the teaching-learning and assessment of integrated science based on the results of this study.

  • PDF