• Title/Summary/Keyword: item discrimination


The Effectiveness of the Training Program to Improve Mathematics Teachers' Professional Competency of Developing Assessment Instrument (현직 수학교사 문항 개발 연수의 평가도구 개발 전문성 향상 효과)

  • Choi, Jiseon
    • Journal of Educational Research in Mathematics / v.24 no.2 / pp.253-267 / 2014
  • This study analyses whether a training program for in-service mathematics teachers improves their professional competency in developing assessment instruments (including items). Participating teachers took a pre-test before the program and a post-test after it. At the beginning of the program they wrote opinions on pre-developed items that contained many errors, and in the middle of the program they discussed those opinions with one another. The differences between the pre-test and the post-test, and between the opinions given at the beginning and in the middle, were analysed. First, the teachers' professional competency, measured as self-perceived scores, improved with regard to understanding of standardized tests, item difficulty, and item discrimination. Second, the proportion of opinions supported by concrete reasons increased as the program progressed. Third, the effective elements of the program were within-group discussion, between-group discussion, and feedback from the instructor, whereas the ineffective element was insufficient time for practice.


An Analysis about the Features of Mathematical Learning of Middle School Students through the Distribution Graphs of the Responses Percentages in National Assessment of Educational Achievement (학업성취도 평가에서 답지 반응률 분포 그래프를 활용한 중학생의 수학과 학업 특성 분석)

  • Jo, Yun Dong;Lee, Kwang Sang
    • Journal of Educational Research in Mathematics / v.25 no.1 / pp.1-19 / 2015
  • This paper explores what can be improved in curriculum, teaching and learning, and evaluation, based on analyses of multiple-choice items from the National Assessment of Educational Achievement. To this end, we use distribution curves of response percentages to identify features of students' educational achievement through an in-depth analysis of not only each item itself but also the content embodied in particular distractors. These analyses provide more information than descriptive statistics such as the mean percentage of correct answers, the discrimination for the whole group, and the mean response percentages of subgroups, because the distribution curves of response percentages clearly reveal the transition from the lowest to the highest level of educational achievement. From these analyses we draw implications concerning the concepts of prime factor and prime factorization, ratios (proportions) such as velocity, linear functions, the volume of a cone, properties of solid figures, and the probabilities of the empty event and the total event.
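
The idea of a response-percentage distribution can be sketched as follows. This is a minimal illustration, not the authors' procedure: the function name `response_percentages`, the use of total test score to order students, and the choice of equal-sized groups are all assumptions made here.

```python
from collections import Counter

def response_percentages(responses, total_scores, n_groups=5):
    """Percentage of students choosing each option, per achievement group.

    responses: option chosen by each student on one item (e.g. "1".."5")
    total_scores: each student's total test score, in the same order
    Returns a list ordered from the lowest to the highest scoring group,
    each entry a dict {option: percent}. Grouping by total score into
    equal-sized groups is an assumption of this sketch.
    """
    order = sorted(range(len(responses)), key=lambda i: total_scores[i])
    size = len(order) / n_groups
    table = []
    for g in range(n_groups):
        members = order[round(g * size):round((g + 1) * size)]
        counts = Counter(responses[i] for i in members)
        table.append({opt: 100.0 * c / len(members) for opt, c in counts.items()})
    return table
```

Plotting each option's percentage against group rank gives one curve per option, so a distractor that attracts mainly mid-level students shows up as a hump rather than being averaged away.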

Development of a Behavior Rating Scale for Preschool Children (아동의 행동발달 평정척도 개발에 관한 연구)

  • RHEE, Un Hai;KOH, Yun Joo
    • Korean Journal of Child Studies / v.9 no.2 / pp.1-28 / 1988
  • The purpose of this study was to develop a behavior rating scale that preschool teachers can use to evaluate children's development. The procedures included content validation, a pilot test, and the main study. A total of 97 items were retained after the content validation and pilot test. The items of the scale were grouped into five areas (physical, language, cognitive, emotional, and social development) and 11 sub-areas. The resulting "Behavior Rating Scale for Preschool Children" was administered to 479 boys and girls, 3-6 through 6-5 years of age, selected from 10 different kindergartens and early education centers in Seoul, Pusan, and Chonju. The data were analyzed with SPSS, including item analysis, Cronbach's α for reliability, factor analysis to test construct validity, two-way ANOVA to test age and sex differences, and percentile norms. The 97 items of the scale were found to be satisfactory in terms of item discrimination, with indices ranging from .31 to .73. Cronbach's α was .98 for the total scale and ranged from .87 to .93 in specific domains, which was considered satisfactory. The factors extracted from each area were consistent with the educational objectives of the Yonsei Open Education Program, except for emotional development. The intercorrelations among the domains were relatively high, ranging from .56 to .81. Age differences were significant in cognitive, physical, and language development, but not in social and emotional development. Sex differences were significant in all areas, with girls scoring higher on average than boys. Percentile ranks were derived from the total score for each age group, and quartiles were calculated for sub-scores in each domain.
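
The two statistics reported above can be illustrated with a short sketch. The abstract does not say which discrimination index was used; the corrected item-total correlation below is one common choice and is an assumption here, as are the function names. Cronbach's α follows the standard formula α = k/(k-1) · (1 - Σ item variances / variance of total scores).

```python
from statistics import mean, variance

def cronbach_alpha(scores):
    """scores: list of respondents, each a list of k item scores."""
    k = len(scores[0])
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)

def corrected_item_total(scores, j):
    """Correlation of item j with the sum of all other items.

    Used here as a stand-in discrimination index (an assumption;
    the source does not name its index).
    """
    item = [row[j] for row in scores]
    rest = [sum(row) - row[j] for row in scores]
    return pearson(item, rest)
```

Excluding the item from its own total avoids inflating the correlation, which matters most on short scales.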


A Study on Graduate Attributes Assessment for K-EP (K-Engineering Professional) Qualification (K-EP(K-Engineering Professional) 자격을 위한 졸업생역량 평가방안 연구)

  • Choi, Se Hyu;Kang, Sang Hee;Kim, Jung Soo;Yoon, Jiyoung
    • Journal of Engineering Education Research / v.24 no.6 / pp.30-39 / 2021
  • The present study aims to demonstrate that the competency (referred to as graduate attributes or program outcomes) of graduates of engineering education programs can be evaluated objectively. To strengthen the link between engineering education accreditation and the qualification/certification system for engineering professionals, referred to as K-Engineering Professional (K-EP), the study proposes individually assuring the quality of accredited graduates using a multiple-choice test as the main assessment tool. Test questions related to the basic vocational skills of NCS are developed for seven of the 10 program outcomes of the ABEEK KEC2015. The three program outcomes PO1, PO3, and PO5, which need to fully accommodate the characteristics of each disciplinary field, are excluded. A pilot test involving graduates of eight accredited programs is conducted. Applying the Rasch model from Item Response Theory (IRT), the item difficulty, fit, and discrimination of the multiple-choice test are demonstrated. The pilot study strongly suggests that individual competency evaluation is possible at a certain level for the seven program outcomes tested. For PO1, PO3, and PO5, however, questions that address the characteristics of each disciplinary field need to be devised. If a suitable pool of questions is built, accredited programs can use it as a program outcomes assessment tool.
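
For readers unfamiliar with the Rasch model named above, a minimal sketch of its response function follows. The function names are illustrative, and the abstract does not describe the estimation software or procedure, so none is shown. In the Rasch model the probability of a correct answer depends only on the difference between person ability θ and item difficulty b, with the same slope for every item.

```python
import math

def rasch_probability(theta, b):
    """P(correct) for ability theta and item difficulty b (both in logits)."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def item_information(theta, b):
    """Fisher information of a Rasch item at ability theta: p * (1 - p)."""
    p = rasch_probability(theta, b)
    return p * (1.0 - p)
```

Because the slope is fixed, an item is most informative for examinees whose ability equals its difficulty, where the probability of success is 0.5.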

Development of Preventive Self-Management Knowledge Related to Premature Labor (PSMK-PL) Scale for Women of Childbearing Age : An Item Response Theory Approach (가임 여성의 조기진통에 대한 예방적 자가관리 지식 측정 도구 개발: 문항반응이론 적용)

  • Kim, Sun-Hee;Lee, Yu-Jin
    • The Journal of the Korea Contents Association / v.22 no.9 / pp.439-450 / 2022
  • Purpose: This study aimed to develop the Preventive Self-Management Knowledge related to Premature Labor (PSMK-PL) scale for women of childbearing age. Methods: Preliminary items were developed based on the literature and on interviews with women who had experienced premature labor. An online survey was conducted, and the data of 250 women were analyzed with the DIMTEST and DETECT programs, applying item response theory. Internal consistency reliability was analyzed with Cronbach's alpha (95% CI). Results: Of the 30 preliminary items, six were deleted. The difficulty and discrimination of the final 24-item, three-dimensional scale were all acceptable. Cronbach's alpha (95% CI) was .89 (.87~.91). Conclusion: The PSMK-PL scale generally consisted of valid items, and its reliability was acceptable.

Development of a Creativity Test for Children from 4 to 7 years (유아와 초등학교 저학년 아동을 위한 창의성 검사 도구 개발)

  • Kim, Ho
    • Journal of Creative Information Culture / v.5 no.3 / pp.265-273 / 2019
  • This study develops a creativity test for children aged 4 to 7 and examines its validity. Forty-seven items were constructed through a review of the literature and experts' content validation. The resulting test consisted of 22 items in three factors: flow and independence, expansion of thinking, and curiosity and openness. The test scale was examined in terms of item response distribution, item discrimination, construct validity, and internal consistency reliability, and was found to be acceptable. It can be concluded that the test developed in this study is suitable for measuring the creativity of Korean children aged 4 to 7 in a simple manner, and that it measures the cognitive and affective characteristics of creativity relatively well. This creativity test for children aged 4 to 7 will contribute to the active progress of research on creativity in young children and elementary school children.

Development of an Instrument to Measure Scientific Problem-Finding Ability for Scientifically-Gifted Student (과학 영재 학생들의 과학적 문제발견 능력을 측정하기 위한 도구 개발)

  • Ryu, Si-Gyeong;Park, Jong-Seok
    • Journal of The Korean Association For Science Education / v.28 no.2 / pp.139-149 / 2008
  • The purpose of this study is to develop a valid and reliable instrument for measuring the scientific problem-finding ability of scientifically gifted students. Based on an operational definition of scientific problem-finding ability, the instrument consists of five sections (appropriateness, flexibility, originality, elaboration, and valuation) designed to measure this ability. The validity of the instrument's content and evaluation criteria was checked by five experienced specialists in science education, and the instrument was then administered to 38 science high school students. Because the validity of the content and evaluation criteria, construct validity, inter-rater reliability, item difficulty, and item discrimination meet the criteria for a good test, the instrument developed in this study is considered valid and reliable.

Development and Evaluation of Criterion-Referenced Performance Assessment Items Based on the 7th National Science Curriculum -Subject Unit of Reproduction and Biological Accumulation- (제7차 교육과정에 근거한 준거지향적 수행평가 문항의 개발과 평가 -고등학교 과학 "생식"과 "생물 농축" 단원을 중심으로-)

  • Chung, Young-Lan;Park, Jin-Joo
    • Journal of The Korean Association For Science Education / v.24 no.3 / pp.519-531 / 2004
  • In recent years, there has been an increased emphasis on performance assessment to evaluate students' abilities, and Korea has introduced changes in testing and assessment. With the implementation of the 7th National Science Curriculum, additional work on efficacy, reliability, and comparability has been needed in order to develop performance assessment items, and criteria for professional and technical standards have also needed to be developed. The purpose of this study was to identify key concepts and to develop achievement standards, assessment standards, and performance assessment items based on the 7th National Science Curriculum for the units on reproduction (chapter 13) and biological accumulation (chapter 17). The study also examined the validity of the completed performance assessment items using classical test theory and polytomous item response theory. Twelve key concepts were extracted from chapter 13 (reproduction) and four from chapter 17 (biological accumulation). Twenty-six achievement standards were developed for chapter 13 and nine for chapter 17, each defined in terms of knowledge (K), process skill (P), and attitude (A). Twenty-five assessment standards were developed for chapter 13 and nine for chapter 17. Based on these achievement and assessment standards, twenty-two performance assessment items (seventeen open-ended questions, three essays, and two portfolios) with concrete grading criteria were developed. Eight open-ended items, four per chapter, were administered to 240 10th graders to evaluate the reliability of the test. The results suggest that the administered items were valid for performance assessment because their item difficulties and item discriminations were appropriate. Item discrimination differed little between the interpretations from classical test theory and polytomous item response theory. However, item difficulties differed somewhat between the two interpretations, because the characteristics of the examinees are reflected in classical test theory.

A Validation Study of the Behavior Rating Scale for Preschool Children based on the Yonsei Open Education Curriculum (연세 개방주의 교육과정에 기초한 유아 행동발달척도 타당화 연구)

  • Park, Kyung Ja;Chung, Young Sun;Park, Mi Hyun;Woo, Hyun Kyung;Bang, Eun Yeong;Choi, Seon Hwa
    • Korean Journal of Childcare and Education / v.13 no.5 / pp.43-64 / 2017
  • Objective: The purpose of this study was to develop and validate the Behavior Rating Scale for Preschool Children based on the Yonsei Open Education Curriculum. Methods: The subjects were 145 children aged three to six attending a university-affiliated preschool, and their teachers. Teachers observed their children for at least two weeks and completed the Behavior Rating Scale for Preschool Children. The scale consisted of five areas and 44 items, each rated on a five-level rubric. Results: First, age differences were significant and developmental trends appeared in almost all items. Second, the means of the upper and lower groups differed significantly. Third, the internal consistency reliability was .97 for all items and ranged from .86 to .93 for the five areas. The inter-observer reliability was .84. Fourth, the concurrent validity and content validity of the scale were relatively high. Conclusion/Implications: The Behavior Rating Scale for Preschool Children can be used as a valid and reliable instrument to assess preschool children's development.
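
The upper-lower group comparison mentioned in the results can be sketched as below. The abstract does not state how the groups were formed or which significance test was used; the 27% cut-off, the use of total score for ranking, and the function name are assumptions of this sketch, and it reports only the raw mean difference, not a significance test.

```python
from statistics import mean

def upper_lower_difference(item_scores, total_scores, fraction=0.27):
    """Mean item score of the top group minus that of the bottom group.

    Groups are the top and bottom `fraction` of respondents ranked by
    total score (0.27 is a conventional choice, assumed here).
    """
    n = len(total_scores)
    g = max(1, round(n * fraction))
    order = sorted(range(n), key=lambda i: total_scores[i])
    lower = [item_scores[i] for i in order[:g]]
    upper = [item_scores[i] for i in order[-g:]]
    return mean(upper) - mean(lower)
```

A larger positive difference indicates that the item separates high and low scorers more sharply; a value near zero or negative flags an item for review.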

Development of Measurement of Stress for Female Marriage Immigrants in Korea (여성결혼이민자의 스트레스 측정도구 개발)

  • Park, Min Hee;Yang, Sook Ja
    • Journal of Korean Public Health Nursing / v.26 no.3 / pp.518-531 / 2012
  • Purpose: This study was conducted to develop and test an instrument for assessing the stress of female marriage immigrants in Korea. Methods: Forty-four preliminary items were developed based on a literature review and focus group interviews. These items were evaluated by experts for content validity, resulting in six factors and 26 items. The 26 items were translated into Chinese, Vietnamese, and English by professional translators and reviewed by native speakers of each language who are fluent in Korean. To test validity and reliability, data were collected from 323 female marriage immigrants residing in five regions of Korea. Results: Item analysis retained 25 items. Factor analysis yielded 21 items in four factors, namely 1) household economic, 2) parenting and discrimination, 3) cultural, and 4) emotional stressors, which explained 61.3% of the total variance in the stress of female marriage immigrants in Korea. The Cronbach's alpha reliability coefficient was .903 for the overall instrument and .692-.892 for the four factors. Conclusion: The stress measurement for female marriage immigrants in Korea has high validity and reliability. It may therefore be used for systematic assessment of stress and for identifying areas of support for female marriage immigrants in Korea.