• Title/Summary/Keyword: Criterion-referenced test

Search Result 16, Processing Time 0.027 seconds

Current and Future Challenges of Student Assessment in Medical Education from an Outcome-based Education Perspective (성과중심교육 측면에서 우리나라 의과대학 학생평가의 현실과 과제)

  • Park, Jang Hee
    • Korean Medical Education Review
    • /
    • v.15 no.3
    • /
    • pp.112-119
    • /
    • 2013
  • Most medical colleges in Korea have been shifting from traditional education to outcome-based education, which is the general trend in medical education. The purpose of this study was to make some suggestions in light of the reality and challenges of student assessment in medical education from the perspective of outcome- based education. First, those who are responsible for student assessment should be diversified to include faculty, residents, students, and evaluation committee members. They need separate roles in educational evaluation, so evaluation competencies are required for them. Second, various methods for evaluation and score interpretation can be used for effective evaluation. We can adopt diagnostic, formative, and summative evaluation functionally, and the norm-referenced, criterion-referenced, growth-referenced, and ability-referenced evaluation based on criteria for score interpretation. Finally, various evaluation domains and test forms can be administered together in the common lectures in the medical school. We can test not only knowledge but also skills and attitudes, with diverse test forms such as supply and performance types.

Investigation of Various Reliability Indices of Pre-service Mathematics Teachers' Teaching Aptitude and Personality Test based on Setting Cut Scores (예비수학교사의 교직 적성·인성 검사에서 분할점수 변화에 따른 다양한 신뢰도 탐색)

  • Kim, Sungyeun
    • The Mathematical Education
    • /
    • v.57 no.1
    • /
    • pp.55-74
    • /
    • 2018
  • The purpose of this study is first to examine the relative influence of each error source and to investigate the optimal measurement conditions to ensure satisfactory multiple reliability coefficients based on the teaching aptitude and personality test for pre-service teachers. Participants were 33 students enrolled in mathematics education in a graduate school of education located in the Seoul metropolitan area from 2013 to 2017. The main results were as follows. First, the estimated variance due to residual was highest, followed by nesting of items within domains, graduate students, interactions of graduate students with domains, and domains. Second, total 96 items, with 12 domains containing 8 items in each domain, with cut score of 598, and original 210 items, with 14 domains containing 15 items in each domain, with cut scores of 615 or 716 were optimal measurement conditions to reach acceptable reliability levels based on the joint consideration of dependability coefficients, cut score dependability coefficients, adjusted dependability coefficients, and standard errors of measurement. Third, larger deviations between the arithmetic mean and the cut score indicated higher reliability coefficients of the test results. Finally, this study suggests ways for practitioners to consider how to apply generalizability theory for criterion-referenced tests and how to develop future research based on limitations.

A Study on the Development of the Model for the Process-focused Assessment Using Manipulatives -Focused on Middle School Mathematics- (교구를 활용한 수학적 과정의 평가모델 개발에 관한 연구 -중학교 수학을 중심으로-)

  • Choi-Koh, Sang Sook;Han, Hye Sook;Lee, Chang Yean
    • Communications of Mathematical Education
    • /
    • v.27 no.4
    • /
    • pp.581-609
    • /
    • 2013
  • Students' learning processes and mathematical levels should be correctly diagnosed in many different methods of assessment to help students learn mathematics. The study developed the model for the process-based assessment while using manipulatives in the middle school in order to improve problem solving, reasoning and communication which are emphasized in 2009 reformed curriculum as the areas of mathematical process. Identifying the principles of assessment, we created the assessment model for each area and carried out a preliminary study. Based on this, we revised the representative items and the observation checklist and then conducted a main study. Through the results of assessment, we found that students' thinking processes were well presented in scoring rubric for their responses on each item. It meant that the purpose of the assessment as a criterion-referenced test was achieved.

A Study on the Student Assessment of Elementary School Mathematics (초등학교 수학과 학생평가 실태 분석)

  • Lee, Jong-Euk
    • The Mathematical Education
    • /
    • v.48 no.1
    • /
    • pp.21-32
    • /
    • 2009
  • The purpose of this study is to diagnose the current states and the problems of student assessment of Elementary School Mathematics. For that purpose, this study conducted a survey and had the individual interviews. The surrey items consisted of the six main parts: questions about the development of assessment tools, the method to assess, the grading, the special supplementary courses, the opening of learning effect, and the follow-up guidances. The results of this study are as the follow First, elementary teachers depended heavily on internet sites for developing assessment problems. Second, elementary teachers made use of a performance assessment, a unit assessment, and a term examination at ordinary times. Third, unit assessment was largely referred for grading by elementary teachers. Fourth, in selecting the students for the special supplementary courses, both criterion-referenced assessment and norm-referenced assessment were considered. After finishing the special supplementary courses, additional tests were usually taken. Fifth, elementary teachers took a negative attitude in opening of learning effect. specialty opening of test paper to parents of students was done under 30%. Sixth, fellow-up guidances were the most through the classroom guidances. but consulting with parents of students was not frequently conducted by teachers.

  • PDF

Student Responses to Smart Device-Based Test on Competency Evaluation in Dental Education

  • Kim, Jooah;Kim, Soo-Yoon
    • Journal of Korean Dental Science
    • /
    • v.12 no.2
    • /
    • pp.58-65
    • /
    • 2019
  • Purpose: This study was aimed to investigate the possibility of utilizing smart device-based test (SBT) for competency evaluation in dental education and to analyze the student responses on overall competency evaluation using SBT method, in comparison to ubiquitous-based test (UBT). Materials and Methods: Questionnaire surveys have been conducted at Yonsei University College of Dentistry from 2015 to 2018 to obtain students' feedback on the application of SBT to competency evaluation. In addition, in order to supplement the competency evaluation procedure, considerations were explored by comparing the expected and actual difficulty of each item when preparing items for competency evaluation with SBT. Result: According to the survey results, student responses between the initial two years (2015 and 2016) differed from those in next two years (2017 and 2018). Students in 2017 and 2018 had more positive responses on competency evaluation with SBT. To determine the test validity, criterion-referenced evaluation was adopted to compare the data in 2017 and 2018 and slight differences in test difficulty in 2018 between the expected and actual difficulty of items were found. Conclusion: The results indicated that SBT was more appropriate for competency evaluation than UBT, based on four-year period of competency evaluation. The SBT was not affected by either the file size or the number of test-takers. Interestingly, students were not sensitive to test version of competency evaluation (paper-based test and SBT). This study suggests that the quality of the test items should be measured by continuous monitoring of the expected and actual difficulty of items for determining test validity. More detailed results and discussions of the findings are given for the development of test procedure and further potential research directions in dental education.

Development of an evaluation tool of quality of nursing care for gastrointestinal surgery patient (위.장관계 수술 환자간호의 질평가를 위한 도구개발)

  • Lee, Byeong-Suk;Park, Jeong-Ho;Jo, Hyeon
    • Quality Improvement in Health Care
    • /
    • v.4 no.2
    • /
    • pp.260-278
    • /
    • 1997
  • Background : Quality of professional nursing care is the most essential factor for survival and growth of nursing profession. Then, nursing professionals have responsibility for the evaluation of quality of professional nursing care. The purpose of this study was to develope an evaluation tool of nursing care for patients received gastrointestinal surgery with general anesthesia. This study was a primary work for the developement of a computer program for the evaluation of nursing care. Methods : This study was done through some consecutive steps. They were (1) Developement of items for the tool (2) Developement of an evaluation tool of nursing care quality for the G-I surgery patient (3) Test of reliability and validity of the tool. Two groups of experts and expert pannels who had much experience of the QA and the care of G-I surgery patients participated for developement of the items. 85 nursing records were used for the test of reliability and validity of the developed tool. The evaluation tools were developed with two types of scoring, norm-referenced tool and criterion-referenced tool. Results The system of items for tool was evaluation area evaluation item-indicator. There were 7evaluation areas which contained 32evaluation items which contained 7lindicators. Evaluation areas 1, 2, 3, 4 were for the evaluation of process and 5, 6, 7 were for the evaluation of outcome of nursing care for G-I surgery patient. For the test of interrator reliability, correlation coefficients of each scores of items and intragroup correlation coefficients were calculated. The average correlation coefficients between two rators were 0.65, 0.54 and the intragroup correlation coefficient were 0.99 and 1.00 by the types of scoring. The Cronbach alpha coefficients of the tools were 0.54 and 0.46 by the types of scoring. The average content validity index of the items was 0.95 from 4 pairs of experts. Because there were significant differences between some scores of quality of nursing care of 3 general hospitals regardless of the types of scoring, the tools could be thought to have some construct validity. And also, there were significant correlations between some scores of quality of nursing care and admission days and admission days after surgery regardless of the types of scoring, the tools could be thought to have predictive validity. Conclusion In this study, the evaluation tool of nursing care was developed for the very specified group of patient, G-I surgery patient. And the items were developed and tested by the experts of nursing practice. Because of these reasons, it was supposed that the tool could be used effectively in nursing pratice. And the procedures for the development and the test of the evaluation tool of nursing care in this study were supposed to be used for the developement of other tools.

  • PDF

A Study on Health Education Program Development of Respiratory Communicable Disease Prevention for Preschool Children and the Measurement of It's Effects (학령전 아동을 위한 호흡기전염병 예방 프로그램의 개발 및 효과에 관한 연구)

  • Kim, Il-Ok
    • Child Health Nursing Research
    • /
    • v.10 no.1
    • /
    • pp.66-79
    • /
    • 2004
  • Purpose: The purpose of this study were to develop a respiratory communicable disease prevention program for preschoolers and measure it's effects. Method: The respiratory communicable disease prevention program for preschoolers consisted of texts, cartoons, photographs, discussions, demonstrations, puzzle games, die games, compensation/reinforcement, and token economy which were directed under the systematic design of instruction by Dick %amp; Carey. This study was a quasi experimental study under the nonequivalent control group with pretest-posttest design. The subjects of this study were 45 preschool children who are attending 3 different district nursery schools and they were matched by the age, pretest knowledge, and pretest behavior. The instrument used in this study was criterion referenced test items that were developed by a researcher for evaluating the subject's knowledge, attitude, and behavior about respiratory communicable disease prevention. A pretest was administered a week before treatment. Experimental group Ⅰ was administered by the treatment of respiratory communicable disease prevention program. Experimental group Ⅱ was administered by above program with token economy program. The posttest was conducted on the eighth day. The third test for behavior was completed 15th day. To determine the effect of the program, the data were analyzed by the SAS 6.12 program with Kruskal Wallis test, ANCOVA, ANOVA, Duncan's test and paired t-test. Result: 1) There was a significant difference in knowledge between the experimental groups and control group(F=5.89, P=0.0197). 2) There was a significant difference in attitude between the experimental groups and control group(F=3.29, P=0.0469). 3) There was a non-significant difference in behavior between the experimental groups and control group(F=0.00, P=0.9512). 4) In the experimental groupⅡ, there was highly significant increase in behavior after token economy(t=4.5252, P=0.0005). Conclusion: It was found that the respiratory communicable disease prevention program for preschool children was effective in changing the preschoolers' knowledge and attitude on the respiratory communicable disease prevention, but not enough for changing the preschoolers' behavior. Token economy was improved as an effective and strong method for inducing desirable changes of preschoolers' behavior.

  • PDF

Effectiveness of a Drug Misuse and Abuse Preventive Program for Middle School Students (중학생 약물오남용 프로그램의 효과)

  • Lee, Yun-Yeong;Han, Suk-Jeong
    • Journal of the Korean Society of School Health
    • /
    • v.19 no.2
    • /
    • pp.89-104
    • /
    • 2006
  • Purpose: This study was to develop and verify the effects of drug misuse and abuse preventive program for middle school students. Methods:This research was a quasi experimental study under the nonequivalent control group with pretest-post test design which tried to protect children from the detrimental effect of drugs and develop a drug abuse prevention program for middle school students. Data was collected from October 10th to 21th, 2005. Subject consisted of 145 middle school students in Kyeonggi, experimental group-72, control group-73. Dick & Carey's(1996) educational system was applied, based on documents and materials online related to drug abuse in order to develop drug abuse prevention program. It's composed of 4 parts, 45 minute each. The evaluation instrument testing for the knowledge about drugs was a criterion of referenced test items modeled by Dick & Carey. The instrument for attitudes about drugs was modeled by Kim, Soyaja. A pre-test was taken on the knowledge and attitudes to drugs. The experimental students were given four sessions of drug abuse prevention education. A post-test similar to the pre-test questionnaire was given in 1 week, 4 weeks following the last session. Collected data was analyzed by using SAS 9.1 program. Results:Followings are the summarized result of study 1. The experimental group, that attended the drug abuse prevention program will have more knowledgable about drugs than the control group (F=27.31, p<.0001). 2. The experimental group, that attended the drug abuse prevention program displayed greater negativism attitude than the control group (F=0.58, p=0.4477). Conclusion:The results conclude that drug abuse prevention programs increase the knowledge of middle school students but doesn't change their attitude toward drugs. Therefore we need to offer them more systematic education to increase their knowledge so it will also improve their attitudes as well.

Development and Evaluation of Criterion-Referenced Performance Assessment Items Based on the 7th National Science Curriculum -Subject Unit of Reproduction and Biological Accumulation- (제7차 교육과정에 근거한 준거지향적 수행평가 문항의 개발과 평가 -고등학교 과학 "생식"과 "생물 농축" 단원을 중심으로-)

  • Chung, Young-Lan;Park, Jin-Joo
    • Journal of The Korean Association For Science Education
    • /
    • v.24 no.3
    • /
    • pp.519-531
    • /
    • 2004
  • In recent years, there has been an increased emphasis on performance assessment to evaluate students' abilities. Our nation has introduced a change in testing and assessment. Additional work on the efficacy, reliability, and comparability in order to develop the performance assessment item has been needed in the enforcement of the 7th National Science Curriculum. Also, criteria for professional and technical standards has been needed to be developed. The purpose of this study was to draw out various key concepts and to develop achievement standards, assessment standards and performance assessment items based on the 7th National Science Curriculum on the subject matter of reproduction(chapter 13) and biological accumulation(chapter 17). And also, this study examined the validity of completed performance assessment items based on classical test theory and polytomous item response theory. Twelve key concepts in chapter 13(reproduction) and four from chapter 17(biological accumulation) were abstracted. Twenty-six achievement standards in chapter 13(reproduction), and nine in chapter 17(biological accumulation) were developed. The achievement standards were determined in terms of knowledge(K), process skill(P) and attitude(A). Twenty-five assessment standards in chapter 13(reproduction) and nine in chapter 17(biological accumulation) were developed. Based on the developed achievement standards and assessment standards, twenty-two performance assessment items(seventeen open-ended questions, three essays, and two portfolios) with concrete grading criteria were developed. Eight open-ended items were applied to 240 10th graders to evaluate reliabilities of the test which consisted of four items per each chapter. The results would be suggested that the applied items were valid for performance assessment because item difficulties and item discriminations were proper. There was not much differences in item discrimination between interpretation from classical test theory and that from polytomous item response theory. However, there were some differences in item difficulties between the interpretations of two theories because the characteristics of examinees were reflected in classical test theory.

Development of National Curriculum-Based Assessment Standards and Instruments for High School Common Science (국가 교육과정에 근거한 공통과학 평가 기준 및 평가 도구 개발 연구)

  • Lee, Yang-Rak;Lee, Sun-Kyung;Hong, Mi-Young;Hong, Jae-Sig
    • Journal of The Korean Association For Science Education
    • /
    • v.19 no.1
    • /
    • pp.159-172
    • /
    • 1999
  • This is the second year study of ''The Development of Model of National Criterion- Referenced Assessment Standards" that had started in 1997. In the study, national assessment standards for high school common science were developed based on national curriculum. In the whole process of developing the standards, high school teachers, university professors and administrators of the Ministry of Education have participated as the "developing group" or "consulting group". Through various activities such as conference, workshop, intensive work, examination by science education experts, the standards and instruments were developed and modified. The research contents can be itemized as follows: - modifying the achievement standards developed in the first year research based on the opinions of various experts(science teachers, professors of science education, philosophers) - developing assessment standards based on the specially designed system. The standards divide students' achievements into three levels(upper/middle/low) and state each level so that it can guide evaluation of achievement. - developing various types of test instruments to probe students' achievement levels for each assessment standard.

  • PDF