• Title/Summary/Keyword: Structured evaluation items

Search Result 54, Processing Time 0.022 seconds

Exploring automatic scoring of mathematical descriptive assessment using prompt engineering with the GPT-4 model: Focused on permutations and combinations (프롬프트 엔지니어링을 통한 GPT-4 모델의 수학 서술형 평가 자동 채점 탐색: 순열과 조합을 중심으로)

  • Byoungchul Shin;Junsu Lee;Yunjoo Yoo
    • The Mathematical Education
    • /
    • v.63 no.2
    • /
    • pp.187-207
    • /
    • 2024
  • In this study, we explored the feasibility of automatically scoring descriptive assessment items using GPT-4 based ChatGPT by comparing and analyzing the scoring results between teachers and GPT-4 based ChatGPT. For this purpose, three descriptive items from the permutation and combination unit for first-year high school students were selected from the KICE (Korea Institute for Curriculum and Evaluation) website. Items 1 and 2 had only one problem-solving strategy, while Item 3 had more than two strategies. Two teachers, each with over eight years of educational experience, graded answers from 204 students and compared these with the results from GPT-4 based ChatGPT. Various techniques such as Few-Shot-CoT, SC, structured, and Iteratively prompts were utilized to construct prompts for scoring, which were then inputted into GPT-4 based ChatGPT for scoring. The scoring results for Items 1 and 2 showed a strong correlation between the teachers' and GPT-4's scoring. For Item 3, which involved multiple problem-solving strategies, the student answers were first classified according to their strategies using prompts inputted into GPT-4 based ChatGPT. Following this classification, scoring prompts tailored to each type were applied and inputted into GPT-4 based ChatGPT for scoring, and these results also showed a strong correlation with the teachers' scoring. Through this, the potential for GPT-4 models utilizing prompt engineering to assist in teachers' scoring was confirmed, and the limitations of this study and directions for future research were presented.

Study on the development of Quantitative assessment indicator of safety culture for the construction site (건설현장 안전문화의 정량적 평가지표 개발에 관한 연구)

  • Jun, Heakyung;Kwon, Changhee
    • Journal of the Society of Disaster Information
    • /
    • v.12 no.4
    • /
    • pp.403-411
    • /
    • 2016
  • The objectives of this study is to develop evaluation indicators for the quantitative evaluation of construction safety culture level in order to prevent accidents by evaluating the level of safety culture and each safety culture elements of the construction site and to present the areas that should be focused on improvements. In this study, it was presented assessment indicators of the construction safety culture by analyzing previous studies for safety culture, by categorizing items as an important element of safety culture hierarchically and by reflecting the opinion of the construction site professional personnels using AHP analysis methodology. The assessment indicators of the construction safety culture were structured the details of the leadership, systems, and personal characteristics and derived weighted value by the pairwise comparison to quantify the detail assessment indicators in order to assess the construction safety culture level. This study presents a safety culture assessment indicators for the construction site to suggest directions for improving the construction site safety culture and prevent the accidents of the construction site by derived via a safety culture assessment of construction site.

Science Teachers' Perceptions of Science Practices (과학과 행동영역에 대한 과학 교사들의 인식 조사)

  • Park, Hyun-Ju;Jeong, Dae-Hong;Choi, Won-Ho
    • Journal of The Korean Association For Science Education
    • /
    • v.31 no.1
    • /
    • pp.61-77
    • /
    • 2011
  • This study investigates science teachers' perceptions of science practices for science assessment. Science practices have information about students' ability to understand scientific knowledge and to perform scientific inquiry. For this study, seven science teachers, who have served for more than five years in secondary schools in Seoul, were chosen. A structured questionnaire consisting of twenty-seven items were used in National Assessment of Educational Achievement. And then, in-depth interviews followed. Co-workers analyzed and discussed the questionnaire and interviews. As results show, science teachers tend to determine science practices based on materials and way to present materials included in questions. Science teachers tend to recognize science practice as different, depending on information and thinking process, which is expected in solving them. In addition, they have a variety of the level of definition and understanding about science practices.

Analysis of mathematics test structures and tasks in Abitur (독일 아비투어(Abitur)의 수학시험 체제 및 문항 분석)

  • Kim, Seong-kyeong;Lee, Miyoung
    • The Mathematical Education
    • /
    • v.61 no.2
    • /
    • pp.287-303
    • /
    • 2022
  • The purpose of this study is to draw implications for the improvement in the CSAT by analyzing structures and tasks in the Abitur. To this end, it analyzes the mathematics test system with a focus on the basic and advanced level examination systems, the operator, the using technology, and mathematical formulas. And the characteristics of tasks in the 2021 Abitur were analyzed. As a result of the analysis, first, Germany evaluates whether students have the competency emphasized in the curriculum at Abitur. Second, Germany, which emphasizes the proper use of technology, utilizes both tasks that use technology and those that do not in the Abitur. Third, the Abitur consists of most of the tasks using promised operators and uses various types of operators to present various types of questions to evaluate competence. Fourth, the Abitur includes not only simple structured items consisting of 2-3 subtasks but also tasks dealing in depth with a single situation centered on a big idea. Finally, mathematical justification and proof play an important role in the Abitur. Based on this, some specific measures for improving the CSAT were suggested.

The Comparative Study of Family Dynamics between Families of Problem Students and of Normal Students (문제학생가족과 정상학생가족의 가족역동 비교연구)

  • 김윤희;문희자
    • Journal of Korean Academy of Nursing
    • /
    • v.23 no.2
    • /
    • pp.187-206
    • /
    • 1993
  • The study was done to better understand problem behavior in high school students as described in family system theory, which explains the individual’s problem within the family interactions. The purpose of the study 1. To analyze the difference in the parents’ relationship as a couple between the two groups. 2. To analyze the difference in the parent-adolecent relationship between the two groups. 3. To analyze the difference in the family function (cohesion adaptability) between the two groups. The method of the study The staudy subjects consisted of a total of 176 families (528 persons), 109 high school students (End grade) with problem behavior and their parents (problem family group) ,and 69 high school students (same grade) with normal behavior and their parents (normal family group) residing in the Seoul area. Data were gathered from structured, self-reporting qestionaires which included a Couple Relation measurement (95 items) , Parent-Adolescent communication measurement (20 items), Family Cohesion Adaptability Scale (20 items) by DavidH. Olson et al., and a behavior evaluation tool. The results of the study 1. The results as related to the hypothesis were as follows. Hypothesis 1 : “satisfaction within The couple's relationship of the parents of problem family group will be lower than the normal family group was supported significantly(t=3.07, p=.005). Hypothesis 2: “The parent-adolescent relationship of the problem family group will be more negative and problematic than the normal family group” was supported significantly(t=4.06, p=.000). Hypothesis 3: “The family function (cohesion adaptability) of the problem family group will be lower than the normal family group" was supported significantly(t=2.20, p=.022) 2. The results of related analysis were as follows 1) Analysis of a causal relation between the couple’s relationship, the parent-adolescent relationship, family function and adolescent behavior showed that the Above 3 variables influenced adolescent behavior.. In cases where couple’s relation-ship, the parent-adolescent’s relationship, the family function are the better, their adolescent’s behavior is better. 2) Discriminant analysis of the research tool showed The discriminant ability of couple’s relationship tool was 75.57%, the Parent-Adolescent communication tool, 67.05, the family adaptability cohesion tool.67. 61%. In summary, interpersonal relationships in the family subsystems are interactive and their relation influences the behaviors. of adolescents in the family. Therefore, family therapy would be a more effective method than individual therapy, to resolve negative problem for adolscents, and the research tool used in this study are very useful for family system diagnosis and nursing intervention.

  • PDF

Predictability of the completeness of medical recording of quality of care for inpatients (의무기록 완성도의 입원환자 진료적정성에 대한 예측도 평가)

  • Park, Un Je;Park, Eal Whan
    • Quality Improvement in Health Care
    • /
    • v.3 no.2
    • /
    • pp.60-68
    • /
    • 1997
  • Background : Medical records are used to assess clinical performance of physicians and quality of care. The contents which are written in medical records are considered as the objective evidences to know what the doctors think about the patient's problems. But the problem to use medical records as the assessment tools is the incompleteness of medical recording. The purpose of this study is to know if the completeness of medical recording is correlated to quality of care for inpattients and it can predict physicians's quality of care. Method : 32 clinical physicians reviewed 200 patients' medical records who were selected randomly from the inpatients who were admitted to the university hospital during July, 1995 and June, 1996. The reviewers used the structured evaluation questionnaires which were composed of two part. One part evaluated the completeness of the medical recording and the other evaluating appropriateness of diagnosis and treatment processes. We summated the scores of each items and calculated percentile scores. Results : The mean percentile score of completeness of the medical recording was 67.9% in 1995 and 79.8% in 1996. The mean percentile score of appropriateness was 52.2% in 1995 and 69.5% in 1996. This change between 1995 and 1996 was statistically significant. In non-surgical patients, the percentile scores of the completeness and those of the appropriateness were correlated positively and this correlation was statistically significant(p<0.05). In surgical patients, the positve correlation between the completeness and the appropriateness was also statistically significant(p<0.05). Discussion : In conclusion, the completeness of medical recording is considered as the good predictor of the quality of care for inpatients.

  • PDF

Effect of the Education on AIDS for Korean Health Care Workers (건강 관리자의 에이즈 교육 효과)

  • 장순복;이창우
    • Journal of Korean Academy of Nursing
    • /
    • v.27 no.1
    • /
    • pp.201-211
    • /
    • 1997
  • This study was an evaluation study of AIDS education program. The purpose of this study was to clarify the education effects on AIDS for health care workers to develop a better next education program. This study was done by self reporting with a 67 items of structured questionnaire by 431 health care workers included doctors, nurses, laboratory technicians, and health educators. Data were collected at the time of completion of each AIDS education with the help of education program manager. Both the AIDS related knowledge score and the acceptance attitudes score were significantly higher in the male group, in the medical institution employer group, in the group who have met the HIV infected person, who has known the HIV positive person, and the group of laboratory technician, but the AIDS prevention intention score was statistically higher in the group of female and laboratory technician group. The post education scores of AIDS related knowledge. acceptance attitudes, and preventive intention were statistically higher than those of the preeducation. The most increased item among AIDS prevention intention list was 'I will provide the meeting between the HIV infected persons and the public (+21.9%)'. But even the decreased item among AIDS prevention intention list was 'I will advice to female not to have extra marital sexual contact to avoid AIDS(-3.1%)'. It could be concluded that the health care workers were ignorant of vertical transmission of AIDS, they were afraid of disclosing the infection status, and have less AIDS prevention intention. Therefore it is needed to take an assessment process before each new education trategy to increase AIDS related the effect of the education on AIDS.

  • PDF

Level of Third-Year Students' Competency and Correlating Curricular Factors (3학년 학생의 역량수준과 관련 요소)

  • Kam, Beesung;Lee, Sang Yeoup;Im, Sun Ju
    • Korean Medical Education Review
    • /
    • v.15 no.2
    • /
    • pp.87-92
    • /
    • 2013
  • The purpose of this study was to assess third-year medical students' competency for development or revision of the undergraduate curriculum and assessments. One hundred and twenty-seven third-year medical students at the Pusan National University were included in the study. After third- and fourth-year students took a common written examination, clinical performance examination (CPX), and objective structured clinical examination (OSCE) with common items as a summative assessment, the third-year students' competency was compared with 132 forth-year students' results. The correlation of the written examination and CPX/OSCE was analysed, and the summative results were compared with the grade point average (GPA) through the second year, CPX/ OSCE in the second year, and GPA in the clerkship. On the written examination, the third-year students' mean score was lower than the fourth-year students' by over 11 points, whereas the gap in the CPX/OSCE was 4 points and there was no difference in the OSCE. There was a moderate correlation between the written examination and the CPX/OSCE scores (R=0.371, p<0.01). The written examination was highly correlated with GPA through the second year, which mainly evaluated medical knowledge (R=0.771, p<0.01). A relatively high correlation was observed between CPX/OSCE scores and GPA in the clerkship (R=0.641, p<0.01). The summative CPX/ OSCE scores showed a moderate correlation with formative CPX/OSCE scores in the second year (R=0.464, p< 0.01). The third-year students' score was quite low on the written examination and slightly low on the CPX/OSCE compared to that of the fourth-year students. The written examination and CPX/OSCE cannot replace each other and should be combined with other methods of evaluation to measure competency. Early OSCE and workplacebased assessment should be useful in the early assessment of clinical skills competency.

The Elements of E-Portfolio - Focused on the Portfolio of IT Company Designers (e-포트폴리오의 구성에 관한 연구-IT기업 디자이너 포트폴리오를 중심으로)

  • Park, Min-kyung;Jang, Sun-hee
    • The Journal of the Korea Contents Association
    • /
    • v.19 no.6
    • /
    • pp.204-213
    • /
    • 2019
  • This study examines the ePortfolio structure of IT company design interns and the differences among companies in 'Cofolios' site for employment of design major students. First, we examine the common configuration steps of ePortfolios [1. Project Brief${\rightarrow}$2-1. Investigation and Analysis${\rightarrow}$2-2. Strategy development${\rightarrow}$2-3. Virtualization, Final Design${\rightarrow}$2-4. Presentation, Evaluation, and Improvement${\rightarrow}$3. Read More]. Secondly, all the sub-items used in the ePortfolio were organized into words and classified into 6 stages. Finally, this was analyzed by majors and companies. Through this, the interns of the IT companies can [2-2. Strategy development] and that they are actively utilizing the 'connectivity' attribute linking the links. In addition, interns confirmed that the ePortfolio was structured differently depending on their major and the desired company.

Feature analysis for competency and representation type of mathematics assessment (수학과 평가 문항의 역량 및 표현 형식 특성 분석)

  • Park, Ji Hyun
    • The Mathematical Education
    • /
    • v.60 no.2
    • /
    • pp.209-228
    • /
    • 2021
  • The purpose of this study is developed the Item Feature Analysis (IFA) frameworks for curriculum-based assessments, focusing on Math competency and representation in secondary schools and implemented the IFA in National Assessment of Educational Achievement. To conduct the study, previous studies were analyzed, and feasibility studies were conducted twice. As a result of the study, we structured the IFA framework based on the 2015 revised mathematics curriculum in Korea and developed a method to analyze the characteristics of the math items. The results of structuring the framework for math included two categories: math competency in the content aspects, and representation type in the formal aspects. Specifically, 12 features of math competency and 8 features of representation type were identified, and an item feature analysis framework composed of these features was developed. The math competency was developed based on the subject competency of 2015 national curriculum. Math assessments in high schools, which have been changed to the competency-based assessments, had more frequency of the feature of math competency compared to middle schools. In this study, implemented the IFA in National Assessment of Educational Achievement and explored the way of ensuring the validity. These have been proved as critical applications for ensuring the validity of curriculum-based student assessment as well as building a tool for assessment.