Search | Korea Science

Automatic Evaluation of Korean Free-text Answers through Predicate Normalization (서술어 정규화를 이용한 한국어 서술형 답안의 자동 채점)

Bae, Byunggul;Park, II-Nam;Kang, Seung-Shik
- Annual Conference on Human and Language Technology
- /
- 2012.10a
- /
- pp.121-122
- /
- 2012
컴퓨터를 사용한 서술형 답안의 자동채점은 채점의 편의성과 객관성을 제고하기 위하여 많은 연구자들이 연구해 왔으며 자동채점의 성능을 향상시키기 위해 여러 가지 방법들이 제안되었다. 본 논문은 서술어 정규화를 통하여 서술형 답안의 자동채점 정확도를 높이고자 하였다. 기존의 다른 채점 방법들과 비교했을때 서술어 정규화 기법을 적용한 채점 방식은 기존의 방법들보다 유사도 계산 정확도가 향상되어 정답 판별 정확도가 향상되는 것을 확인할 수 있었다. 서술어 정규화는 기존의 모든 서술형 답안 채점 방법에 추가적으로 적용할 수 있는 범용성을 가지고 있다. 따라서 서술어 정규화는 기존 방법들의 자동채점 정확도를 향상시켜 보다 정확하게 서술형 답안을 채점할 수 있다.
PDF

Design and Implementation of an Automatic Scoring Model Using a Voting Method for Descriptive Answers (투표 기반 서술형 주관식 답안 자동 채점 모델의 설계 및 구현)

Heo, Jeongman;Park, So-Young
- Journal of the Korea Society of Computer and Information
- /
- v.18 no.8
- /
- pp.17-25
- /
- 2013
TIn this paper, we propose a model automatically scoring a student's answer for a descriptive problem by using a voting method. Considering the model construction cost, the proposed model does not separately construct the automatic scoring model per problem type. In order to utilize features useful for automatically scoring the descriptive answers, the proposed model extracts feature values from the results, generated by comparing the student's answer with the answer sheet. For the purpose of improving the precision of the scoring result, the proposed model collects the scoring results classified by a few machine learning based classifiers, and unanimously selects the scoring result as the final result. Experimental results show that the single machine learning based classifier C4.5 takes 83.00% on precision while the proposed model improve the precision up to 90.57% by using three machine learning based classifiers C4.5, ME, and SVM.
https://doi.org/10.9708/jksci.2013.18.8.017 인용 PDF KSCI

An Autonomous Assessment of a Short Essay Answer by Using the BLEU (BLEU 를 활용한 단기 서술형 답안의 자동 채점)

Cho, Jung-Hyun;Jung, Hyun-Ki;Park, Chan-Young;Kim, Yu-Seop
- 한국HCI학회:학술대회논문집
- /
- 2009.02a
- /
- pp.606-610
- /
- 2009
We propose a method utilizing BLEU(BiLingual Evaluation Understudy), which is widely used in automatic evaluation of machine translations, for an autonomous assessment of a short essay answer. BLEU evaluates translations with an assumption that the translation by a machine is supposed to be more accurate as it is getting to be more similar to the translation by a human. BLEU scores the translation by comparing the n-grams of translations by a machine and humans. Similarly we score students answers by comparing to multiple reference answers with BLEU. In the experiment, we compute correlation coefficient values between scores of our system and human instructors.
PDF

Strengthening the Instruction-Assessment Alignment: Development of Items for Essay-Type Assessment Based on the Achievement Standards (수업과 평가 일체화를 위한 성취기준 중심 가정과 서술형 평가 문항개발 연구)

Yang, Ji Sun;Lee, Gyeong Suk
- Journal of Korean Home Economics Education Association
- /
- v.32 no.3
- /
- pp.135-159
- /
- 2020
The purpose of this study was to develop items of an essay response assessment that could align with the instructions and assessments in the high school home economics curriculum. The contents of the study were as follows. First, to establish an assessment plan, 14 achievement standards were analyzed in the assessment area, and the elements of the questions were developed including the content elements of a total of 29 questions. Second, to develop the assessment tools, preliminary questions suited to the structure of essay questions were developed, and the method of presenting data and scoring criteria to be utilized in the questions was selected. Third, to prepare the answers and the scoring criteria tables, the answers to the sample questions for each score were prepared in form of a scoring criteria table, and the objectives of the assessment, the scoring items, and the scores for each item were reviewed. Fourth, the developed questions and answers were revised and supplemented by teachers of the professional learning community through preliminary and mutual review on the components of the questions, the embodiment of the assessment objectives, the implementation of the assessment intent, and the grading. This study can be used as a foundational study for the development of essay-type questions and scoring criteria in essay assessment in the field of education. Furthermore, the results of this study could help teachers enhance their learners' ability to apply knowledge in the future.
https://doi.org/10.19031/jkheea.2020.09.32.3.135 인용 PDF

Design and Implementation of Short-Essay Marking System by Using Semantic Kernel and WordNet (의미 커널과 워드넷을 이용한 주관식 문제 채점 시스템의 설계 및 구현)

Cho, Woo-Jin;Chu, Seung-Woo;O, Jeong-Seok;Kim, Han-Saem;Kim, Yu-Seop;Lee, Jae-Young
- Proceedings of the Korea Information Processing Society Conference
- /
- 2005.05a
- /
- pp.1027-1030
- /
- 2005
기존 의미커널을 적용한 주관식 채점 시스템은 여러 답안과 말뭉치에서 추출한 색인어들과의 상관관계를 벡터방식으로 표현하여 자연어 처리에 대한 문제를 해결하려 하였다. 본 논문에서는 기존 시스템의 답안 및 색인어의 표현 한계로 인한 유사도 계산오차 가능성에 대한 문제를 해결하고자 시소러스를 이용한 임의 추출 방식의 답안 확장을 적용하였다. 서술형 주관식 평가에서는 문장의 문맥보다는 사용된 어휘에 채점가중치가 높다는 점을 착안, 출제자와 수험자 모두의 답안을 동의어, 유의어 그룹으로 확장하여 채점 성능을 향상시키려 하였다. 우선 두 답안을 형태소 분석기를 이용해 색인어를 추출한 후 워드넷을 이용하여 동의어, 유의어 그룹으로 확장한다. 이들을 말뭉치 색인을 이용하여 단어들 간 상관관계를 측정하기 위한 벡터로 구성하고 의미 커널을 적용하여 정답 유사도를 계산하였다. 출제자의 채점결과와 각 모델의 채점 점수의 상관계수 계산 결과 ELSA 모델이 가장 높은 유사도를 나타내었다..
PDF

Exploring automatic scoring of mathematical descriptive assessment using prompt engineering with the GPT-4 model: Focused on permutations and combinations (프롬프트 엔지니어링을 통한 GPT-4 모델의 수학 서술형 평가 자동 채점 탐색: 순열과 조합을 중심으로)

Byoungchul Shin;Junsu Lee;Yunjoo Yoo
- The Mathematical Education
- /
- v.63 no.2
- /
- pp.187-207
- /
- 2024
In this study, we explored the feasibility of automatically scoring descriptive assessment items using GPT-4 based ChatGPT by comparing and analyzing the scoring results between teachers and GPT-4 based ChatGPT. For this purpose, three descriptive items from the permutation and combination unit for first-year high school students were selected from the KICE (Korea Institute for Curriculum and Evaluation) website. Items 1 and 2 had only one problem-solving strategy, while Item 3 had more than two strategies. Two teachers, each with over eight years of educational experience, graded answers from 204 students and compared these with the results from GPT-4 based ChatGPT. Various techniques such as Few-Shot-CoT, SC, structured, and Iteratively prompts were utilized to construct prompts for scoring, which were then inputted into GPT-4 based ChatGPT for scoring. The scoring results for Items 1 and 2 showed a strong correlation between the teachers' and GPT-4's scoring. For Item 3, which involved multiple problem-solving strategies, the student answers were first classified according to their strategies using prompts inputted into GPT-4 based ChatGPT. Following this classification, scoring prompts tailored to each type were applied and inputted into GPT-4 based ChatGPT for scoring, and these results also showed a strong correlation with the teachers' scoring. Through this, the potential for GPT-4 models utilizing prompt engineering to assist in teachers' scoring was confirmed, and the limitations of this study and directions for future research were presented.
https://doi.org/10.7468/mathedu.2024.63.2.187 인용 PDF

Research of Verifying the Remote Test Answer Sheets Authentication (원격시험 컴퓨터활용 답안지 진본성 검증에 관한 연구)

Park, Kee-Hong;Jang, Hae-Sook
- Journal of the Korea Society of Computer and Information
- /
- v.17 no.3
- /
- pp.135-141
- /
- 2012
Development of the Internet has brought many changes in methods of education and assesment. When enforcing the on-line distance education, the tests to check the outcomes of the learning are taken on the Internet. The current trends of education evaluation are focused on the types of questions and the detachments of exam proctor but verifying the authentication of answer sheet. There are several forms to make answers; selection type, short-answer type, write-out answer type, practical exercise type, etc. All the forms can be done on the Internet except the practical exercise type because the source of the examinee's answer sheet is unreliable. In this paper, we made the verification system to solve the doubt by setting the proved information on the answer sheet. Putting the information down to confirm the authenticity during the exam on the server is distinct character of this system. After the test finished, the system will operate when examinee turn in the answer sheet.
https://doi.org/10.9708/jksci.2012.17.3.135 인용 PDF KSCI

The defects of questions of descriptive assessment in elementary school mathematics and the suggestions for its improvement -focusing on the questions produced by Gyeonggi Provincial Office of Education (초등 수학과 서술형 평가문항의 문제점과 개선방안 -경기도 교육청 창의.서술형 평가 문항을 중심으로-)

Chang, Suchin;Kim, Soomi
- Journal of Elementary Mathematics Education in Korea
- /
- v.18 no.2
- /
- pp.297-318
- /
- 2014
This study is designed for helping elementary school teachers have an insight into making or choosing questions of descriptive assessment in mathematics. For this, it is analyzed 30 descriptive mathematical questions produced by Gyeonggi Provincial Office of Education in 2011 and 2012 and 3rd to 6th grade students' papers marked by their teachers in charge from 2 elementary schools located in Gyeonggi Province. The main focus of analysis is the errors of students' answers and teachers' marking not from their own mistakes but from the defects of questions themselves. As a result of analysis, 7 cases of problematic situations are induced and they are reorganized into 3 categories as follow: i) case of not performing unique purpose of descriptive assessment, ii) case of inducing the problem of fairness of grading, iii) case of leading students erroneous direction.
PDF

The development and application of the descriptive evaluation questionnaire on the Clothing and Textiles section of the middle school Technology & Home Economics textbook (중학교 기술.가정 의생활영역의 서술형 평가문항 개발 및 적용)

Lee, Soo-Kyung;Lee, Hye-Ja
- Journal of Korean Home Economics Education Association
- /
- v.23 no.3
- /
- pp.69-90
- /
- 2011
To develop the descriptive evaluation questionnaire with high validity and reliability on the Clothing and Textiles section of the middle school Technology & Hone Economics textbook, apply it to students and analyze its results. We made out a draft for descriptive evaluation questionnaire that was based upon the concrete establishment of the goal and the range of evaluation. We also made a rubric for scoring as well as sample answer-sheets. Finally, we completed a total of twenty three descriptive evaluation questions and we applied it to sixty five 2nd-grade students in two classes in a middle school. Descriptive evaluation questionnaire exhibited the relative high validity on each question. Moreover, three graders gave the same score on each question of descriptive evaluation, suggesting that descriptive evaluation questionnaire has the high inter-grader reliability and the strong correlation. But, low academic achievement was generally observed in the subjects. They had difficulty in describing their knowledge via their own language and drawing up accurate and detailed answers. They recognized the positive aspects of descriptive evaluation questionnaire, but they felt it uncomfortable due to study-burden and description itself. To overcome these limitations, it is required that students should experience various materials related to subject contents in classes as well as textbooks, concentrate themselves on finding solutions for problems, expand their scope, and practice describe them in advance. Therefore, the additional training for description evaluation questionnaire will be necessary for the more efficient and discriminative questionnaire. Also the questionnaire with high validity and reliability should be developed and the aggressive and voluntary participation of teachers will be needed.
PDF

An Intelligent Marking System based on Semantic Kernel and Korean WordNet (의미커널과 한글 워드넷에 기반한 지능형 채점 시스템)

Cho Woojin;Oh Jungseok;Lee Jaeyoung;Kim Yu-Seop
- The KIPS Transactions:PartA
- /
- v.12A no.6 s.96
- /
- pp.539-546
- /
- 2005
Recently, as the number of Internet users are growing explosively, e-learning has been applied spread, as well as remote evaluation of intellectual capacity However, only the multiple choice and/or the objective tests have been applied to the e-learning, because of difficulty of natural language processing. For the intelligent marking of short-essay typed answer papers with rapidness and fairness, this work utilize heterogenous linguistic knowledges. Firstly, we construct the semantic kernel from un tagged corpus. Then the answer papers of students and instructors are transformed into the vector form. Finally, we evaluate the similarity between the papers by using the semantic kernel and decide whether the answer paper is correct or not, based on the similarity values. For the construction of the semantic kernel, we used latent semantic analysis based on the vector space model. Further we try to reduce the problem of information shortage, by integrating Korean Word Net. For the construction of the semantic kernel we collected 38,727 newspaper articles and extracted 75,175 indexed terms. In the experiment, about 0.894 correlation coefficient value, between the marking results from this system and the human instructors, was acquired.
https://doi.org/10.3745/KIPSTA.2005.12A.6.539 인용 PDF KSCI

Search Result 12, Processing Time 0.027 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)