• 제목/요약/키워드: assessment item analysis

검색결과 352건 처리시간 0.026초

초등학교 5학년 학생들의 분수 연산능력 평가 문항에 대한 분석 (An Analysis of Assessment Item on the Fraction Computation Ability for Fifth Grade Student)

  • 이강섭;김규상
    • 한국수학교육학회지시리즈C:초등수학교육
    • /
    • 제8권2호
    • /
    • pp.61-68
    • /
    • 2004
  • The purpose of this study is to develope and analyze the assessment items which can be used in evaluation of the fraction computation ability for fifth grade students. The item development team consists of three elementary school teachers and three mathematics education expert. The developed items was analyzed item analysis after applying to 135 fifth grade students. As a results of item analysis, it shows meaningful over the level of a standard basis: reliability: 0.80; validity: item 1(1.05), item 2(1.10), item 3(.85), item 4(.90), item 5(1.08); item difficulty: item 1(-.22), item 2(-.41), item 3(.23), item 4(.40), item 5(-.01); item discrimination: item 1(.73), item 2(.73), item 3(.67), item 4(.51), item 5(.56). This means that the test tool could be useful in the evaluation of the fraction computation ability.

  • PDF

융합형 '과학' 평가에 사용된 선다형 문항의 특성 분석 : 4개 고등학교의 사례 (Analysis of the Characteristics of Multiple-Choice Test Items Used in Integrated Science Assessment: Focused on the Case of Four High School)

  • 이기영;조희형;권석민;김희경;윤희숙
    • 과학교육연구지
    • /
    • 제37권2호
    • /
    • pp.278-293
    • /
    • 2013
  • 이 연구에서는 2009 개정 교육과정에 따른 고등학교 융합형 '과학' 교과에서 사용된 선다형 평가 문항의 특성을 분석하였다. 문항의 출제 경향을 분석하기 위해 융합형 교과에 특이적인 분석틀을 구안하여 분석하였으며, 문항 반응 이론을 적용하여 문항의 특성을 양적으로 분석하였다. 문항의 출제 경향 분석 결과, 문항 출처, 융합 범위, 인지 수준 측면에서 융합형 '과학' 교과의 취지를 제대로 반영하고 있지 못하는 것으로 나타났으며, 기존 교육과정의 과학 교과와 같은 분과적 교수-학습 및 평가가 이루어지고 있음을 알 수 있었다. 문항 반응 이론을 적용한 분석 결과, 대체로 문항의 난이도는 적절하며, 변별도는 높은 수준인 것으로 나타났다. 하지만 분석틀을 구성하는 3개 범주와의 연관성은 나타나지 않았다. 이와 같은 연구 결과를 토대로 융합형 '과학' 교과의 취지를 반영한 교수-학습과 평가를 위한 개선점을 논의하였다.

  • PDF

뇌졸중 환자를 대상으로 실시한 한글판 기능적 보행평가의 타당도 (Validation of the Korean Functional Gait Assessment in Patients With Stroke)

  • 박소연
    • 한국전문물리치료학회지
    • /
    • 제23권2호
    • /
    • pp.35-43
    • /
    • 2016
  • Background: The Functional Gait Assessment (FGA) was developed to measure of gait-related activities. The FGA was translated in Korean but only a few psychometric characteristics had been studied. Objects: The purpose of this study was to evaluate the validity and reliability of the Korean version of FGA scale using Rasch analysis. Methods: The study included 120 patients with stroke (age range=30~83 years; mean${\pm}$standard deviation=$58.3{\pm}11.1$). The FGA and Berg Balance Scale were performed, and were analysed for dimensionality of the scale, item difficulty, scale reliability and separation, and item-person map using Rasch analysis. Results: The 4 rating scale categories of FGA were satisfied with optimal rating scale criteria. The most items of the FGA showed sound item psychometric properties except 2 items ('gait with the horizontal head turns', and 'gait with narrow base of support'), and the 2 misfit items were excluded for all further analyses. The 8 items were arranged in order of difficulty. The most difficult item was 'gait with eyes closed', the middle difficult item was 'gait level surface', and the easiest item was 'gait with vertical head turns.' A person separation reliability was .93 and the person separation index was 3.57. Conclusion: This study suggests that the 8-item Korean FGA are valid measure of assess the gait-related balance performance, and to set the goal of rehabilitation plan in patient with stroke.

어린이집 프로그램 평가척도의 개발을 위한 예비연구 (The Development of Assessment Scales for Day Care Programs)

  • 이은해;송혜린;신혜영;최혜영
    • 아동학회지
    • /
    • 제23권4호
    • /
    • pp.199-213
    • /
    • 2002
  • This preliminary study aimed to develop 3 assessment scales for self-evaluation by day care directors and teachers. The development of major areas of evaluation and items as well as evaluation criteria for each item was based on the analysis of related research and major evaluation instruments. A panel of experts in early childhood education examined the contents. The 3 preliminary scales were administered in 87 day care centers; data were analyzed by item response distribution, item discrimination, and reliability of the scales. Items indicating low item discrimination were deleted and minor revisions were made to improve psychometric characteristics of each scale. The final version of the 3 scales is valid for use as self-evaluation instruments by day care directors and teachers.

  • PDF

온라인 평가 문항 관리 시스템의 설계 (Design of Online Assessment Item Management System)

  • 이영석;조정원
    • 컴퓨터교육학회논문지
    • /
    • 제15권6호
    • /
    • pp.33-41
    • /
    • 2012
  • 본 논문은 온라인 평가 문항 관리 시스템 및 방법에 관한 것이다. 제안하는 시스템은 학습자 정보 데이터베이스, 영역별, 난이도별로 분류된 문항을 저장하는 문제 은행 데이터베이스로 구성된다. 문제은행 데이터베이스는 특정 학습자 또는 특정 학습에 대한 문항을 선택하는 문항 선제부, 특정 학습자 또는 특정 학습에 대한 문제지를 제작하는 문제지 제작부, 문제지를 저장하였다가 온라인 테스트를 수행할 때 출력하는 문제지 은행 데이터베이스를 포함한다. 학습자가 온라인 테스트에 제출된 문항들에 대한 문항 선택순서, 보기 선택순서, 클릭회수, 반응시간에 대한 평가 데이터를 수집하는 온라인 테스트부, 및 온라인 테스트부로부터 수집된 학습자의 온라인 테스트 평가 데이터를 분석하여 학습자의 학습수준과 학습능력, 성향을 평가하고 진단하여 보고하는 반응패턴 분석부를 포함한다. 제안하는 시스템은 보기 선택순서, 클릭회수, 반응시간을 학습자의 평가결과에 반영하여 효율적으로 학습자의 학습수준과 학습능력, 성향을 평가하고 진단할 수 있다.

  • PDF

경관영향평가 항목선정을 위한 타당성 평가 (Estimation of Validity for Item Selecting of Landscape Impact Assessment)

  • 오명성;조현주;이현택;나정화
    • Current Research on Agriculture and Life Sciences
    • /
    • 제26권
    • /
    • pp.7-15
    • /
    • 2008
  • This research is significant in terms of estimating the validity by setting evaluation items in view of integrating not only original beauty and visual areas but also natural ecological areas based on questionnaire. The results are as follows. 1) According to literature study, 17 items such as variety, the character of sites in terms of landscape, the beauty of landscape, visibility, and ratio of green visibility are selected. Also, 21 items such as variety of animals and plants species, size of green area, and ecological naturalness are selected in the area of landscape ecological resources. 2) As a result of questionnaire of the group of landscape experts, animals and plants ecological areas show 5.6341, the highest in importance analysis according to assessment areas. Also, as a result of importance analysis on items in each area, for example, in the area of visual resources, the item of skyline analysis is 6.0488, the highest. 3) As a result of corelation of item meaning on landscape effect assesment, for example, meaningness of psychological assessment item and landscape site item indicate 0.710, the highest coefficient correlation. 4) As critical assessment items per unit project, items such as damage minimization of original land features for project in terms of point, ratio of green visibility, variety of animals and plants species marked above 8.0 as high important medium. The project in terms of line, minimization of original land features, preservation of worthy biotope showed high point and the character of sites in terms of landscape, minimization of original land features, the size of green area, and analysis of skyline marked above 8.5 as high importance points. On the contrary, items for climate and soil showed relatively low points.

  • PDF

수학과 평가 문항의 역량 및 표현 형식 특성 분석 (Feature analysis for competency and representation type of mathematics assessment)

  • 박지현
    • 한국수학교육학회지시리즈A:수학교육
    • /
    • 제60권2호
    • /
    • pp.209-228
    • /
    • 2021
  • 본 연구의 목적은 수학과 교육과정에 기반한 평가 문항을 출제할 때 핵심적으로 고려되는 수학 역량 및 표현 형식 특성을 규명하고, 수학과 평가 문항에서 수학 역량 및 표현 형식 특성이 나타나는 양상을 밝히는 것이다. 선행 연구 분석 및 2차에 걸친 타당성 조사를 통해, 수학과 교육과정에 기반한 평가 문항의 특성으로 수학 역량 범주는 12가지, 표현 형식 범주는 8가지를 추출하고, 이 특성들로 구성된 문항 특성 분석틀을 개발하였다. 또한 문항 특성 분석틀을 활용하여 2019년 국가수준 학업성취도 평가 문항을 분석한 결과, 중 고등학교의 수학 역량 및 표현 형식 특성에 대한 출제 경향에 차이가 있음을 밝혔다.

고등학교 가정과의 역량 기반 평가 프레임워크 개발 (Development of a Competency-Based Assessment Framework for High School Home Economics)

  • 최영선;박미정
    • Human Ecology Research
    • /
    • 제62권2호
    • /
    • pp.197-216
    • /
    • 2024
  • The dual aims of this study were i) to explore the effectiveness of an assessment method that can measure the competencies learned through high school home economics and ii) develop a competency-based assessment framework for high school home economics, including components of competency-based assessment, item development system, and other elements. The study involved the following stages: literature analysis, draft development, concretization, validity verification, and final version. Based on the results of the literature analysis and expert deliberation, a competency-based assessment framework for high school home economics was developed which comprised three dimensions: 'core idea' × 'process' × 'context and value'. Based on this, an item analysis table was presented to systematize various activities for the competency-based assessment of high school home economics, following which a holistic rubric suitable for competency assessment tasks was proposed by referring to the newly developed 'process' dimension assessment criteria. This study is significant in that it is the first to propose a competency-based assessment framework that systematically presents a variety of assessment activities. These will facilitate the development of assessment standards and tools that teachers can refer to or trust in the changing school environment due to the introduction of the standard-based assessment system based on a high school credit system, and the guarantee of a minimum achievement level.

라쉬분석을 이용한 세 가지 요통 장애 설문지의 비교 (A Comparison of Three Low Back Disability Questionnaires With Rasch Analysis)

  • 김경모;박소연;이충휘
    • 한국전문물리치료학회지
    • /
    • 제18권3호
    • /
    • pp.94-102
    • /
    • 2011
  • The purpose of this study was to review existing assessment tools for patients with low back pain and improve them through combination. A total of 314 patients with low back pain participated. Their condition was assessed using the Oswestry Disability Questionnaire (ODQ), the Quebec Back Pain Disability Scale (QBPD), and the Back Pain Functional Scale (BPFS). Rasch analysis was applied to identify inappropriate items, item difficulties, and the separation index. In this study, the 'sex life' item of the ODQ (10 items) and the 'sleeping' item of the BPFS (12 items) showed misfit statistics, whereas all items of the QBPD (20 items) were appropriate. After combining the ODQ, QBPD and BPFS, Rasch analysis was applied. The 'pain intensity', and the 'sex life' item of the ODQ and the 'throw a ball' item of QBPD showed misfit statistics. These 3 items were retained for further analysis. The remaining 42 combined ODQ-QBPD-BPFS items were arranged according to difficulty. For all subjects, the most difficult item was 'pain intensity', whereas the easiest was 'take food out of the refrigerator'. As the separation index of 42 combined ODQ-QBPD-BPFS was higher than that of the three questionnaires separately, difficulty of items varied with some need for rearrangement. The results of this study confirmed the possibility and need for a new back pain disability assessment tool, and produced one. Further study is needed to refine the questionnaire in consideration of psychosocial and occupational factors.

2011-2016년 의학교육평가컨소시엄 임상종합평가의 효과성 (Effectiveness of Medical Education Assessment Consortium Clinical Knowledge Mock Examination (2011-2016))

  • 이상엽;이예리;김미경
    • 의학교육논단
    • /
    • 제20권1호
    • /
    • pp.20-31
    • /
    • 2018
  • Good assessment is crucial for feedback on curriculum and to motivate students to learn. This study was conducted to perform item analysis on the Medical Education Assessment Consortium clinical knowledge mock examination (MEAC CKME) (2011-2016) and to evaluate several effects to improve item quality using both classical test theory and item response theory. The estimated difficulty index (P) and discrimination index (D) were calculated according to each course, item type, A (single best answer)/R (extended matching) type, and grading of item quality. The cut-off values used to evaluate P were: >0.8 (easy); 0.6-0.8 (moderate); and <0.6 (difficult). The cut-off value for D was 0.3. The proportion of appropriate items was defined as those with P between 0.25-0.75 and D ${\geq}0.25$. Cronbach ${\alpha}$ was used to assess the reliability and was compared with those of the Korean Medical Licensing Examination (KMLE). The results showed the recent mean difficulty and decimation index was 0.62 and 0.20 for the first MEAC CKME and 0.71 and 0.19 for the second MEAC CKME, respectively. Higher grade items evaluated by a self-checklist system had better D values than lower grade items and higher grade items gradually increased. The preview and editing process by experts revealed maintained P, decreased recall items, increased appropriate items with better D values, and higher reliability. In conclusion, the MEAC CKME (2011-2016) is deemed appropriate as an assessment to evaluate students' competence and prepare year four medical students for the KMLE. In addition, the self-checklist system for writing good items was useful in improving item quality.