• Title/Summary/Keyword: 일반화가능도 분석

Search Result 268, Processing Time 0.024 seconds

An Analysis of the Reliability of Group Assessment of Logical Thinking (GALT) using Generalizability Theory (일반화가능도 이론을 이용한 집단논리적사고력검사(GALT)의 신뢰도 분석)

  • Ryu, Chun-Ryol;Lee, Yong-Geun
    • Journal of the Korean earth science society
    • /
    • v.31 no.1
    • /
    • pp.95-105
    • /
    • 2010
  • The purpose of this study lies in applying generalizability theory depending on the aim of the usage of GALT to analyze the sources of error of single-facet considering item and person only and to analyze the sources of error of multi-facet considering item, person and domain. The study was conducted with 1016 students of local elementary, middle, and high schools. The 21 items of a full version were answered for 40 minute and then the 12 items of short version were sampled to analyze reliability using generalizability theory. Both the full version and the short version of the items were analyzed using Cronbach's alpha for data analysis, and we applied generalizability theory and separate $p{\times}i$ design and $p{\times}(i:h)$ design, G study and D study were performed. Results of analysis are as follows: First, the result of D study after $p{\times}I$ design both on the full version and the short version showed that in the case of the full version, the generalizability coefficient was 0.87 exceeding a normal level of 0.80, and the normal level of generalizability coefficient was achieved in 13 items as well. In case of short version, when 12 items were evaluated, generalizability coefficient was 0.77 not reaching the normal level, and the normal level was achieved in case of more than 15 items. Second, the result of D study after $p{\times}(I:H)$ design on the short version showed that once one domain consists of 2 items in 6 domains, generalizability coefficient was 0.71 which is lower than the normal level of 0.80, the normal level was achieved in more than 5 item cases.

Exploring the Application of Generalizability Theory to Mathematics Teacher Evaluation for Professional Development in Korea Based on the Analysis of Instructional Quality Assessment of Mathematics Teachers in the U.S. (미국 수학교사의 교수 질 평가도구 분석을 통한 우리나라 수학 교원능력개발평가에서의 일반화가능도 이론 활용성 탐색)

  • Kim, Sungyeun
    • Communications of Mathematical Education
    • /
    • v.28 no.4
    • /
    • pp.431-455
    • /
    • 2014
  • The purpose of this study was to suggest methods to apply generalizability theory to mathematics teacher evaluation using classroom observations in Korea by analysing mathematics teachers in the U.S. using the instructional quality of assessment instrument as an illustrative example. The subjects were 96 teachers participating in Year 3 and Year 4 from the Middle-school Mathematics and the Institutional Setting of Teaching (MIST) project funded by the National Science Foundation since 2007. The MIST project investigates the following question: What does it takes to support mathematics teachers' development of ambitious and equitable instructional practices on a large scale (MIST, 2007). This study examined data based on both the univariate generalizability analysis using GENOVA program and the multivariate generalizability analysis using mGENOVA program. Specifically, this study determined the relative effects of each error source and investigated optimal measuring conditions to obtain the suitable generalizability coefficients. The methodology applied in this study can be utilized to find effective optimal measurement conditions for the mathematics teacher evaluation for professional development in Korea. Finally, this study discussed limitations of the results and suggested directions for future research.

Generalizability of Polygraph Test Procedures using Backster ZCT: Changes in reliability as a function of the number of relevant questions, the number of repeated tests, and the number of raters (Backster ZCT를 사용한 폴리그라프 검사절차의 일반화가능도: 관련 질문의 개수, 반복측정 횟수, 채점자의 수에 따른 신뢰도의 변화)

  • Eom, Jin-Sup;Han, Yu-Hwa;Ji, Hyung-Ki;Park, Kwang-Bai
    • Science of Emotion and Sensibility
    • /
    • v.11 no.4
    • /
    • pp.553-564
    • /
    • 2008
  • Generalizability theory was employed to examine how the reliability of polygraph test is affected by the number of relevant questions, the number of repeated tests (the number of of charts), and the number of raters(scorers). The data consisted of the results of the polygraph tests administered to 31 crime suspects. The sample was drawn from the real polygraph tests based on Backster ZCT and archived by the Prosecutor's Office of the Republic of Korea. The numerical scores assigned by thirteen raters to the test charts were analyzed to determine the generalizability of the scores. The largest variance component was accounted for by the examinee factor(43.97%) and the residual variance component was 16.84% of the total variance. The variance component due to the interaction between the examinee and the chart factors was 12.17% and the variance component due to the three way interaction of the examinee, the repeated test, and the relevant question factors was 10.31%. The generalizability coefficient for the current measurement procedure as practiced by the Korean Prosecutor's Office was 0.74 which suggests that the current procedure is acceptable. However, measurement procedures with the combination of more than two relevant questions, more than three repeated tests, and more than two raters were generally found to yield generalizability coefficients larger than 0.80. Therefore, such procedures need to be considered seriously in order to significantly improve the reliability of polygraph test.

  • PDF

Analysis of Korea Earth Science Olympiad Items for the Enhancement of Item Quality (한국 지구과학 올림피아드 문항 분석을 통한 문항의 질 향상 방안)

  • Lee Ki-Young;Kim Chan-Jong
    • Journal of the Korean earth science society
    • /
    • v.26 no.6
    • /
    • pp.511-523
    • /
    • 2005
  • The purpose of this study is to analyze the 1st and 2nd Korea Earth Science Olympiad (KESO) items, in order to find informations to enhance item quality. To do this, internal and external item classification frameworks are developed. Item difficulty (P), discrimination index (DI), correlation, and reliability are estimated by using classical test theory. Generalizability is also estimated by applying the generalizability theory. The results of item classification are as follows: (1) ‘Geology’, ‘astronomy’ and ‘data analysis and interpretation’ are dominant in content and inquiry process domain, respectively. Nearly every item has textbook context. (2) There is no difference between the preliminary and final tests in terms of their thinking skills sections. (3) As a whole, the ratio of items with pictures is high in item representation. However, multiple-choice and short answer items are more common in preliminary competition, and essay type items are found more often in final competition. The ratio of simple items is high in middle school section and preliminary competition, but composite items are dominant in high school section and final competition. The findings of item analysis are as follows: (1) In the middle school section, P is low and DI is moderate. But in the high school section, there is a considerable differences between science high schools and other high schools in general. (2) The highest correlation is reported between the scores of meteorology domain and total score in middle school, whereas in high school astronomy domain and total score show the highest correlation. (3) General high school section show the highest Cronbach $\alpha$ and generalizability. (4) General high school section show acceptable generalizability coefficient (> 0.80), but middle and science high school section should increase the number of items to reach acceptable generalizability level.

Analysis of Assessment Types, Scoring Methods and Reliability of Science Performance Assessment in Middle and High School (중등학교 과학 수행평가의 평가 유형과 채점 방식 및 신뢰도 분석)

  • Lee, Ki-Young;An, Hui-Soo
    • Journal of The Korean Association For Science Education
    • /
    • v.25 no.2
    • /
    • pp.173-183
    • /
    • 2005
  • In this study, we questioned what assessment types and scoring methods of science performance assessment(SPA) were being used in middle and high school, and how much these SPA scores were reliable(generalizable). To answer these questions, SPA data obtained from the seven schools were classified according to assessment type and scoring method. Based upon this classification, we analyzed the reliability by applying generalizability theory. The result, from the classification of assessment type and scoring method, showed that SPA types of the seven schools were divided into two types: paper-pencil type and task type. Paper-pencil type included answer(content)-restricted essay-type test solely. Task type has two parts: process and outcome assessment. As the results of analyzing scoring methods of the seven schools, there were two cases in the way of scoring methods: one case is scoring all essay-type items and performance tasks by one teacher, the other is scoring assigned performance tasks by two teachers. But the case of scoring assigned essay-type items or the case of cross scoring by two or more teachers were not found. The findings of the reliability analysis are as follows: (1) Effect of essay-type item to SPA score was larger than that of performance task. (2) There was remarkable difference among the seven schools' interaction effect of person and rater in scoring performance tasks. (3) Most of generalizability(reliability) coefficients of SPA for the seven schools were smaller than the acceptable generalizability coefficient(0.80). Therefore, the population of statistical parameters such as number of item, task and rater, should be increased for approaching the acceptable generalizability level.

An Application of Multivariate Generalizability Theory to Teacher Recommendation Letters and Self-introduction Letters Used in Selection of Mathematically Gifted Students by Observation and Nomination (관찰·추천제에 의한 수학영재 선발 시 사용되는 교사추천서와 자기소개서 평가에 대한 다변량 일반화가능도 이론의 활용)

  • Kim, Sung Yeun;Han, Ki Soon
    • Journal of Gifted/Talented Education
    • /
    • v.23 no.5
    • /
    • pp.671-695
    • /
    • 2013
  • This study provides an illustrative example of using the multivariate generalizability theory. Specifically, it investigates relative effects of each error source, and finds optimal measurement conditions for the number of items within each content domain that maximizes the reliability-like coefficients, such as a generalizability coefficient and an index of dependability. The method is based on teacher recommendation letters and self-introduction letters, using an analytic scoring method in the context of selection of mathematically gifted students by observation and nomination. This study analyzed data from the 2011 academic year in the science education institute for the gifted, which is attached to the university located in the Seoul metropolitan area. It should be noted that the optimal scoring structures of this study are not generalizable to other selection instruments. However, the methodology applied in this study can be utilized to find optimal measurement conditions for the number of raters, the number of content domains, and the number of items in other selection instruments self-developed by many institutions including: the education institutes for the gifted at provincial offices of education, gifted classes, and the science education institutes for the gifted attached to universities in general. In addition, the methodology will provide bases for making informed decisions in selection instruments of the gifted based on measurement traits.

Exchange of Electronic Document with Certification of Delivery and Contents (배달 및 내용증명이 가능한 전자 문서의 교환)

  • 황보성;이임영
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2000.10a
    • /
    • pp.623-625
    • /
    • 2000
  • 인터넷 환경의 발달에 의해 네트워크상의 콘텐츠의 전송이 활발해 지고 있다. 그 대표적인 예가 대중적으로 일반화되어 있는 전자메일일 것이다. 하지만 전자메일이 보다 일반화되고 보안상의 위협을 제거하기 위해선 전송되는 메일에 대한 내용증명과 배달증명이 가능해야 한다. 따라서, 본 논문에서는 먼저 내용증명과 배달증명이 가능한 기존의 방법들을 분석하고, 양사용자 사이에 문서를 교활할 수 있는 새로운 방법을 제안한다.

  • PDF

Multigroup Generalizability Analysis of Creative Attitude Scale-Korea for Mathematically Gifted and General Students in Middle Schools (수학적 창의성 태도 검사에서 수학영재와 일반학생의 다집단 일반화가능도 분석)

  • Kim, Sungyeun
    • Communications of Mathematical Education
    • /
    • v.31 no.1
    • /
    • pp.49-70
    • /
    • 2017
  • The purpose of this study was to investigate the relative influence of multiple error sources and to find optimal measurement conditions that obtain a desired level of reliability of a creative attitude test in mathematical creativity. This study analyzed the scores of the Creative Attitude Scale-Korea allowed to access publicly of 125 general students and 109 mathematically gifted students by performing a multivariate generalizability analysis. The main results were as follows. First, based on reliability, the Creative Attitude Scale-Korea was measured less precisely for mathematically gifted students. On the contrary, based on the conditional standard error of measurement, it was measured less precisely for general students. However, the Creative Attitude Scale-Korea showed strong reliability in both groups. Second, the optimal weights should adjust to .3, .3, .4 in mathematically gifted students and .4, .4, .2 in general students with three scoring components of divergent attitude, problem solving attitude, and convergent attitude based on the maximum reliability. Third, to approach desirable reliability, it is possible to use one component of divergent attitude in general students but three components of divergent attitude, problem solving attitude, and convergent attitude in mathematically gifted students. Finally this study proposed application plans for the Creative Attitude Scale-Korea and future directions of research.

A study on validity and reliability of students' evaluation (강의평가의 타당성과 신뢰성에 관한 연구 전주대학교 강의평가 결과를 중심으로)

  • Lee, Ki-Hoon
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.1
    • /
    • pp.87-98
    • /
    • 2010
  • This research deals the method to assess the validity and reliability of students' evaluation for lectures. Most papers for student's evaluation have focused the procedures for controlling the external effects, but this paper is trying to answer for "How reliable is the student rating?" An empirical study shows that the evaluations in Jeonju University have the fair validity and reliability. The generalizability theory is suggested to obtain the more comprehensive results rather than Cronbach's alpha to examine internal consistency.

Exploring the Reliability of an Assessment based on Automatic Item Generation Using the Multivariate Generalizability Theory (다변량일반화가능도 이론을 적용한 자동문항생성 기반 평가에서의 신뢰도 탐색)

  • Jinmin Chung;Sungyeun Kim
    • Journal of Science Education
    • /
    • v.47 no.2
    • /
    • pp.211-224
    • /
    • 2023
  • The purpose of this study is to suggest how to investigate the reliability of the assessment, which consists of items generated by automatic item generation using empirical example data. To achieve this, we analyzed the illustrative assessment data by applying the multivariate generalizability theory, which can reflect the design of responding to different items for each student and multiple error sources in the assessment score. The result of the G-study showed that, in most designs, the student effect corresponding to the true score of the classical test theory was relatively large after residual effects. In addition, in the design where the content domain was fixed, the ranking of students did not change depending on the item types or items. Similarly, in the design where the item format was fixed, the difficulty showed little variation depending on the content domains. The result of the D-study indicated that the original assessment data achieved a sufficient level of reliability. It was also found that higher reliability than the original assessment data could be obtained by reducing the number of items in the content domains of operation, geometry, and probability and statistics, or by assigning higher weights to the domains of letters and formulas, and function. The efficient measurement conditions presented in this study are limited to the illustrative assessment data. However, the method applied in this study can be utilized to determine the reliability and to find efficient measurement conditions for the various assessment situations using automatic item generation based on measurement traits.