• Title/Summary/Keyword: item difficulty

Search Result 288, Processing Time 0.03 seconds

Study on Estimating the Optimal Number-right Score in Two Equivalent Mathematics-test by Linear Score Equating (수학교과의 동형고사 문항에서 양호도 향상에 유효한 최적정답율 산정에 관한 연구)

  • 홍석강
    • The Mathematical Education
    • /
    • v.37 no.1
    • /
    • pp.1-13
    • /
    • 1998
  • In this paper, we have represented the efficient way how to enumerate the optimal number-right scores to adjust the item difficulty and to improve item discrimination. To estimate the optimal number-right scores in two equivalent math-tests by linear score equating a measurement error model was applied to the true scores observed from a pair of equivalent math-tests assumed to measure same trait. The model specification for true scores which is represented by the bivariate model is a simple regression model to inference the optimal number-right scores and we assume again that the two simple regression lines of raw scores and true scores are independent each other in their error models. We enumerated the difference between mean value of $\chi$* and ${\mu}$$\_$$\chi$/ and the difference between the mean value of y*and a+b${\mu}$$\_$$\chi$/ by making an inference the estimates from 2 error variable regression model. Furthermore, so as to distinguish from the original score points, the estimated number-right scores y’$\^$*/ as the estimated regression values of true scores with the same coordinate were moved to center points that were composed of such difference values with result of such parallel score moving procedure as above mentioned. We got the asymptotically normal distribution in Figure 5 that was represented as the optimal distribution of the optimal number-right scores so that we could decide the optimal proportion of number-right score in each item. Also by assumption that equivalence of two tests is closely connected to unidimensionality of a student’s ability. we introduce new definition of trait score to evaluate such ability in each item. In this study there are much limitations in getting the real true scores and in analyzing data of the bivariate error model. However, even with these limitations we believe that this study indicates that the estimation of optimal number right scores by using this enumeration procedure could be easily achieved.

  • PDF

Comparison of Kinesthesia Test of SIPT for Preschool Children (전 학령기 아동의 SIPT 운동감각(kinesthesia) 검사에 대한 비교 연구)

  • Chang, Moon-Young;Hwang, Ki-Chul
    • The Journal of Korean Academy of Sensory Integration
    • /
    • v.2 no.1
    • /
    • pp.11-19
    • /
    • 2004
  • Objective : This study is to provide the norms of normal children when comparing the performance ability of preschool children while using the kinesthesia test of Sensory Integration and Praxis Tests(SIPT). Methods : Participants consisted of 90 normal children ranging in age from four to six years. The kinesthesia test of SIPT was utilized to investigate the performance ability. Results : 1. Regarding the kinesthesia ability according to age, the average value of kinesthesia performance error decreased as age getting older and that value showed the statistically significant differences between four and five, six age(p<0.05). 2. The kinesthesia performance ability according to gender, the accuracy of both hands and the dominant hand did not show the statistically significant differences. 3. Regarding the kinesthesia performance ability of test items, 1R item and 6R item(26.2cm), 5R item and 2L item(20.2cm) passing through the midline of body and having the large movement in distance and angle showed the difficulty to perform in all the children between 4 and 6 age. Conclusion : By providing the norms of the kinesthesia performance ability in normal children of the above results to the occupational therapists treating children, the helpful data to the hand skill development of children, exercise plan and implementation, and the performance therapy of ADL through the proper evaluation and training of kinesthesia is considered for the occupational therapists to be provided.

  • PDF

Responsiveness Comparisons of Self-Report Versus Therapist-Scored Functional Capacity for Workers With Low Back Pain

  • Choi, Bongsam;Park, So-Yeon
    • Physical Therapy Korea
    • /
    • v.19 no.3
    • /
    • pp.91-97
    • /
    • 2012
  • The primary aim of this study was to compare responsiveness of self-report by worker and therapist-scored functional capacity instrument. Self-report and therapist-scored interval-level person measures and item difficulties were compared at admission and discharge. Therapist and worker ratings were collected on 230 clients from 27 rehabilitation sites using the newly developed Occupational Rehabilitation Data Base (ORDB) functional capacity instrument. ORDB comprises several subscales measuring relevant variables of "a return-to-work model" in work-related rehabilitation clinics. The functional capacity scale deals with 10 DOT job factors. The rating scale categories were 1-severely impaired, 2-moderately impaired, 3-mildly impaired, and 4-not impaired. Only data from clients with low back pain (n=98) with complete data (both admission and discharge scores) were used for the present study. Therapists and workers completed the functional capacity instrument at admission and discharge. Rasch analysis [1-parameter item response theory model (IRT)] was applied to calibrate item difficulty and person ability measure of therapist and workers ratings. Effect sizes for therapist and self-report ratings were slightly different, .69 and .30, respectively. Therapist and worker ratings were more consistent at discharge (r=.54) than at admission (r=.32). Workers have a tendency to be more severe in their ratings (show higher item difficulties) than therapists at admission and discharge. Therapists and workers report similar magnitudes of improvement following treatment program. These findings challenge the belief that injured workers may unreliable source for monitoring therapeutic outcomes. Self-report measures have the advantage of conserving therapist time for treatment (versus evaluation). While the therapist and self-report ratings are comparable at discharge, there is less consistency at admission. Comparable therapist-worker ratings may be achieved by controlling for rating severity using IRT methodologies.

Combining Two Scales to Assess Risk Factors of Falling in Community-Dwelling Elderly Persons: A Preliminary Study (노인의 낙상에 영향을 주는 요인을 평가하기 위한 ABC-BBS의 적용: 사전연구)

  • Park, So-Yeon
    • Physical Therapy Korea
    • /
    • v.15 no.2
    • /
    • pp.44-53
    • /
    • 2008
  • The purpose of this preliminary study was to develop a measurement for assessing risk factors for falling in community-dwelling elderly persons. Rasch analysis and principal component analysis were performed to examine whether items on the Activities-Specific Balance Confidence (ABC), assessing self-efficacy, and items on the Berg Balance Scale (BBS), assessing balance function, contribute jointly to a unidimensional construct in the elderly. A total of 35 elderly persons (4 men, 31 women) participated. In this study, each item of ABC (16 items) and BBS (14 items) was scored on a 5-point ordinal rating scale from 0 to 4. The initial Rasch and principal component analysis indicated that 3 of the ABC items and 2 of the BBS items were misfit for this study. These 5 items were excluded from further study. After combining ABC and BBS, Rasch and principal component analyses were examined and finally 23 items selected; 12 items from ABC, 11 items from BBS. The 23 combined ABC-BBC items were arranged in order of difficulty. The hardest item was 'walk outside on icy sidewalks' and the easiest item was 'pivot transfer'. Although structural calibration of each 5 rating scale categories was not ordered, the other three essential criteria of Linacre's optimal rating scale were satisfied. Overall, the ABC-BBS showed sound item psychometric properties. Each of the 5 rating scale categories appeared to distinctly identify subjects at different ability levels. The findings of this study support that the new ABC-BBS scale measure balance function and self-efficacy. It will be a clinically useful assessment of risk factors for falling in the elderly. However, the number of subjects was too small to generalize our results. Further study is needed to develop a new assessment considering more risk factors of falling in elderly.

  • PDF

Item Analysis for Selecting Science Gifted Middle School Students at Physics Class (과학영재교육원 중학교 물리 전공 선발 문항 분석)

  • Lim, Chun-Woo;Park, Yune-Bae
    • Journal of Gifted/Talented Education
    • /
    • v.20 no.1
    • /
    • pp.61-77
    • /
    • 2010
  • The purpose of this study was to analyze the items that were used in entrance examination for science gifted education center for middle school students by using content analysis and classical item analysis. In content analysis, objective type items exhibited mathematics and physics were dominant. Science giftedness & creativity items were dominant. And essay type items consisted of physics items, have evaluated creative problem solving ability. Item difficulty and discrimination index, on the whole, were appropriate. Comparing with objective type, essay type has higher discrimination index. In correlation analysis between total score and score of each type of items, total score has the highest correlation with essay type items and science giftedness & creativity. It was recommended that mathematics, physics and chemistry items with focusing giftedness & creativity could give some implications for future selection methods of science gifted education center.

A study on the perception of occupational therapy majors on Cognitive Impairment Screening Test (CIST)

  • Lee, Sun-myung;Chae, Joo-hyun;Sung, I-sul;Lee, Soo-jin;Moon, Soo-bin;Park, Da-hee;Park, So-hyun
    • Journal of Korean Clinical Health Science
    • /
    • v.9 no.2
    • /
    • pp.1493-1501
    • /
    • 2021
  • Purpose: The purpose of this study is to classify the characteristics of each item of CIST evaluation and to find out the degree of recognition of the characteristics of the cognitive tool. Methods: This study was conducted for occupational therapy majors at M University located in Gyeongsangnam-do. The data collection from May to June 2021. Total of 25 copies of the data were finally analyzed, SPSS Statistics 26 was used for data analysis. Results: As a result of the study, the significance level was visual reasoning 1 test strip and the visual reasoning 1 tool. In the relationship between the correspondence 1 figure simulation sheet and the figure simulation tool for each item and statistically significant, and the correspondence 2 visual reasoning 2 sheet. Visual reasoning 2 sheet and visual reasoning tool also showed that was found to be statistically significant. The correlation for visual reasoning 1 sheet and the visual reasoning 1 tool, reasoning 2 tool and visual reasoning 1 sheet, and the visual reasoning 2 tool and the verbal reasoning sheet. Conclusion: In this study, in the CIST items that may be difficult, it is better to attach the actual tool rather than the verbal explanation of the test paper to increase the efficiency of the test and the understanding of subjects with mild cognitive impairment. It was implemented by applying the tool, and it was found that the use of the tool in the visual reasoning item showed a high correlation by item. Furthermore, based on this study, it will be possible to suggest a method to control the difficulty of each subject of the cognitive evaluation tool, and to prepare a standard for future research.

Student Responses to Smart Device-Based Test on Competency Evaluation in Dental Education

  • Kim, Jooah;Kim, Soo-Yoon
    • Journal of Korean Dental Science
    • /
    • v.12 no.2
    • /
    • pp.58-65
    • /
    • 2019
  • Purpose: This study was aimed to investigate the possibility of utilizing smart device-based test (SBT) for competency evaluation in dental education and to analyze the student responses on overall competency evaluation using SBT method, in comparison to ubiquitous-based test (UBT). Materials and Methods: Questionnaire surveys have been conducted at Yonsei University College of Dentistry from 2015 to 2018 to obtain students' feedback on the application of SBT to competency evaluation. In addition, in order to supplement the competency evaluation procedure, considerations were explored by comparing the expected and actual difficulty of each item when preparing items for competency evaluation with SBT. Result: According to the survey results, student responses between the initial two years (2015 and 2016) differed from those in next two years (2017 and 2018). Students in 2017 and 2018 had more positive responses on competency evaluation with SBT. To determine the test validity, criterion-referenced evaluation was adopted to compare the data in 2017 and 2018 and slight differences in test difficulty in 2018 between the expected and actual difficulty of items were found. Conclusion: The results indicated that SBT was more appropriate for competency evaluation than UBT, based on four-year period of competency evaluation. The SBT was not affected by either the file size or the number of test-takers. Interestingly, students were not sensitive to test version of competency evaluation (paper-based test and SBT). This study suggests that the quality of the test items should be measured by continuous monitoring of the expected and actual difficulty of items for determining test validity. More detailed results and discussions of the findings are given for the development of test procedure and further potential research directions in dental education.

Integrating Multi-view Stereoscopic Transmission System into MPEG-21 DIA (Digital Item Adaptation)

  • Lee, Seung-Won;Kim, Man-Bae;Byun, Hye-Ran;Park, Il-Kwon
    • Journal of Broadcast Engineering
    • /
    • v.12 no.4
    • /
    • pp.342-349
    • /
    • 2007
  • In general multi-view system, all the view sequences acquired at the server are transmitted to the client. However, this kind of system requires high processing power of the server as well as the client, thus it is posing a difficulty in practical applications. To overcome this problem, a relatively simple method is to transmit only two view-sequences requested by the client in order to deliver a stereoscopic video. In this system, effective communication between the server and the client is one of important aspects. Therefore, we propose an efficient multi-view system that transmits two view-sequences according to user's request. The view selection process is integrated into MPEG-21 DIA (Digital Item Adaptation) so that our system is compatible to MPEG-21 multimedia framework. Furthermore, multi-view descriptors related to multi-view camera and systems are newly introduced. The syntax of the descriptions and their elements is represented in XML (extensible Markup Language) schema. Intermediate view reconstruction (IVR) is used to reduce such discomfort with excessive disparity. Furthermore, IVR is useful for smooth transition between two stereoscopic view sequences. Finally, through the implementation of testbed, we can show the valuables and possibilities of our system.

Development of Critical Thinking Skill Evaluation Scale for Nursing Students (간호대학생의 비판적 사고력 평가도구 개발)

  • You, So Young;Kim, Nam Cho
    • Journal of Korean Academy of Nursing
    • /
    • v.44 no.2
    • /
    • pp.129-138
    • /
    • 2014
  • Purpose: To develop a Critical Thinking Skill Test for Nursing Students. Methods: The construct concepts were drawn from a literature review and in-depth interviews with hospital nurses and surveys were conducted among students (n=607) from nursing colleges. The data were collected from September 13 to November 23, 2012 and analyzed using the SAS program, 9.2 version. The KR 20 coefficient for reliability, difficulty index, discrimination index, item-total correlation and known group technique for validity were performed. Results: Four domains and 27 skills were identified and 35 multiple choice items were developed. Thirty multiple choice items which had scores higher than .80 on the content validity index were selected for the pre test. From the analysis of the pre test data, a modified 30 items were selected for the main test. In the main test, the KR 20 coefficient was .70 and Corrected Item-Total Correlations range was .11-.38. There was a statistically significant difference between two academic systems (p=.001). Conclusion: The developed instrument is the first critical thinking skill test reflecting nursing perspectives in hospital settings and is expected to be utilized as a tool which contributes to improvement of the critical thinking ability of nursing students.

Math Creative Problem Solving Ability Test for Identification of the Mathematically Gifted

  • Cho Seok-Hee;Hwang Dong-Jou
    • Research in Mathematical Education
    • /
    • v.10 no.1 s.25
    • /
    • pp.55-70
    • /
    • 2006
  • The purpose of this study was to develop math creative problem solving test in order to identify the mathematically gifted on the basis of their math creative problem solving ability and evaluate the goodness of the test in terms of its reliability and validity of measuring creativity in math problem solving on the basis of fluency in producing valid solutions. Ten open math problems were developed requiring math thinking abilities such as intuitive insight, organization of information, inductive and deductive reasoning, generalization and application, and reflective thinking. The 10 open math test items were administered to 2,029 Grade 5 students who were recommended by their teachers as candidates for gifted education programs. Fluency, the number of valid solutions, in each problem was scored by math teachers. Their responses were analyzed by BIGSTEPTS based on Rasch's 1-parameter item-response model. The item analyses revealed that the problems were good in reliability, validity, difficulty, and discrimination power even when creativity was scored with the single criteria of fluency. This also confirmed that the open problems which are less-defined, less-structured and non-entrenched were good in measuring math creativity of the candidates for math gifted education programs. In addition, it discriminated applicants for two different gifted educational institutions and between male and female students as well.

  • PDF