• Title/Abstract/Keywords: homogeneity test. Pearson test

22 results found (processing time: 0.019 s)

Test of Homogeneity Based on Complex Survey Data : Discussion Based on Power of Test

  • Heo, Sun-Yeong;Yi, Su-Cheol
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 16, No. 3
    • /
    • pp.609-620
    • /
    • 2005
  • In secondary analysis of categorical data, situations often arise in which the estimated cell variances are available but the full variance-covariance matrix is not. In this case researchers are often inclined to use Pearson-type test statistics for homogeneity. However, for a complex sample the observed cell proportions are not multinomially distributed, and the Pearson-type test statistic is generally not asymptotically chi-square distributed. This paper evaluates the power of the Wald test, the Pearson-type test, and the first-order corrected Pearson-type test for homogeneity. The resulting power curves indicate that as the misspecification effect increases, both the inflation of the significance level and the loss of power of the Pearson-type test become more severe.


복합표본자료에서 동질성검정을 위한 피어슨 검정통계량의 효과 (Effect of complex sample design on Pearson test statistic for homogeneity)

  • 허순영;정영애
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 23, No. 4
    • /
    • pp.757-764
    • /
    • 2012
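  • Categorical survey data based on a complex sample design do not satisfy the conditions required for the ordinary Pearson chi-squared test. Nevertheless, many survey studies apply complex sample designs yet report conventional Pearson test results. Through an empirical analysis of homogeneity testing for categorical data from a complex sample design, this study compares the conventional Pearson test, the Wald test (an unbiased test), and a Pearson test that uses proportion estimates reflecting the sample design. The analysis confirms that the conventional Pearson test yields much larger statistic values and severely smaller p-values than the tests that reflect the sample design. In the comparison, under the complex sample design, between the cases where the variance of the estimator is known and unknown, the results were found to be affected by the number of categories and by the mean and standard deviation of the eigenvalues of the design effect matrix.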

Effect of Bias on the Pearson Chi-squared Test for Two Population Homogeneity Test

  • Heo, Sunyeong
    • 통합자연과학논문집
    • /
    • Vol. 5, No. 4
    • /
    • pp.241-245
    • /
    • 2012
  • Categorical data collected under a complex sample design are not suited to the standard multinomial-based Pearson chi-squared test because the observations are not independent and identically distributed. This study investigates how bias in the point estimator of a population proportion and in its variance estimator affects the standard Pearson chi-squared test statistic when the sample is collected under a complex sampling scheme. The effect is examined for the two-population homogeneity test. The standard Pearson test statistic can be partitioned into two parts: the first is a weighted sum of ${\chi}^2_1$ variables with the eigenvalues of the design matrix as weights, and the second is an additional term arising from the biases of the point estimator and its variance estimator. Our empirical analysis shows that even when the bias of the point estimator is small, the Pearson test statistic is greatly inflated because the variance of the point estimator is underestimated. Regarding the design-based variance estimator and its design matrix, the larger the average of the eigenvalues of the design matrix, the larger the share of the Pearson test statistic taken by the first component.
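
The two-part partition described in this abstract can be written schematically as below. The symbols $X_P^2$, $\delta_i$, $W_i$, $R$ and $\bar{\delta}$ are notation assumed here for illustration and are not taken from the paper; the first-order correction shown on the second line is the usual Rao-Scott form, which divides by the average eigenvalue.

```latex
% Assumed notation: X_P^2 = standard Pearson statistic, k = degrees of freedom,
% \delta_i = eigenvalues of the design matrix, R = remainder due to the biases
% of the point estimator and of its variance estimator.
X_P^2 \;\approx\; \sum_{i=1}^{k} \delta_i W_i \;+\; R,
\qquad W_i \sim \chi^2_1 \ \text{independently},
\\[6pt]
X_{RS}^2 \;=\; \frac{X_P^2}{\bar{\delta}},
\qquad \bar{\delta} \;=\; \frac{1}{k} \sum_{i=1}^{k} \delta_i .
```

When every $\delta_i$ equals 1 and $R$ vanishes, the first part reduces to the familiar $\chi^2_k$ reference distribution of the multinomial case.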

Tests for homogeneity of proportions in clustered binomial data

  • Jeong, Kwang Mo
    • Communications for Statistical Applications and Methods
    • /
    • Vol. 23, No. 5
    • /
    • pp.433-444
    • /
    • 2016
  • When we observe binary responses within a cluster (such as laboratory rats), they are usually correlated with each other. In clustered binomial counts, the independence assumption is violated and we encounter extra-variation. In the presence of extra-variation, the ordinary statistical analyses of binomial data are inappropriate. In testing the homogeneity of proportions among several treatment groups, the classical Pearson chi-squared test has a severe flaw in controlling Type I error rates. We focus on modifying the chi-squared statistic by incorporating variance inflation factors. We suggest a method for adjusting the data using a dispersion estimate based on a quasi-likelihood model. We explain the testing procedure through an illustrative example and compare the performance of the modified chi-squared test with competing statistics in a Monte Carlo study.
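
The idea of deflating clustered binomial data by a dispersion estimate before applying a chi-squared test can be sketched as follows. This is a minimal illustration under stated assumptions, not the procedure of the paper: the dispersion is taken to be a Pearson-type estimate within each group (floored at 1), and the function names and data layout are hypothetical.

```python
def dispersion(clusters):
    """Pearson-type dispersion estimate for one treatment group.

    clusters: list of (successes, cluster_size) pairs.
    Assumes at least two clusters and a pooled proportion strictly
    between 0 and 1. The estimate is floored at 1.0 (no deflation
    when there is no sign of extra-variation).
    """
    y = sum(s for s, _ in clusters)
    n = sum(m for _, m in clusters)
    p = y / n
    x2 = sum((s - m * p) ** 2 / (m * p * (1 - p)) for s, m in clusters)
    return max(1.0, x2 / (len(clusters) - 1))


def adjusted_chi2(groups):
    """Chi-squared statistic for equal proportions on deflated data.

    groups: list of treatment groups, each a list of (successes, size).
    Each group's totals are divided by its own dispersion estimate
    before the usual Pearson statistic is computed.
    """
    adjusted = []
    for clusters in groups:
        phi = dispersion(clusters)
        y = sum(s for s, _ in clusters) / phi
        n = sum(m for _, m in clusters) / phi
        adjusted.append((y, n))
    total_y = sum(y for y, _ in adjusted)
    total_n = sum(n for _, n in adjusted)
    p = total_y / total_n
    return sum((y - n * p) ** 2 / (n * p * (1 - p)) for y, n in adjusted)


# Hypothetical data: two treatment groups with two clusters each.
group_a = [(8, 10), (8, 10)]
group_b = [(2, 10), (2, 10)]
stat = adjusted_chi2([group_a, group_b])
```

The statistic would then be compared with a chi-squared distribution with (number of groups minus 1) degrees of freedom; with a dispersion above 1 the effective sample size shrinks, which is what pulls the Type I error rate back toward its nominal level.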

k-모집단 동질성검정에서 피어슨검정의 오차성분 분석에 관한 연구 (Error cause analysis of Pearson test statistics for k-population homogeneity test)

  • 허순영
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 24, No. 4
    • /
    • pp.815-824
    • /
    • 2013
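  • In large-scale sample surveys such as national surveys, complex sample designs that combine stratification, clustering, systematic sampling, and unequal probability sampling are commonly used to secure the representativeness of the sample. In categorical data analysis based on such designs, the traditional Pearson test, which assumes independence of the data and a multinomial distribution, can give distorted test results. For the k-population homogeneity test on categorical survey data from a complex sample design, this study derives the Wald test statistic, a design-based consistent statistic, and derives an expression that decomposes the error sources arising from use of the traditional Pearson test statistic into the effect of the bias of the variance, the effect of the bias of the estimator, and the remaining effect in which the two biases are confounded. In addition, to check empirically the relative size of each term's contribution to the Pearson chi-squared test statistic, an empirical analysis was carried out using data from the second year of the fourth Korea National Health and Nutrition Examination Survey. The results clearly show that, although it differs by variable, the effect of the variance bias is generally larger than the effect of the estimator bias.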

Empirical Analysis on Rao-Scott First Order Adjustment for Two Population Homogeneity test Based on Stratified Three-Stage Cluster Sampling with PPS

  • Heo, Sunyeong
    • 통합자연과학논문집
    • /
    • Vol. 7, No. 3
    • /
    • pp.208-213
    • /
    • 2014
  • Nationwide and/or large-scale sample surveys generally use complex sample designs. The traditional Pearson chi-square test is not appropriate for categorical data from complex samples. Rao and Scott suggested an adjustment to the Pearson chi-square test that uses the average of the eigenvalues of the design matrix of cell probabilities. This study compares the efficiency of the Rao-Scott first-order adjusted test with that of the Wald test for homogeneity between two populations, using data from the 2009 Gyeongnam regional education offices' customer satisfaction survey (2009 GREOCSS). The 2009 GREOCSS data were collected by stratified three-stage cluster sampling with probability proportional to size. The empirical results show that, for the 2009 GREOCSS data, the Rao-Scott adjusted test statistic, which uses only the variances of the cell probabilities, is very close to the Wald test statistic, which uses the covariance matrix of the cell probabilities. However, caution is needed when using the Rao-Scott first-order adjusted test statistic in place of the Wald test, because its efficiency decreases as the relative variance of the eigenvalues of the design matrix of cell probabilities increases, especially when the number of degrees of freedom is small.
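
The first-order adjustment described in this abstract can be illustrated with a short sketch: compute the ordinary Pearson statistic, then divide it by the average eigenvalue of the design matrix. The counts and eigenvalues below are hypothetical, the function names are chosen here for illustration, and in practice the eigenvalues would have to be estimated from the survey design. The relative variance (squared coefficient of variation) of the eigenvalues is included as the diagnostic the abstract warns about.

```python
def pearson_homogeneity(table):
    """Pearson chi-squared statistic for an r x c table of counts.

    Each row is one population, each column one response category.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    return stat


def rao_scott_first_order(stat, eigenvalues):
    """Divide the Pearson statistic by the mean eigenvalue."""
    mean_eig = sum(eigenvalues) / len(eigenvalues)
    return stat / mean_eig


def relative_variance(eigenvalues):
    """Squared coefficient of variation of the eigenvalues."""
    k = len(eigenvalues)
    mean_eig = sum(eigenvalues) / k
    return sum((e - mean_eig) ** 2 for e in eigenvalues) / (k * mean_eig ** 2)


# Hypothetical counts: two populations (rows), three categories (columns).
table = [[20, 30, 50], [30, 30, 40]]
# Hypothetical eigenvalues of the design matrix, one per degree of freedom.
eigenvalues = [1.5, 2.5]

x2 = pearson_homogeneity(table)
x2_rs = rao_scott_first_order(x2, eigenvalues)
rel_var = relative_variance(eigenvalues)
print(round(x2, 4), round(x2_rs, 4), rel_var)
```

The adjusted statistic is referred to the same chi-squared distribution as the unadjusted one. A relative variance near zero means the eigenvalues are nearly equal, which is the situation in which the first-order adjustment is expected to track the Wald test closely.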

3변수 확률분포에 의한 설계강우량 추정 (Estimation of Design Rainfall Using 3 Parameter Probability Distributions)

  • 이순혁;맹승진;류경식
    • 한국수자원학회:학술대회논문집
    • /
    • Korea Water Resources Association 2004 Conference
    • /
    • pp.595-598
    • /
    • 2004
  • This research derives design rainfalls by the L-moment method, after testing the annual maximum daily rainfall data from 38 rainfall stations in Korea for homogeneity, independence, and outliers. To select an appropriate distribution for the annual maximum daily rainfall at each station, the Generalized Extreme Value (GEV), Generalized Logistic (GLO), Generalized Pareto (GPA), Generalized Normal (GNO) and Pearson Type 3 (PT3) probability distributions were applied, and their goodness of fit was judged using an L-moment ratio diagram and the Kolmogorov-Smirnov (K-S) test. Parameters of the appropriate distributions were estimated from the observed annual maximum daily rainfall and from data simulated using Monte Carlo techniques. Design rainfalls were finally derived from the GEV distribution, which proved more appropriate than the other distributions.


3변수 확률분포형에 의한 극치강우의 빈도분석 (Frequency Analysis of Extreme Rainfall Using 3 Parameter Probability Distributions)

  • 김병준;맹승진;류경식;이순혁
    • 한국농공학회논문집
    • /
    • Vol. 46, No. 3
    • /
    • pp.31-42
    • /
    • 2004
  • This research derives design rainfalls by the L-moment method, after testing the annual maximum daily rainfall data from 38 rainfall stations in Korea for homogeneity, independence, and outliers. To select an appropriate distribution for the annual maximum daily rainfall at each station, the Generalized Extreme Value (GEV), Generalized Logistic (GLO), Generalized Pareto (GPA), Generalized Normal (GNO) and Pearson Type 3 (PT3) probability distributions were applied, and their goodness of fit was judged using an L-moment ratio diagram and the Kolmogorov-Smirnov (K-S) test. Parameters of the appropriate distributions were estimated from the observed annual maximum daily rainfall and from data simulated using Monte Carlo techniques. Design rainfalls were finally derived from the GEV distribution, which proved more appropriate than the other distributions.
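
The L-moment ratio diagram mentioned in this abstract is built from sample L-moments. A minimal sketch of how the first few are computed from the unbiased probability-weighted moments is given below; the function name and the data are hypothetical, and the sketch stops at L-skewness rather than reproducing the full analysis.

```python
def sample_l_moments(data):
    """Return (l1, l2, t3): L-location, L-scale and L-skewness.

    Uses the unbiased probability-weighted moments b0, b1, b2 of the
    ordered sample. Requires at least three observations and a
    non-constant sample (so that l2 is not zero).
    """
    x = sorted(data)
    n = len(x)
    # x[i] is the (i + 1)-th order statistic, so the usual weight
    # (rank - 1) / (n - 1) becomes i / (n - 1) with 0-based i.
    b0 = sum(x) / n
    b1 = sum(i / (n - 1) * x[i] for i in range(1, n)) / n
    b2 = sum(i * (i - 1) / ((n - 1) * (n - 2)) * x[i] for i in range(2, n)) / n
    l1 = b0
    l2 = 2 * b1 - b0
    l3 = 6 * b2 - 6 * b1 + b0
    return l1, l2, l3 / l2


# Hypothetical annual maximum daily rainfall values (mm) for one station.
rainfall = [112.0, 95.5, 140.2, 180.7, 101.3, 127.9, 210.4, 99.8]
l1, l2, t3 = sample_l_moments(rainfall)
```

Plotting each station's L-skewness against its L-kurtosis (computed analogously from b3) and comparing the points with the theoretical curves of the candidate distributions gives the ratio diagram.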

시추공 영상자료와 카이제곱 검정을 이용한 절리 방향성의 수직적 변화양상에 관한 정량적 평가 (Pearson-type Chi-square Test on the Joint Orientations from Different Depths in Boreholes)

  • 김기석;박영도;박연준
    • 터널과지하공간
    • /
    • Vol. 18, No. 3
    • /
    • pp.185-193
    • /
    • 2008
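  • In this study, a Pearson chi-squared statistical test was carried out to check how the orientations of rock joints obtained from borehole analysis change with depth. The target rock masses are at two sites whose host rock is granitic gneiss. This massive lithology without developed foliation was chosen because, in foliated rocks, joint orientation is influenced by the foliation, and the orientation of the foliation can vary with depth as a result of geological processes such as folding. Borehole images were used to determine the joint orientations. The acquired orientation data were divided into shallow-section and deep-section data, each set was projected onto a partition net consisting of 21 regions, and a classification table was constructed for the statistical test. The results showed that the data from one of the two sites were non-homogeneous. These results suggest that in-situ investigation is desirable when surveying the orientation of joint planes, which is important in rock engineering for the design of underground structures such as tunnels.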

구조화된 환자교육이 뇌졸중 환자의 조기재활에 관한 지식과 활동수행에 미치는 영향 (The Effect of the Structured Education on the Early Rehabilitation Knowledge and Activity Performance of the C.V.A. Patients)

  • 이혜진;이향련
    • 대한간호학회지
    • /
    • Vol. 27, No. 1
    • /
    • pp.109-119
    • /
    • 1997
  • This study aimed to establish nursing strategies that promote activity performance in early rehabilitation by examining the effect of structured patient education on the early rehabilitation knowledge and activity performance of C.V.A. patients. The experimental and control groups were surveyed in advance by questionnaire, interview, and observation; the subjects were 65 patients hospitalized at the oriental medicine hospital of K Medical Center from July 1, 1995 to the end of September 1995. Homogeneity of the general characteristics of the experimental and control groups was tested by the χ² test, and homogeneity of ADL by the t-test. To test the hypotheses, the t-test was used for the difference in early rehabilitation knowledge and activity performance between the two groups, and the correlation between early rehabilitation knowledge and activity performance was tested by Pearson's correlation coefficient. The results of the hypothesis tests are as follows. 1. The 1st hypothesis, “The experimental group that received the structured education will score higher in early rehabilitation knowledge than the control group,” was supported (t=4.45, p=.000). 2. The 2nd hypothesis, “The experimental group that received the structured education will score higher in early rehabilitation activity performance than the control group,” was supported (t=2.11, p=.036). 3. The 3rd hypothesis, “The higher the patient's early rehabilitation knowledge, the higher the degree of activity performance,” was rejected (r=.1546, p=.219).
In conclusion, the patients who received the structured education showed increased early rehabilitation knowledge and activity performance, so education is judged to be a prerequisite for increasing the knowledge and activity performance of early rehabilitation.
