• Title/Abstract/Keywords: contingency table

Search results: 118 (processing time: 0.023 s)

주변값이 주어진 이원분할표에 대한 카이제곱 검정통계량의 소표본 분포 및 대표본 분포와의 일치성 연구 (On the Small Sample Distribution and its Consistency with the Large Sample Distribution of the Chi-Squared Test Statistic for a Two-Way Contingency Table with Fixed Margins)

  • 박철용;최재성;김용곤
    • Journal of the Korean Data and Information Science Society / Vol. 11, No. 1 / pp. 83-90 / 2000
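  • The chi-squared test statistic is commonly used to test the independence of the two categorical variables in a two-way contingency table. It is well known that, when the sampling model is multinomial or product-multinomial, this statistic approximately follows a chi-squared distribution under the independence hypothesis. When both margins are fixed, the sampling model under independence is the multiple hypergeometric distribution, and a test based on the chi-squared statistic can be used just as in the preceding models. This study compares the small-sample distribution of the chi-squared statistic with its large-sample distribution, the chi-squared distribution, when the margins are fixed. For several cases with small sample sizes, the small-sample distribution of the chi-squared statistic was computed directly. For several cases with large sample sizes, the small-sample distribution was generated by a simple Monte Carlo algorithm, and its agreement with the large-sample distribution was examined using chi-squared probability plots and the Kolmogorov-Smirnov one-sample test.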

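The Monte Carlo idea in the abstract above can be sketched as follows. This is a minimal illustration, not the authors' algorithm: it assumes that tables with both margins fixed are drawn by shuffling column labels and dealing them out to the rows, which yields the multiple hypergeometric distribution under independence. The function names `chi_squared`, `random_table_fixed_margins`, and `simulate_statistics` are hypothetical, and all margins are assumed to be positive.

```python
import random


def chi_squared(table):
    """Pearson chi-squared statistic for a two-way table of counts."""
    rows = [sum(r) for r in table]
    cols = [sum(c) for c in zip(*table)]
    n = sum(rows)
    stat = 0.0
    for i, r in enumerate(rows):
        for j, c in enumerate(cols):
            expected = r * c / n  # expected count under independence
            stat += (table[i][j] - expected) ** 2 / expected
    return stat


def random_table_fixed_margins(rows, cols, rng):
    """Draw one table with the given row and column totals.

    Assumed sampling scheme: give every unit a column label, shuffle
    the labels, then hand the first rows[0] labels to row 0, and so on.
    """
    labels = [j for j, c in enumerate(cols) for _ in range(c)]
    rng.shuffle(labels)
    table = [[0] * len(cols) for _ in rows]
    pos = 0
    for i, r in enumerate(rows):
        for j in labels[pos:pos + r]:
            table[i][j] += 1
        pos += r
    return table


def simulate_statistics(rows, cols, n_sim, seed=0):
    """Sorted chi-squared statistics from n_sim simulated tables."""
    rng = random.Random(seed)
    return sorted(
        chi_squared(random_table_fixed_margins(rows, cols, rng))
        for _ in range(n_sim)
    )
```

The sorted statistics returned by `simulate_statistics` could then be plotted against chi-squared quantiles or passed to a one-sample Kolmogorov-Smirnov test; that comparison step is not shown here.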

Contour Plot to Explore the Structure of Categorical Data

  • Kim, Hyun Chul;Huh, Moon Yul;Chung, Hee Suk
    • Communications for Statistical Applications and Methods / Vol. 10, No. 2 / pp. 371-385 / 2003
  • In this paper, the contour plot is considered as a method to explore the structure of categorical data. For this purpose, the paper suggests a method to sort a two-way contingency table with respect to the expected marginals. The suggested plot is found to provide valuable information about the underlying data structure. First, independence between the categories can be investigated by examining the differences between the expected frequency contours and the observed frequency contours. The plot also allows a visual check for outliers in the data. These properties of the suggested contour plot are demonstrated with several sets of real data.
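The quantities compared in the abstract above, observed and expected frequencies, can be sketched as follows. The sorting rule used here (rows and columns ordered by descending marginal totals) is only an assumption about what sorting "with respect to the expected marginals" means, and the function names are hypothetical. The contour plotting itself is not shown.

```python
def expected_frequencies(table):
    """Expected counts under independence: row total * column total / n."""
    rows = [sum(r) for r in table]
    cols = [sum(c) for c in zip(*table)]
    n = sum(rows)
    return [[r * c / n for c in cols] for r in rows]


def sort_by_margins(table):
    """Reorder rows and columns by descending marginal totals (assumed rule)."""
    col_totals = [sum(c) for c in zip(*table)]
    row_order = sorted(range(len(table)), key=lambda i: -sum(table[i]))
    col_order = sorted(range(len(col_totals)), key=lambda j: -col_totals[j])
    return [[table[i][j] for j in col_order] for i in row_order]


# Illustrative data only, not taken from the paper.
observed = sort_by_margins([[10, 20], [30, 40]])
expected = expected_frequencies(observed)

# Observed minus expected; large absolute values point to departures
# from independence or to unusual cells.
residuals = [
    [o - e for o, e in zip(obs_row, exp_row)]
    for obs_row, exp_row in zip(observed, expected)
]
```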

A multivariate latent class profile analysis for longitudinal data with a latent group variable

  • Lee, Jung Wun;Chung, Hwan
    • Communications for Statistical Applications and Methods / Vol. 27, No. 1 / pp. 15-35 / 2020
  • In behavioral research, significant attention has been paid to the stage-sequential process for multiple latent class variables. We explore the stage-sequential process of multiple latent class variables using multivariate latent class profile analysis (MLCPA). A latent profile variable, representing the stage-sequential process in MLCPA, is formed by a set of repeatedly measured categorical response variables. This paper proposes an extended MLCPA that explains the association between the latent profile variable and the latent group variable in the form of a two-dimensional contingency table. We applied the extended MLCPA to the National Longitudinal Survey on Youth 1997 (NLSY97) data to investigate the association between the developmental progression of depression and substance use behaviors among adolescents who experienced authoritarian parental styles in their youth.

Graphical Methods for Hierarchical Log-Linear Models

  • Hong, Chong-Sun;Lee, Ui-Ki
    • Communications for Statistical Applications and Methods / Vol. 13, No. 3 / pp. 755-764 / 2006
  • Most graphical methods for categorical data can describe the structure of the data and represent a measure of association among categorical variables. Among them, the polyhedron plot represents sequential relationships among hierarchical log-linear models for a multidimensional contingency table. This kind of plot can be used to describe the differences among sequential models. In this paper we suggest graphical methods, containing all the information, that reflect the relationships among all log-linear models in a certain hierarchical structure. We use the ideas of a correlation diagram.

비모수적(非母數的) 통계(統計) 프로그램의 개발(開發) (Computer Programs for Nonparametric Tests)

  • 배도선;장중순;김상복
    • 대한산업공학회지 / Vol. 12, No. 2 / pp. 101-108 / 1986
  • Computer programs for the IBM PC/XT/AT or compatibles are presented for running 9 nonparametric tests. They include the sign test, the Wilcoxon signed rank test, the Mann-Whitney-Wilcoxon test, the Kruskal-Wallis test, the Kolmogorov-Smirnov one-sample and two-sample tests, the Kendall and Spearman rank correlation coefficient tests, and the chi-square test for contingency tables. Each program is written in BASIC and is combined into a statistical package, 'NONPARA', which is easily accessible through menu programs. The algorithms on which each test is based are also explained, and 3 examples are given.


Estimation of Log-Odds Ratios for Incomplete $2{\times}2$ Tables with Covariates using FEFI

  • Kang, Shin-Soo;Bae, Je-Min
    • Journal of the Korean Data and Information Science Society / Vol. 18, No. 1 / pp. 185-194 / 2007
  • Covariate information can be used to perform fully efficient fractional imputation (FEFI). A new method, FEFI with logistic regression, is proposed to construct complete contingency tables. The jackknife method is used to obtain standard errors of the log-odds ratio from the table completed by the new method. Simulation results show that, when covariates carry more information about the categorical variables, the new method provides more efficient estimates of the log-odds ratio than either multiple imputation (MI) based on data augmentation or complete-case analysis.

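As a rough illustration of the log-odds ratio and its jackknife standard error mentioned in the abstract above, the sketch below works on an already complete 2×2 table. It does not implement FEFI or any imputation step. It uses a plain delete-one jackknife and assumes every cell count is at least 2 so that each replicate is defined. The function names are hypothetical.

```python
import math


def log_odds_ratio(a, b, c, d):
    """Log-odds ratio of the 2x2 table [[a, b], [c, d]]."""
    return math.log(a * d / (b * c))


def jackknife_se(a, b, c, d):
    """Delete-one jackknife standard error of the log-odds ratio.

    Deleting any one of the observations in a cell gives the same
    replicate, so each cell contributes one replicate value weighted
    by its count. Assumes all counts are >= 2.
    """
    n = a + b + c + d
    replicates = [
        (a, log_odds_ratio(a - 1, b, c, d)),
        (b, log_odds_ratio(a, b - 1, c, d)),
        (c, log_odds_ratio(a, b, c - 1, d)),
        (d, log_odds_ratio(a, b, c, d - 1)),
    ]
    mean = sum(w * t for w, t in replicates) / n
    variance = (n - 1) / n * sum(w * (t - mean) ** 2 for w, t in replicates)
    return math.sqrt(variance)
```

For moderately large counts, the result is typically close to the familiar large-sample approximation `sqrt(1/a + 1/b + 1/c + 1/d)`.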

Correspondence analysis for studying association between geography and cancer

  • Song, Joon-Jin;Yu, Pingjian;Ren, Yuan;Chung, Ming-Hua
    • Journal of the Korean Data and Information Science Society / Vol. 20, No. 5 / pp. 919-924 / 2009
  • Geographical location carries information such as demography, local economy, environment, and lifestyle, which could be sources of cancer occurrence. Analyzing geographical location in association with cancer occurrence can be instructive to physicians, patients, and health administrators regarding resource allocation, expenditures, prophylaxis, and treatments. In this paper, we explore the correspondence between geographical locations and cancer mortality rates using correspondence analysis, and illustrate the approach with the mortality rates of the top 10 cancers in the 75 counties of Arkansas from 2001 to 2005. Geographical variations in cancer mortality rates are evaluated across Arkansas counties. Based on the contingency table, a correspondence analysis model is developed, and simple indices indicating the degree to which the regions and the cancers affect each other are calculated. Quantitative results are visualized and mapped in two-dimensional graphs.


모비율의 NONINFERIORITY에 대한 연구 (A study on Noninferiority of Proportions)

  • 강승호
    • 응용통계연구 / Vol. 16, No. 1 / pp. 117-128 / 2003
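  • An experiment whose purpose is to show that the treatment effect of a new drug is not worse than that of an existing drug is called a noninferiority trial. This paper compares three variance estimation methods used in the unconditional exact test for two-sample noninferiority trials of proportions. Using complete enumeration of all possible outcomes, the size and power in small samples were compared across the three variance estimation methods.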

도시지역내 농지소유자의 농지이용 의향 분석 - 서울특별시의 사례를 중심으로 - (Landowner's views on their agricultural land uses in an urban area : The case of Seoul)

  • 황한철;박선용;최수명
    • 농촌계획 / Vol. 6, No. 1 / pp. 50-58 / 2000
  • In spite of the importance of farmland in the city, urbanization and industrialization have sharply reduced the farm area. The purpose of this study is to establish an effective approach to agricultural land use by examining farmers' intentions, based on a survey in the Seoul area. The areas, agricultural types, landowners' ages, and farmland sizes were surveyed and analyzed with respect to urban agricultural planning and land use planning. All collected data were analyzed with contingency tables and chi-square tests using the SAS statistical package. The structure of the intentions regarding agricultural land use was examined through comparative analyses of agricultural landowners, agricultural land leaseholders, areas, landowners' ages, farming types, and so on.


확률화응답에 대한 대수선형모형 (A Log-Linear Model for Randomized Responses)

  • 최경호
    • Communications for Statistical Applications and Methods / Vol. 4, No. 3 / pp. 725-734 / 1997
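  • In many social science surveys, categorical data obtained in the form of a contingency table often contain errors due to misclassification. Randomized response for estimating a qualitative attribute is sometimes regarded as a special case of this misclassification problem. Categorical data obtained through randomized response can therefore be regarded as a mixed-up contingency table. This paper sets up a log-linear model for such a table and uses the limit of the maximum likelihood estimator obtained by the iterative scaling procedure (ISP) of Chen and Fienberg (1976). For symmetric techniques of the Warner (1965) type, this limit is shown to coincide with the maximum likelihood estimator proposed by Singh (1976), which confirms that the estimator presented by Warner is not appropriate as a maximum likelihood estimator. For the unrelated-question technique, the estimator proposed by Greenberg et al. (1969) is likewise found not to be appropriate as a maximum likelihood estimator from the standpoint of estimation.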

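The point about Warner's estimator in the abstract above can be illustrated with a small sketch. It assumes the standard Warner design, in which each respondent answers the sensitive statement with probability `p` and its complement with probability `1 - p`, so that P(yes) = p·π + (1 − p)(1 − π). The clipping to [0, 1] is shown only as a simple way to keep the estimate inside the parameter space; whether it matches Singh's (1976) estimator exactly is an assumption. The function names are hypothetical.

```python
def warner_estimate(n_yes, n, p):
    """Warner's moment estimator of the sensitive proportion.

    n_yes: number of "yes" answers, n: sample size,
    p: probability that a respondent gets the sensitive statement.
    """
    if p == 0.5:
        raise ValueError("p = 0.5 makes the proportion unidentifiable")
    yes_rate = n_yes / n
    return (yes_rate - (1 - p)) / (2 * p - 1)


def truncated_estimate(n_yes, n, p):
    """Warner's estimate clipped to [0, 1] (assumed form of the
    constrained estimate)."""
    return min(1.0, max(0.0, warner_estimate(n_yes, n, p)))
```

With few "yes" answers the unclipped value can fall below 0, which is one way to see why it cannot be a maximum likelihood estimate of a proportion in every sample.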