• Title/Summary/Keyword: square contingency table

Search Result 19, Processing Time 0.026 seconds

Goodness-of-fit tests for a proportional odds model

  • Lee, Hyun Yung
    • Journal of the Korean Data and Information Science Society
    • /
    • v.24 no.6
    • /
    • pp.1465-1475
    • /
    • 2013
  • The chi-square type test statistic is the most commonly used test in terms of measuring testing goodness-of-fit for multinomial logistic regression model, which has its grouped data (binomial data) and ungrouped (binary) data classified by a covariate pattern. Chi-square type statistic is not a satisfactory gauge, however, because the ungrouped Pearson chi-square statistic does not adhere well to the chi-square statistic and the ungrouped Pearson chi-square statistic is also not a satisfactory form of measurement in itself. Currently, goodness-of-fit in the ordinal setting is often assessed using the Pearson chi-square statistic and deviance tests. These tests involve creating a contingency table in which rows consist of all possible cross-classifications of the model covariates, and columns consist of the levels of the ordinal response. I examined goodness-of-fit tests for a proportional odds logistic regression model-the most commonly used regression model for an ordinal response variable. Using a simulation study, I investigated the distribution and power properties of this test and compared these with those of three other goodness-of-fit tests. The new test had lower power than the existing tests; however, it was able to detect a greater number of the different types of lack of fit considered in this study. I illustrated the ability of the tests to detect lack of fit using a study of aftercare decisions for psychiatrically hospitalized adolescents.

Applying Randomization Tests to Collocation Analyses in Large Corpora (언어의 공기관계 분석을 위한 임의화검증의 응용)

  • Yang Kyung-Sook;Kim HeeYoung
    • The Korean Journal of Applied Statistics
    • /
    • v.18 no.3
    • /
    • pp.583-595
    • /
    • 2005
  • Contingency tables are used to compare counts of n-grams to determine if the n-gram is a true collocation, meaning that the words that make up the n-gram are highly associated in the text. Some statistical methods for identifying collocation are used. They are Kulczinsky coefficient, Ochiai coefficient, Frager and McGowan coefficient, Yule coefficient, mutual information, and chi-square, and so on. But the main problem is that these measures are based ell the assumption of a nor-mal or approximately normal distribution of the variables being sampled. While this assumption is valid in most instances, it is not valid when comparing the rates of occurrence of rare events, and texts are composed mostly of rare events. In this paper we have simply reviewed some statistics about testing association of two words. Some randomization tests to evaluate the significance level in analyzing collocation in large corpora are proposed. A related graph can be used to compare different lest statistics that ran be used to analyze the same contingency table.

Computer Programs for Nonparametric Tests (비모수적(非母數的) 통계(統計) 프로그램의 개발(開發))

  • Bae, Do-Seon;Jang, Jung-Sun;Kim, Sang-Bok
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.12 no.2
    • /
    • pp.101-108
    • /
    • 1986
  • Computer programs for IBM PC/XT/AT or compatibles, are presented for running 9 nonparametric tests. They include sign test, Wilcoxon signed rank test, Mann-Whitney Wilcoxon test, Kruskal-Wallis test, Kolmogorov-Smirnov one sample and two sample tests, Kendall and Spearman rank correlation coefficient tests, and Chi square test for contingency table. Each program is written with BASIC language and is combined into a statistical package, 'NONPARA'. It is easily accessible through the menu programs. The alogorithms on which each test is based, are also explained and 3 examples are given.

  • PDF

Landowner's views on their agricultural land uses in an urban area : The case of Seoul (도시지역내 농지소유자의 농지이용 의향 분석 - 서울특별시의 사례를 중심으로 -)

  • Hwang, Han-Cheol;Park, Sun-Yong;Choi, Soo-Myung
    • Journal of Korean Society of Rural Planning
    • /
    • v.6 no.1 s.11
    • /
    • pp.50-58
    • /
    • 2000
  • In spite of importance of the farm area in the city, the urbanization and industrialization strongly results in decrease of the farm area. The purpose of this study is to establish an effective way of agricultural land uses by examining on the intention of the farmers based on the survey in Seoul area. The areas, the agricultural types, the landowner's ages, and farm land sizes, were surveyed and analyzed with respect to urban agricultural planning and land use planning. All the collected data were basically analyzed with Contingency Table and Chi-square Test using SAS statistical package. The structures of the intention of agricultural land uses were understood with the comparative analyses of the agricultural land owners, the agricultural land leaseholders, the areas, landowner's ages, farming types, and so on.

  • PDF

Pearson-type Chi-square Test on the Joint Orientations from Different Depths in Boreholes (시추공 영상자료와 카이제곱 검정을 이용한 절리 방향성의 수직적 변화양상에 관한 정량적 평가)

  • Kim, Ki-Seog;Park, Young-Do;Park, Yeon-Jun
    • Tunnel and Underground Space
    • /
    • v.18 no.3
    • /
    • pp.185-193
    • /
    • 2008
  • We have carried out Pearson-type chi-square tests on the orientation data of joints from different depths in order to estimate the homogeneity of joint orientations obtained from a borehole. The orientation data of joints were collected from two non-foliated massive rocks of granitic gneisses in South Korea since orientations of joints in folded metamorphic rocks, for example, are controlled by foliation and also changes as the orientations of foliation change by folding. Borehole images were used for the analysis of the orientations of individual joints. The orientation data were subdivided into the upper level data and lower level data. The data from these two levels are plotted on the patch net consisting of 21 orientation patches. Then, the two patterns on the patch net were analyzed using a contingency table. From the chi-square test on the data collected from two sites, we found that some data sets show statistically meaningful differences in orientations of joints. Since joints are one of the important parameters in determining the physical properties of rock masses, in situ investigation of joints are desirable in the geotechnical investigation and also in design of subsurface structures (e.g. tunnels and underground storages).

Analysis of Preferences Based on Purchasing Frequencies and Recognitions of Sizing System for Female Garments (여성복의 구매빈도에 의한 선호도 및 치수 시스템 인지도에 관한 분석)

  • Koo, Hee-Kyung
    • Journal of the Korea Fashion and Costume Design Association
    • /
    • v.13 no.1
    • /
    • pp.125-134
    • /
    • 2011
  • The research is a survey and analysis of female apparel preferences, and recognition of the KS sizing system for adult female garments. The practical surveys in this research are examined by total number of subjects, 200 women who are living in Seoul, South Korea. The homogeneity test using Chi-square statistics, and the analysis of frequencies and ratios of contingency tables were performed with the data which are classified by age, education level, income level and housing modalities. The findings in this study are as follows: 1. Women's preferences for purchasing female garments indicate significant differences between subjects, such as age, education level, income level and housing modalities. Moreover, the following five types of adult female garments were analyzed in this study: upper garment, lower garment, one-piece apparel, sportswear and sleepwear. The results of the preference study show an indirect understanding of the KSK 0051 classification system for subjects of the survey. Therefore the preference study can be used as a pilot study for the sizing recognition survey. 2. Women's recognition of the KSK 0051 sizing system for adult female adult garments do not indicate significant differences based on the characteristics of age, education level, income level and housing modalities. The low recognition of the KS sizing system is due to too many details and complex numbers of application for users. Therefore, the sizing system should be simplified and rearranged to be more effective and have more recognizable categories.

  • PDF

Small diagnostic scale for internet addiction (인터넷 중독 자가진단 소형척도 개발)

  • Oh, Kwang-Sik
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.6
    • /
    • pp.1203-1209
    • /
    • 2010
  • Internet addiction is a serious social problem in information society. The purpose of this study is to develope a small diagnostic scale in order to detect internet addiction easily. The reliability and validity of K-scale and Kimberly Young-scale is investigated. Five small diagnostic scale is suggested by factor analysis and regression. The comparision of these small scale is established by correlation coefficient, chi-square test, gamma value of concordance in contingency table. In view of reliability and validity, we suggest a small diagnostic scale. The results of this study may be useful to detect internet addiction by oneself.

Benthic Marine Algal of Dolsan-Island in the Southern Coast of Korea II Structure of Algal Communities of Subtidal Zone (남해안 돌산도의 해조 II 조하대 해조군락의 구조)

  • SOHN Chul Hyun;LEE In Kyu;KANG Jae Won
    • Korean Journal of Fisheries and Aquatic Sciences
    • /
    • v.16 no.4
    • /
    • pp.379-388
    • /
    • 1983
  • The subtidal benthic algal communities of Dolsan-island in the southern coast of Korea were surveyed by means of SCUBA diving. The vertical zonation is recognized into three groups; upper, middle and lower subtidal zones. The representative species in each group throughout the year are Ulva pertusa, Codium fragile, Chondria crassicaulis and Gigartina tenera in the upper, Myagropsis myagroides and Sargassum tortile in the middle, and Plocamium telfairiae, Callophyllis japonica and Symphyocladia linearis in the lower zone. According to the normal association analysis by $2{\times}2$ contingency table and chi-square calculation among 29 quadrats, the algal communities are divided into 9 groups which are dominated by Sargassum tortile, Myagropsis myagroides, Chondria crassicaulis, Codium fragile, Pterocladia tenuis, Gigartina tenera, Gracilaria textorii. The thallus length and standing crops of Myagropsis myagroides show the highest value in spring and the lowest in summer.

  • PDF

The NCAM Land-Atmosphere Modeling Package (LAMP) Version 1: Implementation and Evaluation (국가농림기상센터 지면대기모델링패키지(NCAM-LAMP) 버전 1: 구축 및 평가)

  • Lee, Seung-Jae;Song, Jiae;Kim, Yu-Jung
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.18 no.4
    • /
    • pp.307-319
    • /
    • 2016
  • A Land-Atmosphere Modeling Package (LAMP) for supporting agricultural and forest management was developed at the National Center for AgroMeteorology (NCAM). The package is comprised of two components; one is the Weather Research and Forecasting modeling system (WRF) coupled with Noah-Multiparameterization options (Noah-MP) Land Surface Model (LSM) and the other is an offline one-dimensional LSM. The objective of this paper is to briefly describe the two components of the NCAM-LAMP and to evaluate their initial performance. The coupled WRF/Noah-MP system is configured with a parent domain over East Asia and three nested domains with a finest horizontal grid size of 810 m. The innermost domain covers two Gwangneung deciduous and coniferous KoFlux sites (GDK and GCK). The model is integrated for about 8 days with the initial and boundary conditions taken from the National Centers for Environmental Prediction (NCEP) Final Analysis (FNL) data. The verification variables are 2-m air temperature, 10-m wind, 2-m humidity, and surface precipitation for the WRF/Noah-MP coupled system. Skill scores are calculated for each domain and two dynamic vegetation options using the difference between the observed data from the Korea Meteorological Administration (KMA) and the simulated data from the WRF/Noah-MP coupled system. The accuracy of precipitation simulation is examined using a contingency table that is made up of the Probability of Detection (POD) and the Equitable Threat Score (ETS). The standalone LSM simulation is conducted for one year with the original settings and is compared with the KoFlux site observation for net radiation, sensible heat flux, latent heat flux, and soil moisture variables. According to results, the innermost domain (810 m resolution) among all domains showed the minimum root mean square error for 2-m air temperature, 10-m wind, and 2-m humidity. Turning on the dynamic vegetation had a tendency of reducing 10-m wind simulation errors in all domains. The first nested domain (7,290 m resolution) showed the highest precipitation score, but showed little advantage compared with using the dynamic vegetation. On the other hand, the offline one-dimensional Noah-MP LSM simulation captured the site observed pattern and magnitude of radiative fluxes and soil moisture, and it left room for further improvement through supplementing the model input of leaf area index and finding a proper combination of model physics.