• Title/Summary/Keyword: 집락

Search Result 491, Processing Time 0.026 seconds

Testing Independence in Contingency Tables with Clustered Data (집락자료의 분할표에서 독립성검정)

  • 정광모;이현영
    • The Korean Journal of Applied Statistics
    • /
    • v.17 no.2
    • /
    • pp.337-346
    • /
    • 2004
  • The Pearson chi-square goodness-of-fit test and the likelihood ratio tests are usually used for testing independence in two-way contingency tables under random sampling. But both of these tests may provide false results for the contingency table with clustered observations. In this case we consider the generalized linear mixed model which includes random effects of clustering in addition to the fixed effects of covariates. Both the heterogeneity between clusters and the dependency within a cluster can be explained via generalized linear mixed model. In this paper we introduce several types of generalized linear mixed model for testing independence in contingency tables with clustered observations. We also discuss the fitting of these models through a real dataset.

A Study of Sample Size for Two-Stage Cluster Sampling (이단계 집락추출에서의 표본크기에 대한 연구)

  • Song, Jong-Ho;Jea, Hea-Sung;Park, Min-Gue
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.2
    • /
    • pp.393-400
    • /
    • 2011
  • In a large scale survey, cluster sampling design in which a set of observation units called clusters are selected is often used to satisfy practical restrictions on time and cost. Especially, a two stage cluster sampling design is preferred when a strong intra-class correlation exists among observation units. The sample Primary Sampling Unit(PSU) and Secondary Sampling Unit(SSU) size for a two stage cluster sample is determined by the survey cost and precision of the estimator calculated. For this study, we derive the optimal sample PSU and SSU size when the population SSU size across the PSU are di erent by extending the result obtained under the assumption that all PSU have the same number of SSU. The results on the sample size are then applied to the $4^{th}$ Korea Hospital Discharge results and is compared to the conventional method. We also propose the optimal sample SSU (discharged patients) size for the $7^{th}$ Korea Hospital Discharge Survey.

집락 표본추출에 있어서 이-단계 표본 추출

  • 신민웅;이주영
    • Communications for Statistical Applications and Methods
    • /
    • v.4 no.2
    • /
    • pp.403-409
    • /
    • 1997
  • 일단-집락 추출을 할 때에 예비표본으로 부터 얻은 정보를 활용하여 추가표본을 추출한다. 특히, 예비표본의 크기(예비표본의 집락의수) $n_1$ 과 추가표본의 크기$n_2$를 모두 변수로 간주하여 베이즈 위험을 최소로 하는 $n_1$$n_2$의 크기를 결정한다.

  • PDF

효과적인 적응집락추출계획

  • Kim, Yeon-U;Son, Chang-Gyun;Park, Jeong-Su
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2005.05a
    • /
    • pp.245-250
    • /
    • 2005
  • 보통 생태학 분야 등에 적용될 수 있는 적응집락추출계획(adaptive cluster sampling plan)을 수정하여, 표본의 크기 면에서 더 효율적인 Jumped 및 일반화 적응집락추출계획을 제안하였다. 이러한 계획 하에서 Hansen-Hurwitz(HH)와 Horvitz-Thompson (HT) 추정량으로 모수를 추정하였다. 제안한 새로운 계획들을 시뮬레이션을 통하여 기존의 계획과 비교하였다.

  • PDF

A clustering method using the Coulomb Energy Network (쿨롱네트워크를 이용한 집락분석)

  • 이석훈;박래현;김응환
    • The Korean Journal of Applied Statistics
    • /
    • v.8 no.1
    • /
    • pp.39-50
    • /
    • 1995
  • This article deals with the problem that all the statistical clustering methods do not supply the clustering rule after the analysis. We modify the Coulomb Energy Network model basically developed in physics and suggest one model appropriate for our purpose and show the implementation using an actual data. Finally the method suggested is compared with one of the well known methods, K-means algorithm using Rand C.

  • PDF

Colony Size Distributions according to in vitro Aging in Human Skin Fibroblasts (피부 섬유모세포 노화에 따른 세포집락 크기의 분포)

  • Kim, Jun-Sang;Kim, Jae-Sung;Cho, Moon-June;Park, Jeong-Kyu;Park, Tae-Hyun
    • Radiation Oncology Journal
    • /
    • v.17 no.2
    • /
    • pp.158-165
    • /
    • 1999
  • Purpose : To investigate the percentage of colonies wi1h16or more cells distribution of human skin fibroblast according to in vitro aging, and to evaluate the relationship between percentage of colonies with 10 or more cells and in vivo donor age in human skin fibroblast culture. Material and Method : C1, C2, C3a, and C3b human skin fibroblast samples from three breast cancer patients were used as subjects. The C1, C2, and C3a donor were 44, 54, and 55 years old, respectively. C3a and C3b cells were isolated from the same person. Single cell suspension of skin fibroblasts was prepared with primary explant technique. One hundred cells are plated into 100m1 tissue culture flask and cultured for two weeks. The colony size was defined as colonies with 16 or more cells. The cultured cell was stained with crystal violet, and number of cells in each colony was determined with stereo microscope at $\times$10 magnification. Passage number of C1, C2, C3a and C3b skin fibroblast were 12th, 17th, and 14th, respectively. Results : Percentage of colonies with 16 or more cells of skin fibroblast samples decreased with increasing in vitro passage number. In contrast, cumulative population doublings of skin fibroblast sample increased with increasing in vitro passage number. Percentage of colonies with 16 or more cells also decreased with increasing population doublings in human skin fibroblast culture. There was strong correlation with percentage of colonies with 16 or more cells and population doublings En C3a skin fibroblast sampie. At the same point of population doublings, the percentage of colonies with 16 or more cells of the young C1 donor was higher level than the old C3a donor. Conclusion : The population doublings increased with increasing in vitro passage number but percentage of colonies with 16 or more cells decreased. The results of this study imply that percentage of colonies with 16 or more cell is useful as a indicator of in vitro human skin fibroblast aging and may estimate the in vivo donor age.

  • PDF

A study on the relation between dissimilarity and hierarchical agglomerative in clust analysis (집락분석법에 있어서 비유사도와 계층적 응집법의 관계에 관한 연구)

  • 조완현
    • The Korean Journal of Applied Statistics
    • /
    • v.5 no.2
    • /
    • pp.211-227
    • /
    • 1992
  • In this paper we consider the definition and mathematical properties of similarity or dissimilarity which have often used in clust analysis, and we apply a hierarchical agglomerative cluster algorithm to a dissimilarity metrx generated by these distance. Here we investigate the effect of relation between distance function and cluster algorithm on the retrieval ability of natural clusters. We present an empirical results for qualitative data as well as quantitative data.

  • PDF

A Comparative Study on the Statistical Methodology to Determine the Optimal Aggregation Interval for Travel Time Estimation of the Interrupted Traffic Flow (단속류 통행시간 추정을 위한 적정 집락간격 결정에 관한 통계적 방법론 비교 연구)

  • Lim, Houng-Seok;Lee, Seung-Hwan;Lee, Hyun-Jae
    • Journal of Korean Society of Transportation
    • /
    • v.23 no.3 s.81
    • /
    • pp.109-123
    • /
    • 2005
  • The goals of this paper are two folds: i) to evaluate whether the data collected by a license plate matching AVI equipment being operated on some segment of a national highway are suitable or not for use in travel time estimation of interrupted traffic flows; ii) to study the statistical methodologies to be used for the determination of the optimal aggregation interval for travel time estimation. In this study it was found that the AVI data are not representative because the data are collected on some selected lanes of a roadway where main traffic is thru-traffic and, thus the AVI data are different from those collected from all lanes in traffic characteristics. For the determination of the optimal aggregation interval for travel time estimation. two statistical methods. namely point estimation and interval estimation. were tested. The test shows that the point estimation method is more sensitive and gives more desirable results in determing the optimal aggregation interval than the interval estimation method. And it turned out that the optimal aggregation interval on interrupted traffic flows has been calculated as 5 minute and thus the existing aggregation interval. 5 minute is proper.

A Study on the Optimal Aggregation Interval for Travel Time Estimation on the Rural Arterial Interrupted Traffic flow (지방부 간선도로 단속류 통행시간 추정을 위한 적정 집락간격 결정에 관한 연구)

  • Lim Houng-Seak;Lee Seung-Hwan;Lee Hyun-Jae
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.3 no.2 s.5
    • /
    • pp.129-140
    • /
    • 2004
  • In this paper, we conduct the research about optimal aggregation interval of travel time data on interrupted traffic flow and verify the reliability of AVI collected data by using car plate matching method in RTMS for systematic collection and analysis of link travel time data on interrupted traffic flow rural arterial. We perform Kolmosorov-Smirnov test on AVT collected sample data and on entire population data, and conclude that the sample data does not represent pure random sampling and hence includes sample collection error. We suggest that additional review is necessary to investigate the effectiveness of AVI collected sample data as link representative data. We also develop statistical model by applying two estimation techniques namely point estimation and interval estimation for calculating optimal aggregation interval. We have implemented our model and determine that point estimate is preferable over interval estimate for exactly selecting and deciding optimal aggregation interval. Our final conclusion is that 5-minute aggregation interval is optimal to estimate travel time in RTMS, as is currently being used our investigation is based on AVI data collected from Yang-ji to Yong-in $42^{nd}$ National road.

  • PDF