Browse > Article
http://dx.doi.org/10.29220/CSAM.2018.25.4.373

Optimizing the maximum reported cluster size for normal-based spatial scan statistics  

Yoo, Haerin (Division of Biostatistics, Department of Biomedical Systems Informatics, Yonsei University College of Medicine)
Jung, Inkyung (Division of Biostatistics, Department of Biomedical Systems Informatics, Yonsei University College of Medicine)
Publication Information
Communications for Statistical Applications and Methods / v.25, no.4, 2018 , pp. 373-383 More about this Journal
Abstract
The spatial scan statistic is a widely used method to detect spatial clusters. The method imposes a large number of scanning windows with pre-defined shapes and varying sizes on the entire study region. The likelihood ratio test statistic comparing inside versus outside each window is then calculated and the window with the maximum value of test statistic becomes the most likely cluster. The results of cluster detection respond sensitively to the shape and the maximum size of scanning windows. The shape of scanning window has been extensively studied; however, there has been relatively little attention on the maximum scanning window size (MSWS) or maximum reported cluster size (MRCS). The Gini coefficient has recently been proposed by Han et al. (International Journal of Health Geographics, 15, 27, 2016) as a powerful tool to determine the optimal value of MRCS for the Poisson-based spatial scan statistic. In this paper, we apply the Gini coefficient to normal-based spatial scan statistics. Through a simulation study, we evaluate the performance of the proposed method. We illustrate the method using a real data example of female colorectal cancer incidence rates in South Korea for the year 2009.
Keywords
spatial cluster detection; Gini coefficient; maximum scanning window size; weighted normal model;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Huang L, Tiwari RC, Zou Z, Kulldorff M, and Feuer EJ (2009). Weighted normal spatial scan statistic for heterogeneous population data, Journal of the American Statistical Association, 104, 886-898.   DOI
2 Jung I, Kulldorff M, and Klassen AC (2007). A spatial scan statistic for ordinal data, Statistics in Medicine, 26, 1594-1607.   DOI
3 Jung I, Kulldorff M, and Richard OJ (2010). A spatial scan statistic for multinomial data, Statistics in Medicine, 29, 1910-1918.   DOI
4 Kim S and Jung I (2017). Optimizing the maximum reported cluster size in the spatial scan statistic for ordinal data, PLoS ONE, 12, e0182234.   DOI
5 Kulldorff M (1997). A spatial scan statistic, Communications in Statistics - Theory and Methods, 26, 1481-1496.   DOI
6 Han J, Zhu L, Kulldorff M, Hostovich S, Stinchcomb DG, Tatalovich Z, Lewis DR, and Feuer EJ (2016). Using Gini coefficient to determining optimal cluster reporting sizes for spatial scan statistics, International Journal of Health Geographics, 15, 27.   DOI
7 Huang L, Kulldorff M, and Gregorio D (2007). A spatial scan statistic for survival data, Biometrics, 63, 109-118.   DOI
8 Kulldorff M and Information Management Services Inc (2018). SaTScan$^{TM}$ v9.5: Software for the spatial and space-time spatial scan statistics, from: http://www.satscan.org/
9 Kulldorff M, Huang L, and Konty K (2009). A scan statistic for continuous data based on the normal probability model, International Journal of Health Geographics, 8, 58.   DOI
10 Kulldorff M, Huang L, Pickle L, and Duczmal L (2006). An elliptic spatial scan statistic, Statistics in Medicine, 25, 3929-3943.   DOI
11 Lorenz MO (1905). Methods of measuring the concentration of wealth, Publications of the American Statistical Association, 9, 209-219.   DOI
12 Ribeiro SHR and Costa MA (2012). Optimal selection of the spatial scan parameters for cluster detection: a simulation study, Spatial and Spatio-Temporal Epidemiology, 3, 107-120.   DOI
13 Patil GP and Taillie C (2004). Upper level set scan statistic for detecting arbitrarily shaped hotspots, Environmental and Ecological Statistics, 11, 183-197.   DOI
14 Gastwirth JL (1972). The estimation of the Lorenz curve and Gini index, The Review of Economics and Statistics, 54, 306-316.   DOI
15 Tango T and Takahashi K (2005). A flexibly shaped spatial scan statistic for detecting clusters, International Journal of Health Geographics, 4, 11.   DOI
16 Duczmal L and Assuncao R (2004). A simulated annealing strategy for the detection of arbitrarily shaped spatial clusters, Computational Statistics & Data Analysis, 45, 269-286.   DOI
17 Dwass M (1957). Modified randomization tests for nonparametric hypotheses, The Annals of Mathematical Statistics, 20, 181-187.