• Title/Summary/Keyword: spatial scan statistics

Search Result 19, Processing Time 0.025 seconds

Categorical Data Analysis by Means of Echelon Analysis with Spatial Scan Statistics

  • Moon, Sung-Ho
    • Journal of the Korean Data and Information Science Society
    • /
    • v.15 no.1
    • /
    • pp.83-94
    • /
    • 2004
  • In this study we analyze categorical data by means of spatial statistics and echelon analysis. To do this, we first determine the hierarchical structure of a given contingency table by using echelon dendrogram then, we detect candidates of hotspots given as the top echelon in the dendrogram. Next, we evaluate spatial scan statistics for the zones of significantly high or low rates based on the likelihood ratio. Finally, we detect hotspots of any size and shape based on spatial scan statistics.

  • PDF

Detection of Hotspots for Geospatial Lattice Data

  • Moon, Sung-Ho;Kim, Jong-Duk
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.1
    • /
    • pp.131-139
    • /
    • 2006
  • Statistical analyses for spatial data are important features for various types of fields. Spatial data are taken at specific locations or within specific regions and their relative positions are recorded. Lattice data are synoptic observation covering an entire spatial region, like cancer rates corresponding to each county in a state. The main purpose of this paper is to detect hotspots for the region with significantly high or low rates. Kulldorff(1997) detected hotspots based on circular spatial scan statistics. We propose a new method to find any shapes of hotspots by use of echelon analysis with spatial scan statistics.

  • PDF

Optimizing the maximum reported cluster size for normal-based spatial scan statistics

  • Yoo, Haerin;Jung, Inkyung
    • Communications for Statistical Applications and Methods
    • /
    • v.25 no.4
    • /
    • pp.373-383
    • /
    • 2018
  • The spatial scan statistic is a widely used method to detect spatial clusters. The method imposes a large number of scanning windows with pre-defined shapes and varying sizes on the entire study region. The likelihood ratio test statistic comparing inside versus outside each window is then calculated and the window with the maximum value of test statistic becomes the most likely cluster. The results of cluster detection respond sensitively to the shape and the maximum size of scanning windows. The shape of scanning window has been extensively studied; however, there has been relatively little attention on the maximum scanning window size (MSWS) or maximum reported cluster size (MRCS). The Gini coefficient has recently been proposed by Han et al. (International Journal of Health Geographics, 15, 27, 2016) as a powerful tool to determine the optimal value of MRCS for the Poisson-based spatial scan statistic. In this paper, we apply the Gini coefficient to normal-based spatial scan statistics. Through a simulation study, we evaluate the performance of the proposed method. We illustrate the method using a real data example of female colorectal cancer incidence rates in South Korea for the year 2009.

Categorical Data Analysis by Using Spatial Scan Statistics and Echelon Analysis

  • Mun, Seung-Ho;Sin, Jae-Gyeong
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 2004.04a
    • /
    • pp.183-194
    • /
    • 2004
  • 본 연구에서는 공간 검색 통계량(spatial scan statistics)과 에셜론 해석법을 이용한 범주형 자료분석을 다룬다. 이를 위해 우선, 에셜론 덴드로그램을 이용하여 주어진 분활표의 계층적 구조(hierarchical structure)를 결정하고서 이로부터 핫스팟(hotspot)의 후보를 검출한다. 다음으로 우도비(likelihood ratio)를 기초로 유의하게 높거나 낮게 나타나는 지역에 대한 공간 검색 통계량을 산출한다. 마지막으로, 이 통계량을 바탕으로 핫스팟을 검출한다.

  • PDF

A Study on Spatial Statistical Perspective for Analyzing Spatial Phenomena in the Framework of GIS: an Empirical Example using Spatial Scan Statistic for Detecting Spatial Clusters of Breast Cancer Incidents (공간현상 분석을 위한 GIS 기반의 공간통계적 접근방법에 관한 고찰: 공간 군집지역 탐색을 위한 공간검색통계량의 실증적 사례분석)

  • Lee, Gyoung-Ju;Kweon, Ihl
    • Spatial Information Research
    • /
    • v.20 no.1
    • /
    • pp.81-90
    • /
    • 2012
  • When analyzing geographical phenomena, two properties need to be considered. One is the spatial dependence structure and the other is a variation or an uncertainty inhibited in a geographic space. Two problems are encountered due to the properties. Firstly, spatial dependence structure, which is conceptualized as spatial autocorrelation, generates heterogeneous geographic landscape in a spatial process. Secondly, generic statistics, although suitable for dealing with stochastic uncertainty, tacitly ignores location information im plicit in spatial data. GIS is a versatile tool for manipulating locational information, while spatial statistics are suitable for investigating spatial uncertainty. Therefore, integrating spatial statistics to GIS is considered as a plausible strategy for appropriately understanding geographic phenomena of interest. Geographic hot-spot analysis is a key tool for identifying abnormal locations in many domains (e.g., criminology, epidemiology, etc.) and is one of the most prominent applications by utilizing the integration strategy. The article aims at reviewing spatial statistical perspective for analyzing spatial processes in the framework of GIS by carrying out empirical analysis. Illustrated is the analysis procedure of using spatial scan statistic for detecting clusters in the framework of GIS. The empirical analysis targets for identifying spatial clusters of breast cancer incidents in Erie and Niagara counties, New York.

Detection of Hotspots on Multivariate Spatial Data

  • Moon, Sung-Ho
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.4
    • /
    • pp.1181-1190
    • /
    • 2006
  • Statistical analyses for spatial data are important features for various types of fields. Spatial data are taken at specific locations or within specific regions and their relative positions are recorded. Lattice data are synoptic observation covering an entire spatial region, like cancer rates corresponding to each county in a state. Until now, the echelon analysis has been applied only to univariate spatial data. As a result, it is impossible to detect the hotspots on the multivariate spatial data In this paper, we expand the spatial data to time series structure. And then we analyze them on the time space and detect the hotspots. Echelon dendrogram has been made by piling up each multivariate spatial data to bring time spatial data. We perform the structural analysis of temporal spatial data.

  • PDF

Delineating CBD and Subcentres and Detecting Specialized Areas in that Central Places of Seoul (서울의 도심 및 부심 설정과 특화 기능 탐색)

  • Seo, Mincheol
    • Journal of the Korean Geographical Society
    • /
    • v.49 no.2
    • /
    • pp.275-298
    • /
    • 2014
  • This paper aims to delineate Central Business District(CBD) and subcentres of Seoul and compare the functional differences of them by spatial scan statistics. Most existing studies to delimit Seoul CBD have two limits in the methods to make boundaries. First, most Seoul CBD-defining studies presuppose some central area contains CBD and look into just that area in a concentrating manner because it is too difficult to collect the data in a whole city boundary. Therefore the CBD areas have been localized in that study areas. But I analysed the data of the whole area of Seoul and was able to define the CBD and subcentres of Seoul. Second, I analysed the data by a spatial scan statistics technique and was able to minimize the number of subjective items in constructing some conditions for CBD. The CBD area in this study is enlarged eastward over East Gate, a national treasure in Seoul, than the areas in existing studies. In the contrary, westwardly, our CBD is set back a little. The two competing central places in Seoul, CBD and Gangnam have some different specialized subareas. CBD has more governing authorities and wholesale stores and Gangnam has many conglomerates HQs, Offices and cosmetic clinics.

  • PDF

Spatial clustering of pedestrian traffic accidents in Daegu (대구광역시 교통약자 보행자 교통사고 공간 군집 분석)

  • Hwang, Yeongeun;Park, Seonghee;Choi, Hwabeen;Yoon, Sanghoo
    • Journal of Digital Convergence
    • /
    • v.20 no.3
    • /
    • pp.75-83
    • /
    • 2022
  • Korea, which has the highest pedestrian fatality rate among OECD countries, is making efforts to improve the safe walking environment by enacting laws focusing on pedestrian. Spatial clustering was conducted with scan statistics after examining the social network data related to traffic accidents for children and seniors. The word cloud was used to examine people's recognition Campaigns for children and literature survey for seniors were in main concern. Naedang and Yongsan are the regions with the highest relative risk of weak pedestrian for children and seniors. On the contrary, Bongmu and Beomeo are the lowest relative risk region. Naedang-dong and Yongsan-dong of Daegu Metropolitan City were identified as vulnerable areas for pedestrian safety due to the high risk of pedestrian accidents for children and the elderly. This means that the scan statistics are effective in searching for traffic accident risk areas.

County Level Clustering on Alcohol and HIV Mortality

  • Park, Byeonghwa
    • Communications for Statistical Applications and Methods
    • /
    • v.20 no.1
    • /
    • pp.53-62
    • /
    • 2013
  • This study focuses on spatial/temporal relationship deaths caused by Human Immunodeficiency Virus (HIV) and Alcohol Use Disorder (AUD). Several studies have found links between these two diseases. By looking for clusters in mortality of Alcohol and HIV related deaths this study contributes to the field through the identification of exact spatial/temporal time of high and low occurrence risks based on the observed over the expected number of deaths. This study does not provide political or social interpretations of the data. It merely wants to show where clusters are found.

Spatial Cluster Analysis for Earthquake on the Korean Peninsula

  • Kang, Chang-Wan;Moon, Sung-Ho;Cho, Jang-Sik;Lee, Jeong-Hyeong;Choi, Seung-Bae;Beum, Soo-Gyun
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.4
    • /
    • pp.1141-1150
    • /
    • 2006
  • In this study, we performed spatial cluster analysis which considered spatial information using earthquake data for Korean peninsula occurred on 1978 year to 2005 year. Also, we look into how to be clustered for regions using earthquake magnitude and frequency based on spatial scan statistic. And, on the basis of the results, we constructed earthquake map by earthquake outbreak risk and gave a possible explanation for the results of spatial cluster analysis.

  • PDF