• Title/Summary/Keyword: spatial data mining


A Study on the Development of Model for Estimating the Thickness of Clay Layer of Soft Ground in the Nakdong River Estuary (낙동강 조간대 연약지반의 지역별 점성토층 두께 추정 모델 개발에 관한 연구)

  • Seongin, Ahn;Dong-Woo, Ryu
    • Tunnel and Underground Space / v.32 no.6 / pp.586-597 / 2022
  • In this study, a model was developed for estimating the location-specific thickness of the upper clay layer, to be used for consolidation vulnerability evaluation in the Nakdong River estuary. To estimate layer thickness, we developed four spatial estimation models: three based on the machine learning algorithms RF (Random Forest), SVR (Support Vector Regression), and GPR (Gaussian Process Regression), and one based on the geostatistical technique of Ordinary Kriging. Of the 4,712 borehole records collected in the study area for model development, the 2,948 that contain an upper clay layer were used, and the Pearson correlation coefficient and mean squared error were used to quantitatively evaluate the performance of the developed models. In addition, for qualitative evaluation, each model was applied across the entire study area to estimate the upper clay layer, and the resulting thickness distributions were compared with one another.
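
The two quantitative measures named in the abstract can be sketched as follows. This is a minimal illustration, not the authors' evaluation code: the function names and the thickness values are hypothetical, and the abstract does not say how the borehole data were split for validation.

```python
import math

def pearson_r(observed, predicted):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(observed)
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov / math.sqrt(var_o * var_p)

def mean_squared_error(observed, predicted):
    """Average of the squared differences between observation and estimate."""
    return sum((o - p) ** 2 for o, p in zip(observed, predicted)) / len(observed)

# Hypothetical clay-layer thicknesses in metres (not data from the study).
observed = [12.0, 8.5, 15.2, 20.1, 5.3]
predicted = [11.0, 9.5, 14.2, 21.1, 6.3]

r = pearson_r(observed, predicted)
mse = mean_squared_error(observed, predicted)
print(f"r = {r:.3f}, MSE = {mse:.3f}")
```

A higher correlation and a lower squared error would both favor a model, so the two measures are typically read together when comparing the four estimators.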

Financial Data Mining Using Time delay Neural Networks

  • Kim, Hyun-Jung;Shin, Kyung-Shik
    • Proceedings of the Korea Intelligent Information System Society Conference / 2001.01a / pp.122-127 / 2001
  • This study investigates the effectiveness of time delay neural networks (TDNN) for the time-dependent prediction domain. Although it is well known that the back-propagation neural network (BPN) performs well in pattern recognition tasks, the method is limited in that it can only learn an input mapping of static (or spatial) patterns that are independent of time or sequence. The preliminary results show that the accuracy of TDNN is higher than that of the standard BPN with time lag. Our proposed approaches are demonstrated in the stock market prediction domain.
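
The idea of feeding time-lagged values to a network can be sketched as below. This only shows how a series is turned into lagged input windows; it is not the authors' TDNN, and the function name, the delay of 3, and the price values are assumptions made for illustration.

```python
def time_delay_windows(series, delay):
    """Build (inputs, targets): each input holds `delay` consecutive past
    values and the matching target is the value that follows them."""
    inputs, targets = [], []
    for t in range(delay, len(series)):
        inputs.append(series[t - delay:t])
        targets.append(series[t])
    return inputs, targets

# Hypothetical closing prices (not data from the study).
prices = [100, 102, 101, 105, 107, 106]
X, y = time_delay_windows(prices, delay=3)

for window, target in zip(X, y):
    print(window, "->", target)
```

Each row of `X` could then serve as the input vector of a feed-forward network, which is roughly what "BPN with time lag" refers to; a TDNN additionally applies such delays inside the network.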

A Spatial Data Mining Method by Clustering Analysis (클러스터링 분석에 의한 공간데이터마이닝 방법)

  • 손은정;강인수;김태완;이기준
    • Proceedings of the Korean Information Science Society Conference / 1998.10b / pp.161-163 / 1998
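  • In application systems that handle vast amounts of spatial data, such as geographic information systems, spatial data mining, which extracts regular characteristics or knowledge of interest from spatial databases, plays a very important role. Several methods have been proposed for this purpose. The most representative is clustering, but it is limited to finding spatial concentration and distribution based purely on geometric distance. Spatial data mining, however, also requires analyzing the causes behind the formation of spatial clusters. This study therefore addresses a method for finding the causes of spatial concentration and distribution by analyzing the association between the results of spatial clustering and other spatial objects. We first present a method for analyzing the association between clusters and spatial objects by defining several kinds of distance, and then present a method for splitting a spatial cluster into unit clusters when the cluster is influenced by multiple spatial objects.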
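
One way to relate a cluster to nearby spatial objects through distances can be sketched as follows. The abstract does not specify which distances the authors define, so the two used here (centroid distance and minimum point distance) are assumptions, as are the function names, the object names, and the coordinates.

```python
import math

def centroid(points):
    """Arithmetic mean of a list of (x, y) points."""
    xs, ys = zip(*points)
    return (sum(xs) / len(xs), sum(ys) / len(ys))

def centroid_distance(cluster, obj):
    """Distance from the cluster centroid to a point object."""
    return math.dist(centroid(cluster), obj)

def min_distance(cluster, obj):
    """Distance from the closest cluster member to a point object."""
    return min(math.dist(p, obj) for p in cluster)

# Hypothetical cluster and candidate spatial objects.
cluster = [(1.0, 1.0), (2.0, 1.0), (1.0, 2.0), (2.0, 2.0)]
objects = {"station": (1.5, 4.5), "factory": (9.0, 9.0)}

# Associate the cluster with the object that has the smallest centroid distance.
nearest = min(objects, key=lambda name: centroid_distance(cluster, objects[name]))
print(nearest, round(centroid_distance(cluster, objects[nearest]), 2))
```

If a cluster turned out to be close to several objects under such a measure, that would be the situation in which the abstract proposes splitting it into unit clusters.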

Query Optimization Infrastructure in Spatial Data Mining (공간 데이터 마이닝에서의 질의 처리 최적화 전략)

  • 김충석;이현창;김경창
    • The Journal of Korean Institute of Communications and Information Sciences / v.26 no.7A / pp.1200-1211 / 2001
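  • With the emergence of data mining tools and systems in the increasingly popular field of data mining, there is a need for a powerful data mining query language with an interactive, easy-to-use GUI environment. Spatial data mining is a branch of data mining that discovers useful knowledge from spatial data, which consists of points, lines, rectangles, polygons, and so on. Along with geographic information systems (GIS), spatial data mining has recently attracted considerable interest and active research. Meanwhile, research on query languages for spatial data mining, and on query processing and optimization based on such languages, is becoming increasingly important. Mining of spatial data cannot be expressed with the query languages of general relational databases. In this study, we first define and design a spatial data mining query language, and present a way to improve the efficiency of query expression by specifying, in the query language, the result presentation format and the storage of the result data set. We also present the query processing and optimization process for spatial data mining, using methods such as the creation and maintenance of query-based spatial materialized views, query reuse through indexes, and a sampling option for mining queries.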

Spatial Data Mining Query Language for SIMS (SIMS를 위한 공간 데이터 마이닝 질의 언어)

  • Park, Sun;Park, Sang-Ho;Ahn, Chan-Min;Lee, Youn-Seok;Lee, Ju-Hong
    • Proceedings of the Korean Information Science Society Conference / 2004.04b / pp.70-72 / 2004
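  • SIMS is an integrated management system that supports a spatial information management environment. It manages a variety of spatial and non-spatial data and supports multiple application tasks. In this paper, we design a spatial data mining query language that supports a dedicated SIMS-based spatial data mining system, so that useful information can be discovered not only from the spatial data handled by existing spatial data mining query languages but also from diverse data and spatio-temporal data produced by automatic data collection, satellite positioning services, remote sensing, GPS, mobile computing, and similar sources.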

The Analysis of Geospatial Efficiency of Goheung-Gun Aquaculture Type Ochon-Gye Using Bootstrap-DEA (고흥군 양식어업형 어촌계의 입지에 따른 어업효율성 분석에 관한 연구)

  • Kim, Jong-Cheon;Lee, Chang-Soo
    • The Journal of Fisheries Business Administration / v.52 no.1 / pp.23-46 / 2021
  • The purpose of this study is to understand the production efficiency of individual fishing communities and to provide directions for improvement. The subjects of the study are the aquaculture-type Ochon-Gye in Goheung-gun. The analysis used bootstrap-DEA to overcome the statistical reliability problem of the traditional DEA technique. In addition, data mining-GIS was applied to identify the spatial productivity of fishing communities. The values of technical efficiency, pure technical efficiency, and scale efficiency were estimated for 32 aquaculture-type fishing villages. Then, using the benchmarking reference set and weights, projections were presented by adjusting the excess input factors, and the confidence intervals of the efficiency values, accounting for statistical significance, were estimated using the bootstrap.
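
The three efficiency values named above are related by the standard DEA decomposition, in which technical efficiency (constant returns to scale) equals pure technical efficiency (variable returns to scale) multiplied by scale efficiency. The abstract does not state this formula, so the sketch below rests on that textbook relation, and the function name and the scores are hypothetical.

```python
def scale_efficiency(technical, pure_technical):
    """Scale efficiency from the standard DEA decomposition TE = PTE * SE.

    `technical` is the constant-returns score and `pure_technical` the
    variable-returns score; both lie in (0, 1] and TE never exceeds PTE.
    """
    if not 0.0 < technical <= pure_technical <= 1.0:
        raise ValueError("expected 0 < TE <= PTE <= 1")
    return technical / pure_technical

# Hypothetical scores for one fishing village (not values from the study).
te, pte = 0.6, 0.8
se = scale_efficiency(te, pte)
print(f"SE = {se:.2f}")
```

A scale efficiency below 1 would suggest that part of the inefficiency comes from operating at a non-optimal scale rather than from how inputs are managed.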

Evaluation of Classifiers Performance for Areal Features Matching (면 객체 매칭을 위한 판별모델의 성능 평가)

  • Kim, Jiyoung;Kim, Jung Ok;Yu, Kiyun;Huh, Yong
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography / v.31 no.1 / pp.49-55 / 2013
  • In this paper, we propose a classifier for matching different spatial data sets by applying classifier performance evaluation methods from data mining and biometrics. To this end, we calculated the distances between pairs of candidate features for each matching criterion and normalized the distances with the Min-Max method and the Tanh (TH) method. We defined classifiers in which shape similarity is derived by fusing these similarities with the CRiteria Importance Through Intercriteria Correlation (CRITIC) method, the Matcher Weighting method, and the Simple Sum (SS) method. Evaluating classifier performance with the Precision-Recall (PR) curve and the area under the PR curve (AUC-PR), we confirmed that the classifier combining TH normalization with the SS method had the highest AUC-PR, 0.893. Therefore, for matching different spatial data sets, a classifier in which the distances for the matching criteria are normalized by the TH method and shape similarity is calculated by the SS method is appropriate.
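
The normalization and fusion steps can be sketched as below. This is a simplified illustration under stated assumptions: the tanh form follows the common score-normalization formula from the biometrics literature, with the plain mean and standard deviation standing in for robust estimators, and the criteria names and distance values are hypothetical. How the paper converts distances into similarities is not given in the abstract, so the sketch fuses normalized distances directly.

```python
import math
import statistics

def min_max(scores):
    """Rescale scores linearly to the range [0, 1]."""
    lo, hi = min(scores), max(scores)
    return [(s - lo) / (hi - lo) for s in scores]

def tanh_norm(scores):
    """Tanh normalization: 0.5 * (tanh(0.01 * (s - mean) / std) + 1)."""
    mu = statistics.mean(scores)
    sigma = statistics.pstdev(scores)
    return [0.5 * (math.tanh(0.01 * (s - mu) / sigma) + 1.0) for s in scores]

def simple_sum(*criteria):
    """Simple Sum fusion: add the normalized scores of each candidate pair."""
    return [sum(values) for values in zip(*criteria)]

# Hypothetical distances for three candidate feature pairs under two criteria.
position = [2.0, 10.0, 6.0]
shape = [0.1, 0.9, 0.5]

fused = simple_sum(min_max(position), min_max(shape))
best = fused.index(min(fused))  # smallest fused distance = closest candidate
print(fused, "best candidate:", best)
```

Swapping `min_max` for `tanh_norm` in the fusion call gives the TH plus SS combination that the abstract reports as the best performer.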

An Efficient Algorithm for Spatio-Temporal Moving Pattern Extraction (시공간 이동 패턴 추출을 위한 효율적인 알고리즘)

  • Park, Ji-Woong;Kim, Dong-Oh;Hong, Dong-Suk;Han, Ki-Joon
    • Journal of Korea Spatial Information System Society / v.8 no.2 s.17 / pp.39-52 / 2006
  • The use of spatio-temporal data mining, which can extract knowledge such as the movement patterns of moving objects from their history data, has recently been increasing. However, existing movement pattern extraction methods generate many candidate movement patterns when the minimum support is low. Therefore, in this paper, we propose the STMPE (Spatio-Temporal Movement Pattern Extraction) algorithm to efficiently extract movement patterns of moving objects from large volumes of spatio-temporal data. The STMPE algorithm generalizes spatio-temporal data and minimizes memory use. Because it generates and keeps short-term movement patterns, the number of database scans can be minimized. The STMPE algorithm outperforms other movement pattern extraction algorithms that use time information as the minimum support decreases, the number of moving objects increases, and the number of time divisions increases.
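
The role of minimum support in movement pattern extraction can be sketched with a toy example. This is not the STMPE algorithm, whose details the abstract does not give; it only counts two-step moves between regions that are assumed to be already generalized, and the function name, object names, and region labels are hypothetical.

```python
from collections import Counter

def frequent_moves(trajectories, min_support):
    """Return region-to-region moves made by at least `min_support` objects."""
    counts = Counter()
    for path in trajectories.values():
        # A set counts each move once per object, so support is the
        # number of distinct objects that made the move.
        counts.update(set(zip(path, path[1:])))
    return {move: n for move, n in counts.items() if n >= min_support}

# Hypothetical generalized trajectories (sequences of region labels).
trajectories = {
    "obj1": ["A", "B", "C"],
    "obj2": ["A", "B", "D"],
    "obj3": ["B", "C", "A", "B"],
}

patterns = frequent_moves(trajectories, min_support=2)
print(patterns)
```

Lowering `min_support` to 1 keeps every observed move, which illustrates why candidate patterns multiply when the threshold is low.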

Citizen Sentiment Analysis of the Social Disaster by Using Opinion Mining (오피니언 마이닝 기법을 이용한 사회적 재난의 시민 감성도 분석)

  • Seo, Min Song;Yoo, Hwan Hee
    • Journal of Korean Society for Geospatial Information Science / v.25 no.1 / pp.37-46 / 2017
  • Disasters caused by social factors have recently been occurring frequently in Korea. Predicting which crisis might happen is difficult, which raises citizens' concern. In this study, we developed a program that acquires tweet data with the Python-based Tweepy library for social disasters such as 'Nonspecific motive crimes' and 'Oxy' products. After natural language processing, these data were used to evaluate the psychological trauma and anxiety of citizens through text clustering analysis and opinion mining analysis in RStudio. In the analysis of the 'Oxy' case, the Sewol ferry accident and the continued sale of Oxy products had the highest similarity. In the analysis of 'Nonspecific motive crimes', the government's coping measures against unexpected incidents such as the screen door incident, the Sewol ferry accident, and a 'Nonspecific motive crime' attributed to misogyny in Busan had the highest similarity. In addition, the average citizen sentiment score for Nonspecific motive crimes was 11.61%p more negative than that for the Oxy case. The findings are therefore expected to be useful for predicting the mental health of citizens in order to prevent future accidents.

A Gap Analysis Using Spatial Data and Social Media Big Data Analysis Results of Island Tourism Resources for Sustainable Resource Management (지속가능한 자원관리를 위한 섬 지역 관광자원의 공간정보와 소셜미디어 빅데이터 분석 결과를 활용한 격차분석)

  • Lee, Sung-Hee;Lee, Ju-Kyung;Son, Yong-Hoon;Kim, Young-Jin
    • Journal of Korean Society of Rural Planning / v.30 no.2 / pp.13-24 / 2024
  • This study conducts an analysis of social media big data pertaining to island tourism resources, aiming to discern the diverse forms and categories of island tourism favored by consumers, ascertain predominant resources, and facilitate objective decision-making grounded in scientific methodologies. To achieve this objective, an examination of blog posts published on Naver from 2022 to 2023 was undertaken, utilizing keywords such as 'Island tourism', 'Island travel', and 'Island backpacking' as focal points for analysis. Text mining techniques were applied to sift through the data. Among the resources identified, the port emerged as a significant asset, serving as a pivotal conduit linking the island and mainland and holding substantial importance as a focal point and resource for tourist access to the island. Furthermore, an analysis of the disparity between existing island tourism resources and those acknowledged by tourists who actively engage with and appreciate island destinations led to the identification of 186 newly emerging resources. These nascent resources predominantly clustered within five regions: Incheon Metropolitan City, Tongyeong/Geoje City, Jeju Island, Ulleung-gun, and Shinan-gun. A scrutiny of these resources, categorized according to the tourism resource classification system, revealed a notable presence of new resources, chiefly in the domains of 'rural landscape', 'tourist resort/training facility', 'transportation facility', and 'natural resource'. Notably, many of these emerging resources were previously overlooked in official management targets or resource inventories pertaining to existing island tourism resources. Noteworthy examples include ports, beaches, and mountains, which, despite constituting a substantial proportion of the newly identified tourist resources, were not accorded prominence in spatial information datasets. 
This study holds significance in its ability to unearth novel tourism resources recognized by island tourism consumers through a gap analysis approach that juxtaposes the existing status of island tourism resource data with techniques utilizing social media big data. Furthermore, the methodology delineated in this research offers a valuable framework for domestic local governments to gauge local tourism demand and embark on initiatives for tourism development or regional revitalization.
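
The core of such a gap analysis can be sketched as a set difference between resources mentioned in social media and those in an official inventory. The abstract does not describe the matching procedure, so the name normalization here is an assumption, and the function name and resource names are hypothetical.

```python
def gap_analysis(official, mentioned):
    """Return resources mentioned by tourists but missing from the inventory."""
    def normalize(name):
        # Assumed matching rule: ignore case and surrounding whitespace.
        return name.strip().lower()

    official_set = {normalize(name) for name in official}
    mentioned_set = {normalize(name) for name in mentioned}
    return sorted(mentioned_set - official_set)

# Hypothetical inventory and blog-derived mentions (not data from the study).
official = ["Lighthouse Trail", "Camellia Forest"]
mentioned = ["lighthouse trail", "Ferry Port", "Sunset Beach", "ferry port "]

new_resources = gap_analysis(official, mentioned)
print(new_resources)
```

Real place names would likely need fuzzier matching than this, for example handling alternate spellings, before the difference is meaningful.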