• Title/Summary/Keyword: 공간 빅데이터 (spatial big data)

Web crawler Improvement and Dynamic process Design and Implementation for Effective Data Collection (효과적인 데이터 수집을 위한 웹 크롤러 개선 및 동적 프로세스 설계 및 구현)

  • Wang, Tae-su;Song, JaeBaek;Son, Dayeon;Kim, Minyoung;Choi, Donggyu;Jang, Jongwook
    • Journal of the Korea Institute of Information and Communication Engineering, v.26 no.11, pp.1729-1740, 2022
  • Recently, large volumes of data have been generated as information has become more diverse and more widely used. The importance of big data analysis for collecting, storing, processing, and predicting data has grown, and the ability to collect only the necessary information is required. More than half of the web consists of text, and a great deal of data is generated through the organic interaction of users. Crawling is a representative method for collecting text data, but many crawlers are developed without considering web servers or administrators because they focus only on obtaining data. In this paper, we examine problems that may occur during crawling and the precautions to be considered, and we design and implement an improved dynamic web crawler that fetches data efficiently. The crawler, which addresses the problems of existing crawlers, was designed as a multi-process program and reduced the working time by a factor of four on average.

Introduction to Development of Comprehensive Land Management Technology Using Satellite Image Information Bigdata (위성정보 빅데이터 활용 국토종합관리 기술개발사업 소개)

  • Taejung Kim
    • Korean Journal of Remote Sensing, v.39 no.5_4, pp.1069-1073, 2023
  • A research project titled Development of Comprehensive Land Management Technology Using Satellite Image Information, funded by the Ministry of Land and Transportation, is being conducted to improve the efficiency of land management and to boost the use of satellite images in the private sector. This editorial introduces the project and the papers presented in this special issue.

Development and Analysis of the Interchange Centrality Evaluation Index Using Network Analysis (네트워크 분석을 이용한 거점평가지표 개발 및 특성분석)

  • KIM, Suhyun;PARK, Seungtae;WOO, Sunhee;LEE, Seungchul
    • Journal of Korean Society of Transportation, v.35 no.6, pp.525-544, 2017
  • With the advent of the big data era, interest in land development based on traffic data has increased significantly. However, current research on traffic big data goes little further than organizing or calibrating the data. This research proposes a novel method for discovering the hidden value in traffic data through data mining. Because traffic data and network structures are similar, network analysis algorithms are used to find valuable information in actual traffic volume data. The PageRank and HITS algorithms are then employed to find the centralities. While conventional methods derive centralities from simple traffic volume data, the proposed method provides more reasonable centrality locations through network analysis. Since the centrality locations found carry detailed spatiotemporal characteristics, this information can serve as an objective basis for policy decisions.
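
The abstract above names PageRank as one of the centrality measures applied to traffic volume data. A minimal sketch of weighted PageRank by power iteration is given here for orientation only. The interchange names and flow volumes are hypothetical, and the paper's own network construction and its HITS step are not reproduced.

```python
# Hypothetical directed traffic volumes (vehicles/hour) between interchanges.
# Names and numbers are illustrative assumptions, not data from the paper.
FLOWS = {
    ("A", "B"): 120.0,
    ("B", "C"): 300.0,
    ("A", "C"): 80.0,
    ("C", "A"): 50.0,
    ("D", "C"): 200.0,
}


def pagerank(flows, damping=0.85, iterations=100):
    """Weighted PageRank by power iteration over a {(src, dst): volume} dict."""
    nodes = sorted({node for edge in flows for node in edge})
    n = len(nodes)
    # Total outgoing volume per node, used to normalize edge weights.
    out_total = {node: 0.0 for node in nodes}
    for (src, _dst), volume in flows.items():
        out_total[src] += volume

    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        # Rank held by nodes without outgoing flow is spread uniformly.
        dangling = sum(rank[node] for node in nodes if out_total[node] == 0.0)
        base = (1.0 - damping) / n + damping * dangling / n
        new_rank = {node: base for node in nodes}
        for (src, dst), volume in flows.items():
            new_rank[dst] += damping * rank[src] * volume / out_total[src]
        rank = new_rank
    return rank


if __name__ == "__main__":
    for node, score in sorted(pagerank(FLOWS).items(), key=lambda kv: -kv[1]):
        print(f"{node}: {score:.4f}")
```

Weighting each edge by its share of the source node's outgoing volume is one plausible way to bring traffic counts into the ranking; other normalizations are possible.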

Implementation of Crime Pattern Analysis Algorithm using Big Data (빅 데이터를 이용한 범죄패턴 분석 알고리즘의 구현)

  • Cha, Gyeong Hyeon;Kim, Kyung Ho;Hwang, Yu Min;Lee, Dong Chang;Kim, Sang Ji;Kim, Jin Young
    • Journal of Satellite, Information and Communications, v.9 no.4, pp.57-62, 2014
  • In this paper, we propose and implement a crime pattern analysis algorithm using big data. The algorithm uses crime-related big data collected and published by the Supreme Prosecutors' Office. It analyzes crime patterns in Seoul from 2011 to 2013 using spatial statistics such as the standard deviational ellipse and spatial density analysis. Using crime frequency, we calculated the crime probability and danger factors by area, time, date, and place, and analyzed the results with spatial statistics. With the proposed algorithm, we could identify differences in crime patterns across Seoul and calculate the degree of risk from the crime patterns and danger factors.
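
The standard deviational ellipse mentioned in the abstract above summarizes the center, spread, and orientation of a point pattern. The sketch below computes it from the 2x2 covariance of the coordinates, assuming planar (projected) coordinates and population-style normalization. The sample points are hypothetical, and GIS packages differ in angle conventions and scaling factors, so this is not necessarily the formulation the authors used.

```python
import math


def standard_deviational_ellipse(points):
    """Return (center, major_sd, minor_sd, angle_deg) for a list of (x, y) points.

    angle_deg is the orientation of the major axis, measured counterclockwise
    from the positive x-axis (an assumed convention).
    """
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n

    # Population variances and covariance of the coordinates.
    sxx = sum((x - mean_x) ** 2 for x, _ in points) / n
    syy = sum((y - mean_y) ** 2 for _, y in points) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points) / n

    # Closed-form eigenvalues of the symmetric 2x2 covariance matrix.
    half_trace = (sxx + syy) / 2.0
    spread = math.hypot((sxx - syy) / 2.0, sxy)
    major_sd = math.sqrt(half_trace + spread)
    minor_sd = math.sqrt(max(half_trace - spread, 0.0))

    angle_deg = math.degrees(0.5 * math.atan2(2.0 * sxy, sxx - syy))
    return (mean_x, mean_y), major_sd, minor_sd, angle_deg


if __name__ == "__main__":
    # Hypothetical incident locations in projected coordinates (metres).
    incidents = [(10.0, 12.0), (14.0, 15.0), (18.0, 21.0), (22.0, 24.0), (25.0, 30.0)]
    center, major, minor, angle = standard_deviational_ellipse(incidents)
    print(center, round(major, 2), round(minor, 2), round(angle, 1))
```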

Development and Application of CCTV Priority Installation Index using Urban Spatial Big Data (도시공간빅데이터를 활용한 CCTV 우선설치지수 개발 및 시범적용)

  • Hye-Lim KIM;Tae-Heon MOON;Sun-Young HEO
    • Journal of the Korean Association of Geographic Information Studies, v.27 no.2, pp.19-33, 2024
  • CCTV for crime prevention is expanding; however, due to the absence of guidelines for determining installation locations, CCTV is being installed in locations unrelated to areas with frequent crime occurrences. In this study, we developed a CCTV Priority Installation Index and applied it in a case study area. The index consists of crime vulnerability and surveillance vulnerability indexes, calculated using machine learning algorithms to predict crime incident counts per grid and the proportion of unmonitored area per grid. We tested the index in a pilot area and found that utilizing the Viewshed function in CCTV visibility analysis resolved the problem of overestimating surveillance area. Furthermore, applying the index to determine CCTV installation locations effectively improved surveillance coverage. Therefore, the CCTV Priority Installation Index can be utilized as an effective decision-making tool for establishing smart and safe cities.

Identification of Visitation Density and Critical Management Area Regarding Marine Spatial Planning: Applying Social Big Data (해양공간계획 수립을 위한 방문밀집도 및 중점관리지역 규명: 소셜 빅데이터를 활용하여)

  • Kim, Yoonjung;Kim, Choongki;Kim, Gangsun
    • Journal of Environmental Impact Assessment, v.29 no.2, pp.122-131, 2020
  • Marine Spatial Planning is an emerging strategy that promotes sustainable development in coastal and marine areas based on the concept of ecosystem services. Methodologically, the usage rate of resources and its impact should be considered in the spatial planning process. In particular, given the rapid increase in coastal tourism, visitation patterns need to be identified across coastal areas. However, efforts to quantify visitation patterns have been limited by the high cost and labor of extensive field studies. This study therefore proposes using social big data in Marine Spatial Planning to identify spatial visitation density and critical management zones throughout coastal areas. We used GPS information from Flickr and Twitter and evaluated critical management zones by applying spatial statistics and density analysis. The results clearly showed which coastal areas along the southern sea of South Korea have relatively high numbers of visitors. The Flickr and Twitter information correlated highly with field data when a proxy that excludes over-estimation was applied and an appropriate grid scale was identified for the assessment. Overall, this study offers insights into using social big data in Marine Spatial Planning to reflect the size and usage rate of coastal tourism, which can help designate conservation areas and critical zones for intensive management to promote a constant supply of cultural services.

Federated Learning-based Route Choice Modeling for Preserving Driver's Privacy in Transportation Big Data Application (교통 빅데이터 활용 시 개인 정보 보호를 위한 연합학습 기반의 경로 선택 모델링)

  • Jisup Shim
    • The Journal of The Korea Institute of Intelligent Transport Systems, v.22 no.6, pp.157-167, 2023
  • The use of big data for transportation often involves using data that includes personal information, such as the driver's driving routes and coordinates. This study explores the creation of a route choice prediction model using a large dataset from mobile navigation apps using federated learning. This privacy-focused method used distributed computing and individual device usage. This study established preprocessing and analysis methods for driver data that can be used in route choice modeling and compared the performance and characteristics of widely used learning methods with federated learning methods. The performance of the model through federated learning did not show significantly superior results compared to previous models, but there was no substantial difference in the prediction accuracy. In conclusion, federated learning-based prediction models can be utilized appropriately in areas sensitive to privacy without requiring relatively high predictive accuracy, such as a driver's preferred route choice.
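
The federated learning approach described in the abstract above keeps each driver's data on the device and shares only model parameters. The sketch below illustrates the general idea with federated averaging of a tiny logistic route-choice model. The single feature (minutes saved by route A), the client data, and all function names are hypothetical, and the paper's actual model and aggregation scheme may differ.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def local_update(weights, samples, lr=0.1, epochs=5):
    """Gradient-descent epochs of logistic regression on one client's own data."""
    w, b = weights
    for _ in range(epochs):
        errors = [(sigmoid(w * x + b) - y, x) for x, y in samples]
        grad_w = sum(err * x for err, x in errors) / len(samples)
        grad_b = sum(err for err, _ in errors) / len(samples)
        w -= lr * grad_w
        b -= lr * grad_b
    return [w, b]


def federated_average(updates):
    """Average client weights, weighted by each client's sample count."""
    total = sum(count for _, count in updates)
    size = len(updates[0][0])
    return [sum(wts[i] * count for wts, count in updates) / total for i in range(size)]


def train(clients, rounds=20):
    """Server loop: only weights leave the clients, never the raw samples."""
    global_weights = [0.0, 0.0]
    for _ in range(rounds):
        updates = [(local_update(global_weights, data), len(data)) for data in clients]
        global_weights = federated_average(updates)
    return global_weights


# Hypothetical per-driver data: (minutes saved by route A, 1 if route A was chosen).
CLIENTS = [
    [(5.0, 1), (-3.0, 0), (2.0, 1)],
    [(-4.0, 0), (6.0, 1), (-1.0, 0), (1.0, 1)],
]

if __name__ == "__main__":
    w, b = train(CLIENTS)
    print(f"P(choose A | saves 4 min) = {sigmoid(w * 4.0 + b):.3f}")
```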

Introduction to Soil-groundwater monitoring technology for CPS (Cyber Physical System) and DT (Digital Twin) connection (CPS 및 DT 연계를 위한 토양-지하수 관측기술 소개)

  • Byung-Woo Kim;Doo-Houng Choi
    • Proceedings of the Korea Water Resources Association Conference, 2023.05a, p.14, 2023
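  • Population growth driven by industrial development, worsening drought and water scarcity caused by the climate crisis, and water pollution led the 70th UN General Assembly in 2015 to adopt the Sustainable Development Goals, with a target year of 2030, for managing the international community's water-sector crises from a water-security perspective. The water industry is also growing rapidly, and its paradigm is changing quickly because of the Fourth Industrial Revolution, advocated since 2016 by Klaus Schwab, chairman of the World Economic Forum. This refers to an innovation in computer-based analysis that links CPS (Cyber Physical System) and DT (Digital Twin). The basic concept of DT was proposed around 2002, and around 2006 the term CPS appeared as a concept equivalent to DT in embedded systems. DT is a technology that replicates (virtualization) and simulates (simulation) real-world objects, systems, and environments identically in the virtual space of a software system, and then uses the simulation results from the virtual system to optimize the real world. The six functions of DT are ① live data, ② virtualization, ③ analytics, ④ simulation, ⑤ predictions, and ⑥ automation. CPS is a complex system that combines physical elements, which have large numbers of sensors and actuators, with computing elements that control them in real time. CPS performs environmental awareness through various sensors that can detect changes in the physical world. Based on the information collected from the sensors and on advanced system models that reproduce and project the physical world, the cyber-physical space can be perceived, analyzed, and predicted. The six components of CPS are ① interoperability, ② virtualization, ③ decentralization, ④ real-time capability, ⑤ service orientation, and ⑥ modularity. DT and CPS can be regarded as technologies of the same kind that pursue essentially the same purpose, content, and results. CPS and DT can detect changes in the physical world and perform environmental awareness through observation technology that includes soil-groundwater sensors. Based on the information collected by groundwater observation technology and on advanced system models that reproduce and project the physical world, the cyber-physical space and the digital twin space can be perceived, analyzed, and predicted. What realizes the basic elements of CPS and DT is accurate and precise one-dimensional vertical profiling observation technology capable of monitoring high-quality data, together with the resulting growth of water-resource big data and the development of platforms that can store and analyze it. This study presents a soil moisture-groundwater observation device, intended for surface water-groundwater linkage, groundwater circulation and management, and the development of water purification operation and diagnosis programs using CPS- and DT-based soil moisture-groundwater observation technology, as a direction for developing groundwater platform synchronization and a digital twin simulator system.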

A Study on the Utilization of Flood Damage Map with Crowdsourcing Data (크라우드 소싱 데이터를 적용한 홍수 피해지도 활용방안 연구)

  • Lee, Jeongha;Hwang, SeokHwan
    • Proceedings of the Korea Water Resources Association Conference, 2022.05a, p.310, 2022
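  • With recent advances in communications, diverse data are being produced on the Web in real time and used across many industries. In particular, social network service (SNS) data have recently been used in disaster situations, and uploads of unstructured data by individuals, each acting as a sensor, are being used in various kinds of disaster monitoring in place of conventional numerical measurement data. When natural disasters such as floods occur, the web data uploaded by individuals include time-varying population movement and simple location information, so the actual extent of damage can be monitored more quickly and with more varied information. The hydrological data generally used during floods are observed mainly on large rivers where large-scale damage is expected, and the observation areas and data volume are limited, so research that also uses unstructured data is needed. Therefore, this study builds a web crawler that extracts unstructured data from the Web, compares and analyzes the extracted data against rainfall events and spatial patterns, and proposes ways to use a flood damage map that applies crowdsourced data.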

A Study for Space-based Energy Management System to Minimizing Power Consumption in the Big Data Environments (소비전력 최소화를 위한 빅데이터 환경에서의 공간기반 에너지 관리 시스템에 관한 연구)

  • Lee, Yong-Soo;Heo, Jun;Choi, Yong-Hoon
    • The Journal of the Institute of Internet, Broadcasting and Communication, v.13 no.6, pp.229-235, 2013
  • This paper proposes a method for reducing and managing power consumption in a Space-based Energy Management System (SEMS), in which the managed unit is the smallest space of constant size and similar characteristics. The method uses a self-learning inference engine that evolves by learning increasingly smart strategies for each space, drawing on big data collected from various information networks and on sensor information from existing Energy Management Systems (EMS). These existing systems, chiefly the Factory Energy Management System (FEMS), the Building Energy Management System (BEMS), and the Home Energy Management System (HEMS), monitor and control power through various sensors and administrators by measuring temperature and illumination.