• Title/Summary/Keyword: Spatial Big Data (공간 빅 데이터)

Analyzing Influence Factors of Foodservice Sales by Rebuilding Spatial Data : Focusing on the Conversion of Aggregation Units of Heterogeneous Spatial Data (공간 데이터 재구축을 통한 음식업종 매출액 영향 요인 분석 : 이종 공간 데이터의 집계단위 변환을 중심으로)

  • Noh, Eunbin;Lee, Sang-Kyeong;Lee, Byoungkil
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography / v.35 no.6 / pp.581-590 / 2017
  • This study analyzes the effects of floating population, locational characteristics, and spatial autocorrelation on foodservice sales using big data provided by the Seoul Institute. Although the volume of big data provided by the public sector has grown recently, differences in the aggregation units of the data make research difficult. In this study, the dependent variable, foodservice sales, is aggregated in SKT units, whereas the independent variables are provided in several different units: the aggregation unit of the Korea National Statistical Office, the administrative dong, and points. To overcome this problem, we convert all data to the SKT aggregation unit. A spatial error model (SEM) is used to analyze spatial autocorrelation. Floating population, the number of nearby workers, and the area of the aggregation unit have positive effects on foodservice sales. In addition, sales in Jung-gu, Yeongdeungpo-gu, and Songpa-gu are lower than those in Gangnam-gu. This study provides implications for further research by showing the usefulness and limitations of converting the aggregation units of heterogeneous spatial data.
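The abstract does not say how values are moved from one aggregation unit to another. One common approach is area-weighted reallocation, sketched below under strong simplifying assumptions: units are axis-aligned rectangles, values are spread uniformly within each source unit, and all names (`reaggregate`, `dong_A`, `cell_1`) are hypothetical. It is an illustration of the general idea, not the authors' procedure.

```python
from typing import Dict, Tuple

Rect = Tuple[float, float, float, float]  # (xmin, ymin, xmax, ymax)


def rect_area(r: Rect) -> float:
    """Area of an axis-aligned rectangle."""
    return (r[2] - r[0]) * (r[3] - r[1])


def overlap_area(a: Rect, b: Rect) -> float:
    """Area of the intersection of two rectangles (0.0 if they do not overlap)."""
    width = min(a[2], b[2]) - max(a[0], b[0])
    height = min(a[3], b[3]) - max(a[1], b[1])
    return width * height if width > 0 and height > 0 else 0.0


def reaggregate(
    source_units: Dict[str, Tuple[Rect, float]],
    target_units: Dict[str, Rect],
) -> Dict[str, float]:
    """Reallocate source values to target units in proportion to shared area.

    Assumes each source value is spread uniformly over its own unit.
    """
    result = {name: 0.0 for name in target_units}
    for src_rect, value in source_units.values():
        src_area = rect_area(src_rect)
        for name, tgt_rect in target_units.items():
            share = overlap_area(src_rect, tgt_rect) / src_area
            result[name] += value * share
    return result


# Hypothetical example: two source units mapped onto two target cells.
sources = {
    "dong_A": ((0.0, 0.0, 2.0, 2.0), 100.0),
    "dong_B": ((2.0, 0.0, 3.0, 2.0), 40.0),
}
targets = {
    "cell_1": (0.0, 0.0, 1.0, 2.0),
    "cell_2": (1.0, 0.0, 3.0, 2.0),
}
converted = reaggregate(sources, targets)
```

Real administrative boundaries are irregular polygons, so a practical version would compute intersections with a GIS library rather than rectangle arithmetic.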

Spatial Computation on Spark Using GPGPU (GPGPU를 활용한 스파크 기반 공간 연산)

  • Son, Chanseung;Kim, Daehee;Park, Neungsoo
    • KIPS Transactions on Computer and Communication Systems / v.5 no.8 / pp.181-188 / 2016
  • As the amount of spatial information grows, interest in spatial information processing has increased. Spatial database systems extended from traditional relational database systems have difficulty handling large data sets because of limited scalability. SpatialHadoop, which extends Hadoop, performs poorly because its spatial computations write many intermediate results to disk. This paper proposes Spatial Computation Spark (SC-Spark), an in-memory distributed processing framework that extends Spark to perform spatial operations efficiently on large-scale data. In addition, a GPGPU-based SC-Spark is developed to further improve performance. SC-Spark exploits Spark's ability to hold intermediate results in memory, and GPGPU-based SC-Spark performs spatial operations in parallel using the many processing elements of a GPU. To verify the proposed work, experiments on a single AMD system were performed using SC-Spark and GPGPU-based SC-Spark for Point-in-Polygon and spatial join operations. The experimental results showed that SC-Spark and GPGPU-based SC-Spark were up to 8 times faster than SpatialHadoop.
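For readers unfamiliar with the Point-in-Polygon operation named in the abstract, the following is a minimal single-threaded sketch using the standard ray-casting test. It is plain Python on the CPU, not the SC-Spark or GPGPU implementation, and the function name is an assumption made for illustration. Each point is tested independently of the others, which is the property that makes the operation a natural fit for data-parallel hardware.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def point_in_polygon(x: float, y: float, polygon: List[Point]) -> bool:
    """Ray-casting test: count how many polygon edges a horizontal ray
    starting at (x, y) and heading toward +x crosses. An odd count means
    the point is inside. Points exactly on an edge are not handled specially.
    """
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # The edge straddles the ray's y-coordinate (so y1 != y2 here).
        if (y1 > y) != (y2 > y):
            # x-coordinate where the edge meets the horizontal line through y.
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


square = [(0.0, 0.0), (4.0, 0.0), (4.0, 4.0), (0.0, 4.0)]
center_inside = point_in_polygon(2.0, 2.0, square)
far_right_inside = point_in_polygon(5.0, 2.0, square)
```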

A Simple Integer Sequence Code System Supporting Random Access (임의 접근을 지원하는 간단한 정수 배열 코드 시스템)

  • Lee, Junhee;Satti, Srinivasa Rao
    • KIISE Transactions on Computing Practices / v.23 no.10 / pp.594-598 / 2017
  • Tremendous quantities of numerical data are generated every day from various sources, including the stock market. Universal codes such as Elias gamma coding, Elias delta coding, and Fibonacci coding are generally used to store arrays of integers. Studies have sought to support fast access to specific elements of an integer array while occupying less space. We suggest an improved code system that utilizes the concepts of succinct data structures. This system is based on a data structure that compresses a delimiter bit array while supporting queries in constant time. Experimental results show that the encoded array uses less space without sacrificing time efficiency.
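As background for the universal codes mentioned in the abstract, here is a minimal sketch of Elias gamma coding, using strings of '0' and '1' for readability. It shows only the baseline code, not the proposed system: because codewords have variable length, reaching the k-th element requires decoding everything before it, which is the random-access problem the paper addresses with an auxiliary delimiter structure. The function names are chosen for illustration.

```python
from typing import List


def elias_gamma_encode(n: int) -> str:
    """Encode a positive integer as (len - 1) zeros followed by its binary form."""
    if n < 1:
        raise ValueError("Elias gamma coding is defined for positive integers only")
    binary = bin(n)[2:]
    return "0" * (len(binary) - 1) + binary


def elias_gamma_decode_all(bits: str) -> List[int]:
    """Decode a concatenation of Elias gamma codewords, scanning left to right."""
    values = []
    i = 0
    while i < len(bits):
        # Count leading zeros; they give the number of bits after the first '1'.
        zeros = 0
        while bits[i] == "0":
            zeros += 1
            i += 1
        values.append(int(bits[i:i + zeros + 1], 2))
        i += zeros + 1
    return values


encoded = "".join(elias_gamma_encode(v) for v in [1, 5, 9])
decoded = elias_gamma_decode_all(encoded)
```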

A Study on Space Consumption Behavior of Contemporary Consumers -Focusing on Analysis of Social Media Big Data- (현대 소비자의 공간소비행동에 관한 연구 -소셜미디어 데이터 분석을 중심으로-)

  • Ahn, Suh Young;Koh, Ae-Ran
    • Journal of the Korean Society of Clothing and Textiles / v.44 no.5 / pp.1019-1035 / 2020
  • This study examines the millennial generation, who express themselves and share information on social media after experiencing constantly changing 'hot places' (places of interest) in contemporary cities, with the goal of analyzing space consumption behaviors. Data were collected with an Instagram crawler application developed in Python 3.4, which gathered 19,262 posts containing the term 'hot places' between November 1 and December 15, 2019. Issues were derived through text mining with Textom 2.0; in addition, semantic network analysis was conducted using Ucinet6 and the NetDraw program. The results are as follows. First, a frequency analysis of keywords for hot places indicated that frequently occurring nouns were related to food, local names, SNS, and timing, while predicates contained words related to positive emotions felt during the experience and words related to behavior in hot places. In terms of importance, communication was the most important keyword and influenced all issues. Second, visualization of the semantic network analysis revealed four categories within the definition of "hot place": (1) culinary exploration, (2) atmosphere of cafés, (3) happy daily life of 'me' expressed in images, (4) emotional photos.

A Study on the Regionally Customized Urban Regeneration and Maintenance of Small and Medium Cities Using Spatial Big-Data - Focused on the Residential Census Output Area - (공간 빅데이터를 활용한 중소도시 지역맞춤형 도시재생·유지관리 연구 - 주거지역 집계구를 중심으로 -)

  • Han, Da-Hyuck;Lee, Min-Seok
    • Journal of the Korean Institute of Rural Architecture / v.23 no.2 / pp.9-16 / 2021
  • The purpose of this study is to help maintain the existing characteristics of cities by using the physical decline status and floating population of residential areas in small and medium cities. In addition, it aims to present a direction for flexible urban regeneration and maintenance that reflects regional characteristics and current conditions. Three datasets were used in this study: building data, floating population data, and census output area data. The building data and floating population data were each classified into five grades. The graded data were joined to the census output area data and analyzed by overlaying the two. From an analysis of 17 residential areas in 5 small and medium cities in Jeollanam-do, 4 types, 2 management models, and 4 indicators were derived according to grade and regional characteristics. This study is meaningful in that it enables regionally customized urban regeneration and maintenance plans and projects to be drawn up through a typology of the current status and characteristics of a region, which is an important step in a bottom-up approach.

A Study on Map Mapping of Individual Vehicle Big Data Based on Space (공간 기반의 개별 차량 대용량 정보 맵핑에 관한 연구)

  • Chong, Kyusoo
    • The Journal of The Korea Institute of Intelligent Transport Systems / v.20 no.5 / pp.75-82 / 2021
  • The number of traffic accidents is about 230,000, and because of non-recurring congestion and high driving speeds, the number of deaths per traffic accident on freeways is more than twice that on other roads. Currently, traffic information is provided on the basis of nodes and links along the road centerline, but it does not include detailed speed information. Recently, sensors that monitor obstacles and measure location have become common not only in autonomous vehicles but also in ordinary vehicles. Analyzing the large-capacity location-based data from such sensors enables real-time services, depending on processing speed. This study presents a space-based mapping method for analyzing individual vehicle data. The processing speed for large-capacity data was increased by a partition method based on quaternary notation that splits space along both longitude and latitude. As the space was partitioned further, the average speed remained similar, but the standard deviation of speed gradually decreased, and the decrease became smaller after the 9th partition.
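The abstract gives no details of the quaternary partition, so the sketch below shows one plausible reading: a quadtree-style key in which each level halves the longitude and latitude ranges and appends one base-4 digit. The digit assignment, the bounding box, and the names `quaternary_cell` and `speed_stats_by_cell` are all assumptions made for illustration, and the sample records are placeholders rather than data from the paper.

```python
from collections import defaultdict
from statistics import mean, pstdev
from typing import Dict, Iterable, Tuple

Bounds = Tuple[float, float, float, float]  # (lon_min, lat_min, lon_max, lat_max)


def quaternary_cell(lon: float, lat: float, depth: int, bounds: Bounds) -> str:
    """Return a base-4 key locating (lon, lat) after `depth` successive splits.

    Assumed digit convention: +1 if in the upper longitude half,
    +2 if in the upper latitude half.
    """
    lon_min, lat_min, lon_max, lat_max = bounds
    digits = []
    for _ in range(depth):
        lon_mid = (lon_min + lon_max) / 2
        lat_mid = (lat_min + lat_max) / 2
        digit = 0
        if lon >= lon_mid:
            digit += 1
            lon_min = lon_mid
        else:
            lon_max = lon_mid
        if lat >= lat_mid:
            digit += 2
            lat_min = lat_mid
        else:
            lat_max = lat_mid
        digits.append(str(digit))
    return "".join(digits)


def speed_stats_by_cell(
    records: Iterable[Tuple[float, float, float]], depth: int, bounds: Bounds
) -> Dict[str, Tuple[float, float]]:
    """Group (lon, lat, speed) records by cell key; return (mean, population std)."""
    groups = defaultdict(list)
    for lon, lat, speed in records:
        groups[quaternary_cell(lon, lat, depth, bounds)].append(speed)
    return {key: (mean(v), pstdev(v)) for key, v in groups.items()}


# Placeholder records on an abstract 16 x 16 grid (not real coordinates).
grid = (0.0, 0.0, 16.0, 16.0)
sample = [(5.0, 11.0, 60.0), (5.5, 11.5, 80.0), (1.0, 1.0, 30.0)]
stats = speed_stats_by_cell(sample, depth=2, bounds=grid)
```

Because a key at depth d is a prefix of the key at depth d + 1, records can be regrouped at a coarser level by truncating the key, which is one reason such encodings are convenient for large location datasets.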

PPFP(Push and Pop Frequent Pattern Mining): A Novel Frequent Pattern Mining Method for Bigdata Frequent Pattern Mining (PPFP(Push and Pop Frequent Pattern Mining): 빅데이터 패턴 분석을 위한 새로운 빈발 패턴 마이닝 방법)

  • Lee, Jung-Hun;Min, Youn-A
    • KIPS Transactions on Software and Data Engineering / v.5 no.12 / pp.623-634 / 2016
  • Most existing frequent pattern mining methods focus on time efficiency and rely heavily on primary memory. However, in the era of big data, the size of real-world databases to be mined is increasing exponentially, and primary memory is no longer sufficient for mining frequent patterns from large real-world data sets. Some studies have proposed disk-based frequent pattern mining methods to solve this problem and improve scalability, but their processing is very time consuming compared to memory-based methods. In this paper, we present PPFP, a novel disk-based approach for mining frequent itemsets from big data that reduces the main memory size bottleneck. The PPFP algorithm is based on the FP-growth method, one of the most popular and efficient frequent pattern mining approaches. Mining with PPFP consists of two steps. (1) Constructing an IFP-tree: after constructing an FP-tree, we assign an index number to each node in the FP-tree with a novel index numbering method, and then store the indexed FP-tree (IFP-tree) on disk as an IFP-table. (2) Mining frequent patterns with PPFP: frequent patterns are mined by expanding patterns with a stack-based PUSH-POP method (the PPFP method). With this approach, which uses a very small amount of memory for the recursive and time-consuming operations of the mining process, we improved the scalability and time efficiency of frequent pattern mining, as the reported test results demonstrate.
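The abstract does not specify the IFP-tree index numbering or the IFP-table layout, so they are not reproduced here. The sketch below illustrates only the general idea of replacing recursive pattern growth with an explicit push/pop stack, and it swaps in a simpler technique: depth-first enumeration over transaction-id sets (Eclat-style) held in memory, rather than an FP-tree stored on disk. All names are hypothetical.

```python
from typing import Dict, List, Sequence, Tuple


def frequent_itemsets(
    transactions: Sequence[Sequence[str]], min_support: int
) -> Dict[Tuple[str, ...], int]:
    """Enumerate frequent itemsets with an explicit stack instead of recursion."""
    # Vertical layout: item -> set of ids of the transactions that contain it.
    tidsets: Dict[str, set] = {}
    for tid, items in enumerate(transactions):
        for item in set(items):
            tidsets.setdefault(item, set()).add(tid)

    items: List[str] = sorted(i for i, t in tidsets.items() if len(t) >= min_support)
    result: Dict[Tuple[str, ...], int] = {}

    # Each stack entry: (current pattern, its transaction ids, next item index).
    stack = [((), None, 0)]
    while stack:
        prefix, tids, start = stack.pop()  # POP a pattern to expand
        for k in range(start, len(items)):
            item = items[k]
            new_tids = tidsets[item] if tids is None else tids & tidsets[item]
            if len(new_tids) >= min_support:
                pattern = prefix + (item,)
                result[pattern] = len(new_tids)
                stack.append((pattern, new_tids, k + 1))  # PUSH for later expansion
    return result


baskets = [["a", "b", "c"], ["a", "b"], ["a", "c"], ["b", "c"]]
patterns = frequent_itemsets(baskets, min_support=2)
```

With an explicit stack, memory use is governed by the entries pushed rather than by call-stack depth, which is the kind of control a disk-backed method would need.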

Performance analysis and prediction through various over-provision on NAND flash memory based storage (낸드 플래시 메모리기반 저장 장치에서 다양한 초과 제공을 통한 성능 분석 및 예측)

  • Lee, Hyun-Seob
    • Journal of Digital Convergence / v.20 no.3 / pp.343-348 / 2022
  • With the recent rapid development of technology, the amount of data generated by various systems is increasing, and enterprise servers and data centers that handle large amounts of big data need high-stability, high-performance storage devices even at increased cost. In such systems, SSDs (solid state disks), which provide high read/write performance, are often used as storage devices. However, because SSDs read and write on a page basis, erase on a block basis, and must erase before writing, performance degrades when overwrites occur. To delay this performance degradation, over-provisioning has been applied inside SSDs. However, since over-provisioning trades a large amount of storage space for performance, applying it beyond the required performance level leads to excessive cost. In this paper, we propose a method of measuring the performance and cost incurred when various over-provisioning levels are applied in an SSD and, based on these measurements, predicting the over-provisioning ratio best suited to the system. Through this research, we expect to find a trade-off with cost that meets the performance requirements of systems that process big data.
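The abstract does not describe the measurement or prediction procedure. As a minimal sketch of the trade-off it mentions, the code below computes the over-provisioning ratio using the commonly cited definition (spare capacity divided by user-visible capacity) and then picks the cheapest measured configuration that meets a performance target. The selection rule, the field names, and every number in the example are placeholders invented for illustration, not results from the paper.

```python
from typing import Dict, List, Optional


def over_provision_ratio(physical_gb: float, user_gb: float) -> float:
    """Over-provisioning ratio = (physical - user-visible) / user-visible."""
    if user_gb <= 0 or physical_gb < user_gb:
        raise ValueError("need 0 < user_gb <= physical_gb")
    return (physical_gb - user_gb) / user_gb


def cheapest_meeting_target(
    measurements: List[Dict[str, float]], required_iops: float
) -> Optional[Dict[str, float]]:
    """Return the lowest-cost measured configuration whose IOPS meets the target,
    or None if no configuration does. This selection rule is an assumption."""
    candidates = [m for m in measurements if m["iops"] >= required_iops]
    if not candidates:
        return None
    return min(candidates, key=lambda m: m["cost"])


# Placeholder measurements (made-up values, for shape only).
measured = [
    {"op_ratio": 0.07, "iops": 20000.0, "cost": 100.0},
    {"op_ratio": 0.28, "iops": 45000.0, "cost": 120.0},
    {"op_ratio": 0.50, "iops": 50000.0, "cost": 140.0},
]
choice = cheapest_meeting_target(measured, required_iops=40000.0)
ratio = over_provision_ratio(physical_gb=128.0, user_gb=100.0)
```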

A Case Study of Producing Infographics Using Tableau Public (Tableau Public을 이용한 인포그래픽 제작 사례연구)

  • Kim, Dong Hwan
    • Spatial Information Research / v.23 no.2 / pp.21-29 / 2015
  • Recently, as the volume of data has grown, many media outlets and organizations have focused on big data, data visualization, information visualization, and infographics. Domestically, Chosun.com and Hankyoreh online have advanced in the field of data visualization, and internationally, the Guardian, the Wall Street Journal, and the New York Times are leaders in this area. Until now, infographics have been widely regarded in Korea as design-oriented products. However, Tableau Public, one of the major data visualization programs, can visualize data more efficiently. In this paper, a Data Visualization Methods Quadrant for Policy Making is defined, and data analysis and infographic production are carried out. World Bank open-source data were used, and two results were obtained from the number of passenger cars per 1,000 people. First, in the high-income group, the higher the GNI per capita, the smaller the Slope, whereas in the middle-income group, GNI per capita has a positive effect on the Slope. Second, during the global financial crisis, the car ownership rate was about 1.7 times that of the usual state of the global economy. Through the case study, this paper suggests that the direction of infographic production should change from design-oriented to data-oriented. Moreover, data-oriented infographics should be promoted as a means of scientific research and policy making.

IoT Environment and Security Countermeasures in 4th Industrial Revolution (4차 산업혁명 시대의 사물인터넷 현황 및 보안 대응책)

  • Hong, Sunghyuck
    • Journal of Digital Convergence / v.17 no.11 / pp.195-200 / 2019
  • In the Fourth Industrial Revolution, the role of the Internet of Things (IoT) is to collect data at the endpoints so that big data can be analyzed to predict future events or behavior. By its nature, however, the IoT is vulnerable to security threats and requires a lightweight security protocol. The spread of IoT technology is changing our lives considerably. IT companies all over the world are already focusing on IoT-based products and services, and we are moving toward an era in which not only electronic devices but also common objects can communicate. People and people, people and things, and things and things interact without limits of time and space, collecting, analyzing, and applying information. Life is becoming smarter, but the possibility of personal information leakage is also growing. Therefore, this study identifies security threats to the protection of personal information and suggests countermeasures for building a secure IoT environment suitable for the Fourth Industrial Revolution.