http://dx.doi.org/10.17661/jkiiect.2017.10.1.1

Spatial Big Data Query Processing System Supporting SQL-based Query Language in Hadoop  

Joo, In-Hak (Electronics and Telecommunications Research Institute)
Publication Information
The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.10, no.1, 2017, pp. 1-8
Abstract
In this paper, we present a spatial big data query processing system that stores spatial data in Hadoop and queries it with an SQL-based query language. The system stores large-scale spatial data in an HDFS-based storage system and supports spatial queries expressed in an SQL-based query language extended for spatial data processing. The query language supports the standard spatial data types and functions defined in the OGC simple feature model. This paper presents the development of the core functions of the system, including query language parsing, query validation, query planning, and connection with the storage system. We compare the performance of the proposed system with that of an existing system; in our experiments, the proposed system improves query execution time by about 58% over the existing system when executing region queries on spatial data stored in Hadoop.
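As a rough illustration of what a region query computes, the following minimal Python sketch filters point features against a rectangular query region. It is not the system's implementation: the names `Point`, `Envelope`, and `region_query`, the sample coordinates, and the SQL shown in the comment are all assumptions made for illustration, and the data is held in memory rather than in HDFS.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class Point:
    """A point feature with an identifier (hypothetical schema)."""
    id: int
    x: float
    y: float


@dataclass(frozen=True)
class Envelope:
    """An axis-aligned rectangular query region."""
    min_x: float
    min_y: float
    max_x: float
    max_y: float

    def contains(self, p: Point) -> bool:
        # Boundary points are counted as inside in this sketch.
        return (self.min_x <= p.x <= self.max_x
                and self.min_y <= p.y <= self.max_y)


def region_query(points: Iterable[Point], region: Envelope) -> List[int]:
    """Return the ids of all points that fall inside the region.

    Conceptually similar to an OGC-style query such as (assumed syntax):
      SELECT id FROM poi
      WHERE ST_Within(geom, ST_GeomFromText('POLYGON((...))'));
    """
    return [p.id for p in points if region.contains(p)]


# Sample data (made up for illustration): x = longitude, y = latitude.
points = [Point(1, 127.0, 37.5), Point(2, 129.1, 35.2), Point(3, 126.9, 37.6)]
query_region = Envelope(126.7, 37.4, 127.2, 37.7)

result = region_query(points, query_region)
print(result)  # [1, 3]
```

A distributed system would apply the same predicate to partitions of the data in parallel, typically after a query planner has pruned partitions that cannot intersect the region; that part is outside the scope of this sketch.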
Keywords
Big data; Hadoop; Query language; Query processing; Spatial data; SQL