http://dx.doi.org/10.3745/KTSDE.2017.6.1.1

An Efficient Angular Space Partitioning Based Skyline Query Processing Using Sampling-Based Pruning  

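Choi, Woosung (Department of Computer Science, Korea University)
Kim, Minseok (Department of Computer Science, Korea University)
Diana, Gromyko (Department of Computer Science, Korea University)
Chung, Jaehwa (Department of Computer Science, Korea National Open University)
Jung, Soonyong (Department of Computer Science, Korea University)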
Publication Information
KIPS Transactions on Software and Data Engineering / v.6, no.1, 2017, pp. 1-8
Abstract
Given a multi-dimensional dataset of tuples, a skyline query returns the subset of tuples that are not 'dominated' by any other tuple. Skyline queries are very useful in big data analysis because they filter out uninteresting items. Much interest has been devoted to MapReduce-based parallel processing of skyline queries in large-scale distributed environments. There are three requirements for improving parallelism in MapReduce-based algorithms: (1) the workload should be well balanced, (2) redundant computations should be avoided, and (3) network communication cost should be optimized. In this paper, we introduce MR-SEAP (MapReduce sample Skyline object Equality Angular Partitioning), an efficient angular space partitioning based skyline query processing method using sampling-based pruning, which satisfies the requirements above. We conduct an extensive experimental evaluation of MR-SEAP.
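To make the notion of dominance concrete, the following is a minimal single-machine sketch of a skyline query in Python. It is not MR-SEAP itself: it involves no MapReduce, angular partitioning, or sampling-based pruning. The function names `dominates` and `skyline`, the sample data, and the convention that smaller values are preferred in every dimension are assumptions made for illustration only.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, ...]


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """Return True if a dominates b.

    Assumption: smaller is better in every dimension. Then a dominates b
    when a is no worse than b everywhere and strictly better somewhere.
    """
    no_worse = all(x <= y for x, y in zip(a, b))
    strictly_better = any(x < y for x, y in zip(a, b))
    return no_worse and strictly_better


def skyline(points: Sequence[Point]) -> List[Point]:
    """Naive O(n^2) skyline: keep each point that no other point dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


# Hypothetical data: (price, distance to the beach) for five hotels.
hotels = [(50, 8), (60, 3), (70, 5), (90, 1), (100, 2)]
result = skyline(hotels)
print(result)
```

Here (70, 5) is dominated by (60, 3) and (100, 2) is dominated by (90, 1), so both are filtered out as uninteresting, while the remaining three tuples form the skyline. The quadratic number of dominance tests in this naive version is what motivates partitioning the data and pruning early in a parallel setting.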
Keywords
Skyline Computation; MapReduce; Pruning; Data Sampling