Browse > Article
http://dx.doi.org/10.9708/jksci.2012.17.1.031

A Dual Processing Load Shedding to Improve The Accuracy of Aggregate Queries on Clustering Environment of GeoSensor Data Stream  

Ji, Min-Sub (Dept. of Computer & Information Engineering, Inha University)
Lee, Yeon (Dept. of Computer & Information Engineering, Inha University)
Kim, Gyeong-Bae (Dept. of Computer Education, Seowon University)
Bae, Hae-Young (Dept. of Computer & Information Engineering, Inha University)
Abstract
u-GIS DSMSs have been researched to deal with various sensor data from GeoSensors in ubiquitous environment. Also, they has been more important for high availability. The data from GeoSensors have some characteristics that increase explosively. This characteristic could lead memory overflow and data loss. To solve the problem, various load shedding methods have been researched. Traditional methods drop the overloaded tuples according to a particular criteria in a single server. Tuple deletion sensitive queries such as aggregation is hard to satisfy accuracy. In this paper a dual processing load shedding method is suggested to improve the accuracy of aggregation in clustering environment. In this method two nodes use replicated stream data for high availability. They process a stream in two nodes by using a characteristic they share stream data. Stream data are synchronized between them with a window as a unit. Then, processed results are merged. We gain improved query accuracy without data loss.
Keywords
GeoSensor Network; Data Stream; Cluster System; Load Sheddin;
Citations & Related Records
연도 인용수 순위
  • Reference
1 J. Chen, D.J. DeWitt, F. Tian and Y. Wang, "NiagaraCQ: a scalable continous query system for internet databases" Proc. of the ACM SIGMOD International Conference on Management of Data, pp. 379-390, 2000.
2 F. Reiss and J.M. Hellerstein, "Data triage: an adaptive architecture for load shedding in TelegraphCQ" Proc. of the International Conference on Data Engineering, pp. 155-156, 2005.
3 Gregory F. Pfister, In search of Clusters, Prentice Hall PTR, 1998.
4 M. Wiesmann, F. Pedone,A. Shiper, B. Kemme, G. Alonso, "Understanding Replication in Databases and Distributed Systems", Proc. of the 20th International onference on Distributed Computing Systems, 2000.
5 Yi-Cheng T., Song L., Sunil P., and Bin Y., "Load Shedding in Stream Databases: A Control-Based Approach," In VLDB Conference, 2006, pp. 787-798.
6 Nesume T., Ugur C., Stan Z., Mitch C., and Michael S., "Load Shedding in a Data Stream Manager," VLDB, 2003, pp. 309-320.
7 Brian B., Mayur D., and Rajeev M., "Load Shedding for Aggregation Queries over Data Stream," ICDE, 2004, pp. 350-361.
8 Ho K., Sung-Ha B., Dong-Wook L., Gyoung-Bae K., Hae-Young B., "Load Shedding applying range overlap ratio of spatial query over Data Stream," ASGIS2009, 2009, pp. 49-55.
9 Won-il Jung, Sung-sun Sin, Sung-ha Baek, Yeon Lee, Hae-young Bae, "GeoSensor Data Stream Processing System for u-GIS Computing" KSISS journal, 11-1, 2009, pp. 9-16.
10 C.H. Lee, K.W. An, M.S. Lee, J.W. Kim, "Trends of u-GIS Spatial Information Technology" Telectronics and Telecommunications Trends, ETRI, 2007.
11 D.J.Abadi, D. Carney, U. Cetintemel, M. Cherniack, C. Convey, S. Lee, M. Stonebraker, N. Tatbul and S. Zdonik, Aurora: A new model and architecture for data stream management" VLDB J. Vol 12 No. 2, pp. 120-139, 2003.   DOI   ScienceOn
12 A. Arasu and et. al., "STREAM: The Stanford Da-ta Stream Management System" http://dbpubs.standford.edu/pub/2004-20, 2004.