Browse > Article

Adaptive Buffer Control over Disordered Streams  

Kim, Hyeon-Gyu (한국과학기술원 전산학과)
Kim, Cheol-Gi (한국정보통신대학교 전산학과)
Lee, Chung-Ho (한국전자통신연구원 텔레메틱스.USN연구단)
Kim, Myoung-Ho (한국과학기술원 전산학과)
Abstract
Disordered streams may cause inaccurate or delayed results in window-based queries. Existing approaches usually leverage buffers to hand]e the streams. However, most of the approaches estimate the buffer size simply based on the maximum network delay in the streams, which tends to over-estimate the buffer size and result in high latency. In this paper, we propose a probabilistic approach to estimate the buffer size adaptively according to the fluctuated network delays. We first assume that intervals of tuple generations follow an exponential distribution and network delays have a normal distribution. Then, we derive an estimation function from the assumptions. The function takes a drop ratio as an input parameter, which denotes a percentage of tuple drops permissible during query execution. By describing the drop ratio in a query specification, users can control the quality of query results such as accuracy or latency according to application requirements. Our experimental results show that the proposed function has better adaptivity than the existing function based on the maximum network delay.
Keywords
Data Streams; Disorder; Drop Ratios; Adaptivity;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Samuel R. Madden, Mehul A. Shah, Joseph M. Hellerstein and Vijayshankar Raman, Continuously Adaptive Continuous Queries over Streams. ACM SIGMOD Conference, Madison, WI, June 2002
2 Rajeev Motwani et al, Query Proessing, Resource Management, and Approximation in a Data Stream Management System. CIDR 2003, Jan. 2003
3 Arvind Arasu et al, STREAM: The Stanford Data Stream Management System. IEEE Data Engineering Bulletin, Vol. 26 No. 1, March 2003
4 D. Abadi, D. Carney, U. Cetintemel, M. Cherniack, C. Convey, S. Lee, M. Stonebraker, N. Tatbul, S. Zdonik. Aurora: A New Model and Architecture for Data Stream Management. VLDB Journal (12)2: 120-139, August 2003
5 D. Abadi at al, The Design of the Borealis Stream Processing Engine. CIDR 2005, Asilomar, CA, January 2005
6 U. Srivastava and J. Widom. Flexible Time Management in Data Stream Systems. ACM PODS 2004, June 2004
7 S. Babu, U. Srivastava and J. Widom, Exploiting k-Constraints to Reduce Memory Overhead in Continuous Queries over Data Streams. ACM TODS, Sep. 2004
8 NS2 Sensor Network Extension: http://pf.itd.nrl.navy.mil/nrlsensorsim
9 S. Babu and J. Widom, Continuous Queries over Data Streams. ACM SIGMOD Record, Sep. 2001
10 B. Babcock, S. Babu, M. Datar, R. Motwani, and J. Widom, Models and Issues in Data Stream Systems. Invited paper in Proc. of the 2002 ACM Symp. on Principles of Database Systems (PODS 2002), June 2002
11 Dimitry P. Bertsekas and John N. Tsitsiklis, Introduction to Probability: International Edition, Athena Scientific, Belmont, Massachusetts, 2002
12 Chuck Cranor, Theodore Johnson, Oliver Spataschek and Vladislav Shkapenyuk, Gigascope: A Stream Database for Network Applications. ACM SIGMOD, June 9-12 2003
13 Douglas Terry, David Goldberg, David Nichols, and Brian Oki, Continuous Queries over Append-Only Databases. ACM SIGMOD, 1992
14 Jin Li, David Maier, Kristin Tufte, Vassilis Papadimos, Peter A. Tucker, No Pane, No Gain: Efficient Evaluation of Sliding Window Aggregates over Data Streams. SIGMOD Record, Vol 34, No. 1, March 2005
15 A. Arasu, S. Babu and J. Widom, The CQL Continuous Query Language: Semantic Foundations and Query Execution. Stanford University Technical Report, Oct. 2003
16 Sirish Chandrasekaran et al, TelegraphCQ: Continuous Dataflow Processing for an Uncertain World. CIDR 2003
17 Jin Li, David Maier, Kristin Tufte, Vassilis Papadimos, Peter A. Tucker, Semantics and Evaluation Techniques for Window Aggregates in Data Streams. ACM SIGMOD 2005, June 14-16, 2005, Baltimore, Maryland, USA
18 Peter A. Tucker, David Maier, Time Sheard, Leonidas Fegaras, Exploiting Punctuation Semantics in Continuous Data Streams. IEEE Transactions on Knowledge and Data Engineering, May/June 2003
19 J. Chen, D. J. DeWitt, F. Tian, and Y. Wang. NiagaraCQ: A scalable continuous query system for internet databases. ACM SIGMOD pages 379-390, May 2000
20 TinyDB: http://www.tinyos.net
21 SENSIM: http://csc.lsu.edu/sensor_web/simulator.html
22 David Maier, Jin Li, Peter A. Tucker, Kristin Tufte and Vassilis Papadimos, Semantics of Data Streams and Operators. ICDT 2005, LNCS 3363, pp.37-52, 2005
23 Lukasz Golab, Shaveen Garg, and M.Tamer Ozsu, On Indexing Sliding Windows over Online Data Streams, EDBT 2004, LNCS 2992, pp.712-729, 2004