Web Prefetching Scheme for Efficient Internet Bandwidth Usage

효율적인 인터넷 대역폭 사용을 위한 웹 프리페칭 기법

  • 김숙향 (포항공과대학교 정보통신대학원 정보통신학과) ;
  • 홍원기 (포항공과대학교 컴퓨터공학과)
  • Published : 2000.09.01

Abstract

As the number of World Wide Web (Web) users grows, Web traffic continues to increase at an exponential rate. Currently, Web traffic is one of the major components of Internet traffic. Also, high bandwodth usage due to Web traffic is observed during peak periods while leaving bandwidth usage idle during off-peak periods. One of the solutions to reduce Web traffic and speed up Web access is through the use of Web caching. Unfortunately, Web caching has limitations for reducing network bandwidth usage during peak periods. In this paper, we focus our attention on the use of a prefetching algorithm for reducing bandwidth during peak periods by using off-peak period bandwidth. We propose a statistical, batch, proxy-side prefetching scheme that improves cache hit rate while only requiring a small amount of storage. Web objects that were accessed many times in previous 24 hours but would be expired in the next 24 hours, are selected and prefetched in our scheme. We present simulation results based on Web proxy and show that this prefetching algorithm can reduce peak time bandwidth using off-peak bandwidth.

World Wide Web(이하 웹)은 저렴한 가격과 다양하고 흥미 있는 정보를 쉽고 간편하게 찾아볼 수 있다는 장점으로 웹의 사용자는 하루가 다르게 증가되고 있으며 웹의 사용자의 증가와 함께 웹을 통해 전달되는 데이터 즉, 웹 문서, 그림, 멀티미디어 데이터 등의 크기 또한 빠르게 증가되고 있다. 웹 트래픽을 위한 네트워크 대역폭의 사용량을 살펴보면 사용자들의 요청이 많은 peak periods에는 대부분의 대역폭을 사용하고 있고, off-peak periods에는 사용하지 않는 대역폭이 존재한다. 지금까지 네트워크의 대역폭 소비량을 감소시키고, 검색 지연시간을 줄이기 위해 많은 연구가 이루어졌고 그 해결방안 중 하나가 웹 캐싱이다. 그러나, 웹 캐싱을 사용하더라도 peak periods 동안에는 네트워크 대역폭의 사용량을 감소시키기에는 한계가 있으며 off-peak periods에 여유 있는 네트워크 대역폭을 효율적으로 사용할 수 업사. 본 논문에서는 네트워크 대역폭을 균형 있게 사용하기 위해 캐싱 서버(SQUID)를 기반으로 하는 웹 프리페칭(Web prefetching)기법을 제안한다. 24시간 동안의 웹 사용 상황을 분석하여, 가장 많이 사용되면서 다음 24시간 내에 유효기간을 초과하는 웹 객체를 프리페칭하는 방법을 사용한다. 제안된 웹 프리페칭 기법은 peak periods 동안 요청되리라 예상되는 웹 객체를 off-peak periods 동안 남는 대역폭을 이용하여 미리 캐싱 서버에 가져다 놓은 방법으로써 많은 디스크 용량을 요구하지 않으면서도 캐시 객체 히트율을 높일 수 있다. 또한 사용자들의 웹 접근 패턴을 기반으로 하기 때문에 프리페칭된 웹 객체에 대하여 높은 객체 히트율을 보인다. 본 논문에서 소개한 기법의 유효성 시뮬레이션을 통해서 증명하였다. 시뮬레이션 결과는 본 논문에서 제안된 프리페칭 기법이 효율적으로 peak bandwidth를 줄일 수 있다는 것을 나타낸다.

Keywords

References

  1. Z. Wang and J. Crowcroft, Prefetching in World Wide Web, IEEE Globecom 96, http://www.cs.ucl.ac.uk/staff/zwang/papers/prefetch.ps.z
  2. V. Padmanabhan and J. Mogul, Using Predictive Prefetching to Improve World Wide Web Latency, Computer Communication Review, 26(3):22-36, July 1996 https://doi.org/10.1145/235160.235164
  3. Ken-ichi Chinen and Suguru Yanaguchi, An Interactive Prefetching Proxy Server for Improvement of WWW Latency, INET'97, 1997, http://www.isco.org/INET97/procceding/A1/A1_3.HTM
  4. Arthur Goldberg, Ilya Pevzner and Robert Buff, Caching Characteristic of Internet and Intranet Web proxy Traces, In Computer Measurement Group Conference (CMG'98), Anaheim, CA, December 1998, http://www.cs.nyu.edu/artg
  5. David Barnes and Neil G. Smith, An Analysis of World-Wide Web Proxy Cache Performance and its Application to the Modelling and Simulation of Network Traffic, In Proceedings of the Fourth International Conference on Telecommunication Systems Modeling and Analysis, March 1996, http://www.cs.ukc.ac.uk/people/staff/djb/pubs.html
  6. Themistoklis Palpanas and Alberto Mendelzon, Web Prefetching Using Partial Match Prediction, In Web Caching Workshop WCW'99, 1999, http://www.ircache.net/Cache/Workshop99/program.html
  7. Squid Internet Object Cache, available from http://squid.nlanr.net/Squid/
  8. Gihan V.Dias, Graham Cope and Ravi Wijayaratne, A Smart Internet Caching System, INET'96 Conference, 1996, httpV/www.isoc.org/isoc/ whatis/conferences/inet/96/proceedings/a4/a4_3.htm
  9. Katsuo Doi, WWW Access by Proactively Controlled Caching Proxy, Sharp Technical Journal, No. 66, December 1996
  10. Brad Duska, David Marwood, and Michael J.Feeley, The Measured Access Characteristics of World-Wide Web Client Proxy Caches, In Usenix Symposium on Internet Technologies and Systems (USITS), Monterey, CA, USA, December 8-11, 1997, Usenix, http://www.cs.ubc.ca/spider/marwood/Projects/SPA/Report/Report.html
  11. Marc Abrams, C.R.Standridge, G.Abdulla, S. Williams, and E.A.Fox, Caching Proxies: Limitations and Potentials, In Proceedings of the Fourth International WWW Conference, 1995, http://ei.cs.vt.edu/~succeed/WWW4/WWW4.html
  12. Anawat Chankhunthod et al, A Hierarchical Internet Object Cache, Technical Conference, Usenix 1996, http://excalibur.usc.edu/cache-html/cache.html
  13. James Griffioen and Randy Appleton. Reducing File System Latency using a Predictive Approach, Proceddings of the 1994 Summer USENIX Technical Conference, Boston, Massachusetts, USA, 1994, http://usenix.org/publications/library/proceedings/bos94/griffioen.html
  14. Azer Bestavros, Speculative Data Dissemination and Service to Reduce Server Load, Network Traffic and Service Tome in Distributed Information System, In International Conference on Data Engineering, pages 180-189, New Orleans, LO, February 1996
  15. Tomas M. Kroeger, Darrell D. E. Long, and Jeffrey C. Mogul. Exploring the bounds of web latency reduction from caching and prefetching, In Proceedings of USENIX Symposium on Internet Technology and Systems, December 1997, http://www.usenix.org/publications/library/proceedings/usits97/kroeger.html
  16. Evangelos P. Margatos and Catherine E. Chronaki, A top-10 Approach to Prefetching on the Web, Technical report, In Proceedings of INET' 98 (The Internet Summit), Geneva, Switzerland, July 1998, http://www.ics.forth.gr/ proj/arch-vlsi/OS/www.html
  17. Wcol Group, WWW Collector the prefetching proxy server for WWW, 1997, http://shika.aist-nara.ac.jp/products/wcol/wcol.html
  18. Li Fan, Quinn Jacobson, Pei Cao, and Wei Lin, Web Prefetching Between Low-Bandwidth Clients and Proxies: Potential and Performance, In Proceedings of the Joint International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS '99), Atlanta, GA, May 1999, http://www.cs.wisc.edu/~cao/
  19. Wget, available from http://subzero.campus.luth.se/FreeDocs/wget-1.4.2/wget_toc.html
  20. Tim Bray, Measuring the Web, In Proceedings of the Fifth International World Wide Web Conference, pages 993-1005, Paris, France, May 1996
  21. Allison Woodruff, Paul M. Aoki, Eric Brewer, Paul Gauthier, and Lawrence A. Rowe, An Investigation of Documents from the WWW, In Proceedings of the Fifth International WWW Conference, pages 963-979, Paris, France, May 1996
  22. T. Berners-Lee, R. Fielding, and H. Frystyk, Hypertext Transfer Protocol HTTP/1.0, RFC 1945. May, 1996
  23. R. Fielding, J. Gettys, J. Mogul, H. Frystyk, L. Masinter, P. Leach and T. Berners-Lee, Hypertext Transfer Protocol - HTTP/1.1, RFC 2616, June 1999
  24. Evangelos Markatos , Catherine E. Chronaki, A Top-10 Approach to Prefetching on the Web, Technical Report No. 173, ICS-FORTH, Heraklion, Crete, Greece ,August 1996
  25. Ghaleb Abdulla, Edward A. Fox, Marc Abrams, and Stephen Williams, WWW Proxy Traffic Characterization with Application to Caching, Technical Report TR-97-03, Computer Science Department, Virginia Tech, March 1997, http://www.cs.vt.edu/~chitra/work.html
  26. James E. Pitkow, Summary of WWW characterizations, In Proceedings of the Seventh International World Wide Web Conference, Brisbane, Australia, April 1998, http://www7.scu.edu.au/programme/fullpapers/1877/coml877.htm https://doi.org/10.1016/S0169-7552(98)00066-X
  27. Pei Cao, Edward W. Felten, Anna R. Karlin, and Kai Li, A Study of Integrated Prefetching and Caching Strategies, In Proceedings of the ACM SIGMETRICS Conference on Measurement and Modeling of Computer Systems, May 1995, http://www.cs.wisc.edu/~cao/publications.html
  28. Eric A. Brewer, Paul Gauthier, and Dennis McEvoy, The long-term viability of large-scale caching, In Proceedings of the Third International WWW Caching Workshop, Manchester, England. June 1998, http://wwwcache.ja.net/events/workshop
  29. CacheRaQ of Cobalt Network, available from http://www.coblatnet.com
  30. Webspeed of Packetstorm Technologies, available from http://www.packetstorm.on.ca/products/webspeed/featuresindetail.html
  31. David H. Crocker, Standard For The Format Of Arpa Internet Text Message, RFC 822, August 13, 1982