Effective Reference Probability Incorporating the Effect of Expiration Time in Web Cache

웹 캐쉬에서 만기시간의 영향을 고려한 유효참조확률

  • Lee, Jeong-Joon (Dept. of Computer Science, Korea Advanced Institute of Science and Technology) ;
  • Moon, Yang-Se (Dept. of Computer Science, Korea Advanced Institute of Science and Technology) ;
  • Whang, Kyu-Young (Dept. of Computer Science, Korea Advanced Institute of Science and Technology) ;
  • Hong, Eui-Kyung
  • 이정준 (한국과학기술원 전산학과) ;
  • 문양세 (한국과학기술원 전산학과) ;
  • 황규영 (한국과학기술원 전산학과) ;
  • 홍의경 (서울시립대학교 전산통계학과)
  • Published : 2001.12.01

Abstract

Web caching has become an important problem addressing the performance issues in web applications. In this paper we propose a method that enhances the performance of web caching by incorporating the expiration time of web data we introduce the notion of the effective reference probability that incorporates the effect of expiration time into the reference probability used in the existing cache replacement algorithms .We formally define the effective reference probability and derive it theoretically using a probabilistic model. By simply replacing probabilities with the effective reference probability in the existing cache replacement algorithms we can take the effect of expiration time into account The results of performance evaluation through experiments show that the replacement algorithms using the effective reference probability always outperform the existing ones. The reason is that the proposed method precisely reflects the theoretical probability of getting the cache effect, and thus, incorporates the influence of the expiration time more effectively. In particular when the cache fraction is 0.05 and data update is comparatively frequent (i.e. the update frequency is more than 1/0 of the reference frequency) the performance enhancement is more than 30% in LRU-2 and 13% in Aggarwal's method (PSS integrating a refresh overhead factor) The results show that effective reference probability contributes significantly to the performance enhancement of the web cache in the presence of expiration time.

웹 캐쉬는 웹 응용의 성능향상을 위한 중요한 문제가 되었다. 본 논문에서는 웹 데이타에 부여 되는 만기시간(expiration time을 활용하여 웹 캐쉬의 성능을 향상시키는 방법을 제안한다. 먼저, 기존 캐 쉬 교체 알고리즘에서 사용된 참조확률에 만기시간의 영향을 반영한 유효참조확률(effective referenceprobability)의 개념을 제시한다. 그리고, 유효참조확률을 정형적으로 정의한 후, 확률적 모델 하에서 이 론적으로 유도한다. 기존의 교체 알고리즘에서 참조확률을 유효탐조확률로 대체하면 기존 교체 알고리즘에만기시간의 영향을 반영할 수 있다. 성능평가 결과, 유효참조확률을 사용한 알고리즘이 그렇지 않은 알고 리즘보다 항상 우수한 결과를 보였다. 이것은 제안한 방법이 만기시간을 고려하여 캐쉬효과를 얻을 이론적 인 확률을 정확히 반영한 방법으로서, 만기시간의 영향을 보다 효과적으로 반영하기 때문이다. 특히, 유효 참조확률은 캐쉬비율이 0.05이고 갱신이 비교적 자주 일어날 경우(갱신횟구가 참조횟수의 1/10이상인 경우)에 LRU-2의 경우 30% 이상. Aggarwal의 방법(refresh overhead factor가 통합된 PSS)의 경우 13%이상 성능을 개선하였다. 이 같은 결과는 유효참조확률이 만기시간이 주어지는 웹 캐쉬에서 크게 성능을 개선할 수 있음을 보여주고 있다.

Keywords

References

  1. Abrams, M., Standridge, C., Abdulla, G., Williams, S., and Fox, E .. Caching Proxies: Limitations and Potentials, In Proc. Int'l World Wide Web Conf., CERN, Boston, pp. 110-133, Dec. 1995
  2. Aggarwal, C., Wolf, J. L., and Yu, P. S., Caching on the World Wide Web, IEEE Trans. on Knowledge and Data Engineering, Vol. 11, No.1, pp. 94-107, Feb. 1999 https://doi.org/10.1109/69.755618
  3. Arlitt, M. F. and Willaiamson, C. L., Web Server Workload Characterization: The Search for Invariants, In Proc. Int'l Conf. on Measurement and Modeling of Computer Systems, ACM SIGMETRICS, Philadelphia, pp. 126-136, May 1996 https://doi.org/10.1145/233013.233034
  4. Gadde, S., Rabinovich, M., and Chase J., Reduce, Reuse, Recycle: An Approach to Building Large Internet Caches, In Proc. Workshop on Hot Topics in Operating Systems(HotOS-VI), May 1997 https://doi.org/10.1109/HOTOS.1997.595189
  5. Gwertzman, J. and Seltzer, M., World-Wide Web Cache Consistency, In Proc. USENIX Technical Conf., USENIX Association, San Diego, CA, Jan. 1996
  6. Markatos, E., Main Memory Caching of Web Documents, Computer Networks and ISDN Systems, Vol. 28, pp. 893-905, 1996 https://doi.org/10.1016/0169-7552(96)00035-9
  7. Scheuermann, P., Shim, J. H., and Vingralek, R, A Case for Delay-Conscious Caching of Web Documents, In Proc. Int'l WWW Conf. CERN, Santa Clara, California, Feb. 1997
  8. Williams, S., Abrams, M., Standridge, C. R., Abdulla, G., and Fox, E. A., Removal Policies in Network Caches for World-Wide Web Documents, In Proc. Conf. on Applications, Technologies, Architectures, and Protocols for Computer Communications, ACM SIGCOMM, Palo Alto, Aug. 1996 https://doi.org/10.1145/248156.248182
  9. Coffman, E. G. Jr., and Denning, P. J. Operating Systems Theory, Prentice Hall, 1975
  10. Liu, G., and Maguire, G. Q., A Survey of Caching and Prefetching Techniques in Distributed Systems, TR. TRITA-IT R 94-40, Dept. of Teleinformatics, Royal Institute of Technology, Oct. 1994
  11. Fielding, R, Gettys, J.. Mogul, J, C., Frystyk, H., and Berners-Lee., T., Hypertext Transfer Protocol HTTP/1.1. RFC 2068, http://nic.ddn.mil/ftp/rfc/rfc2068.txt, Jan. 1997
  12. Chankhunthod, A., Danzig, P. B., and Neerdaels, C., A Hierarchical Internet Object Cache, In Proc. USENIX Technical Conf., USENIX Association, San Diego, CA, Jan. 1996
  13. O'Neil, E. J., O'Neil, P. E., and Weikum, G., The LRU-K Page Replacement Algorithm For Database, In Proc. Int'l Conf. on Management of Data, ACM SIGMOD, Washington, D.C., pp. 297-306, May 1993 https://doi.org/10.1145/170035.170081
  14. Aho, A. V., Denning, P. J., and Ullman, J. D., Priciples of Optimal Page Replacement, Journal of ACM, Vol. 18, No. 1, pp. 80-83, Jan. 1971 https://doi.org/10.1145/321623.321632
  15. Belady, L. A., A Study of Replacement Algorithms for Virtual Storage Computers, IBM Systems Journal, Vol. 5, No.2, pp. 78-101, 1966
  16. Denning, P. J., The Working Set Model for Program Behavior, Comm of the ACM, Vol. 11, No.5, pp.323-333, May 1968 https://doi.org/10.1145/363095.363141
  17. Ross, S.M., Introduction to Probability Models, Academic Press, 1993
  18. Andrews, A., Kappe, F., Maurer, H., and Schmaranz, K., On Second Generation Hypermedia Systems, In Proc. ED-Media 95, World Conf. on Educational Multimedia and Hypermedia, June 1995
  19. Sinnwell, M., and Weikum, G., A Cost-ModelBased Online Method for Distributed Caching, In Proc. Int'l Conf. on Data Engineering, IEEE, Birmingham, U.K., pp. 532-541, 1997 https://doi.org/10.1109/ICDE.1997.582022
  20. Park, C.M., Whang, K.Y., Han, W.S., and Song, L.Y., A Cost-Based Buffer Replacement Algorithm for Object-Oriented Database Systems, Information Sciences, Vol. 138, No. 14, pp. 99-117, 2001 https://doi.org/10.1016/S0020-0255(01)00116-5
  21. Breslau, L., Cao, P., Fan, L., Philips, G., and Shenker, S., Web Cachig and ZipfIike Distributions: Evidence and Implications, In Proc. IEEE Int'l Conf. on Computer Communications (INFOCOM), New York, 1999 https://doi.org/10.1109/INFCOM.1999.749260
  22. Proc. IEEE Int'l Conf on Computer Communications(INFOCOM) Web Cachig and Zipf-like Distributions: Evidence and Implications Breslau,L.;Cao,P.;Fan,L.;Philips,G.;Shenker,S.