Browse > Article

Fast URL Lookup Using URL Prefix Hash Tree  

Park, Chang-Wook (플랜티넷 기술연구소)
Hwang, Sun-Young (서강대학교 전자공학과)
Abstract
In this paper, we propose an efficient URL lookup algorithm for URL list-based web contents filtering systems. Converting a URL list into URL prefix form and building a hash tree representation of them, the proposed algorithm performs tree searches for URL lookups. It eliminates redundant searches of hash table method. Experimental results show that proposed algorithm is $62%{\sim}210%$ faster, depending on the number of segment, than conventional hash table method.
Keywords
hash tree; URL lookup;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Web Sense Inc., http://www.websense.com
2 C. Ding, C. Chi, J. Deng, and C. Dong, "Centralized Content-based Web Filtering and Blocking: How Far Can It Go?," in Proc. IEEE Int. Conf. on Systems, Man, and Cybernetics, vol.2, pp. 115-119, Oct. 1999
3 Basso et al., "Method and System for Performing a Pattern Match Search for Text Strings," US Patent No. US 7054855 B2, 2006
4 Erik Burckart and Aravind Srinivasan, "Multidimensional hashed tree based URL matching engine using progressive hashing," US Patent Publication Number 2005-0055437 A1, 2005
5 플랜티넷, http://www.plantynet.com
6 M. Hammami, Y. Chahir, and L. Chen, "WebGuard: Web Based Adult Content Detection and Filtering System," in Proc. IEEE/WIC Int. Conf. on Web Intelligence, pp. 574-578, Oct. 2003
7 H. Yan, J. Wang, X. Li, and L. Guo, "Architectural Design and Evaluation of an Efficient Web-crawling System," in Proc. Int. Symp. on Parallel and Distributed Processing, pp. 1824-1831, Apr. 2001
8 정보통신윤리위원회, http://www.icec.or.kr
9 B. Michel, K. Nikoloudakis, P. Reiher, and L. Zhang, "URL Forwarding and Compression in Adaptive Web Caching," in Proc. IEEE. INFOCOM, vol.2, pp. 670-678, Mar. 2000
10 Google Inc., http://www.google.com
11 Secure Computing Corporation, http://www.securecomputing.com
12 P. Gupta and N. McKeown, "Algorithms for packet classification," IEEE Network, vol. 15, no. 2, pp. 24-32, March 2001
13 N. Huang, R. Liu, C. Chen, Y. Chen, and L. Huang, "Fast URL Lookup Engine for Content-Aware Multi-Gigabit Switches," in Proc. Int. Conf. on Advanced Information Networking and Applications, vol.1,  pp. 641-646, Mar. 2005
14 World Wide Web Consortium, "Platform for Internet Content Selection: PICS," http://www.w3.org/PICS/
15 한국전산원, "네트워크용 유해정보 차단도구 NCApatrol Proxy 1.0 개발 보고서", 한국전산원 연구 보고서 IV-PER-98035, 1998년 12월
16 R. Du, R. Safavi-Naini, and W. Susilo, "Web Filtering Using Text Classification," in Proc. IEEE Int. Conf. on Networks, pp. 325-330, Oct. 2003