Preliminary Performance Evaluation of a Web Crawler with Dynamic Scheduling Support

동적 스케줄링 기반 웹 크롤러의 성능분석

  • Lee, Yong-Doo (School of Computer and Communication Engineering, Daegu University) ;
  • Chae, Soo-Hwan (School of Computer and Communication Engineering, Hankuk Aviation University)
  • Published : 2003.09.01

Abstract

A web crawler is used widely in a variety of Internet applications such as search engines. As the Internet continues to grow, high performance web crawlers become more essential. Crawl scheduling which manages the allocation of web pages to each process for downloading documents is one of the important issues. In this paper, we identify issues that are important and challenging in the crawl scheduling. To address the issues, we propose a dynamic owl scheduling framework and subsequently a system architecture for a web crawler subject to the framework. This paper presents the architecture of a web crawler with dynamic scheduling support. The result of our preliminary performance evaluation made to the proposed crawler architecture is also presented.

Keywords