An Empirical Study on Changes of Web Pages (웹 문서 변화에 관한 실험적 연구)

  • Kim Sung Jin; Lee Sang Ho
  • Journal of KIISE: Databases, v.32 no.2, pp.151-160, 2005
  • As web pages are created, destroyed, and updated frequently, web databases must be refreshed to keep their copies of web pages up to date. To keep web databases fresh effectively, we need to understand how real web pages change. Previous research on web page change has focused only on the modification of page contents and has not taken the creation and destruction of web pages into account. This paper investigates web page changes that include content modification, page creation, and page destruction. We introduce three metrics, namely DR (Download Rate), MR (Modification Rate), and CAV (Coefficient of Age Variation), to represent the change of web pages. We monitored three million web pages, collected from well-known sites and randomly selected sites, every other day for one hundred days. With the Download Rate and the Modification Rate, we learned that the download success and the modification of pages depend on their past changes, and we propose two estimation formulae that predict download success and modification. With the Coefficient of Age Variation, we show that web pages do not change periodically.
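
The abstract names the three metrics but does not give their formulas, so the following is only a minimal sketch of how such quantities might be computed from a per-page crawl history. Every definition here is an assumption made for illustration, not the authors' definition: DR is taken as the fraction of download attempts that succeeded, MR as the fraction of consecutive successful downloads whose content differed, and CAV as the standard deviation of the observed "ages" (number of consecutive visits with unchanged content) divided by their mean. The function names and the checksum-list input format are hypothetical.

```python
from statistics import mean, pstdev

# Input format (hypothetical): one entry per crawl visit to a single page.
# Each entry is a content checksum string, or None if the download failed.

def download_rate(history):
    """Assumed DR: fraction of visits on which the download succeeded."""
    if not history:
        return 0.0
    return sum(1 for c in history if c is not None) / len(history)

def modification_rate(history):
    """Assumed MR: fraction of consecutive successful downloads that differ."""
    seen = [c for c in history if c is not None]
    if len(seen) < 2:
        return 0.0
    changes = sum(1 for a, b in zip(seen, seen[1:]) if a != b)
    return changes / (len(seen) - 1)

def coefficient_of_age_variation(history):
    """Assumed CAV: std. dev. of completed ages divided by their mean.

    An age is the number of consecutive successful visits during which the
    content stayed the same. The final run is skipped because it has not
    ended yet. A value near 0 means the page changes at regular intervals.
    """
    seen = [c for c in history if c is not None]
    ages, run = [], 1
    for a, b in zip(seen, seen[1:]):
        if a == b:
            run += 1
        else:
            ages.append(run)
            run = 1
    if len(ages) < 2:
        return None  # too few completed ages to measure variation
    return pstdev(ages) / mean(ages)

# Example: a page that changes every second visit, with one failed download.
history = ["a", "a", "b", "b", "c", None, "c"]
dr = download_rate(history)
mr = modification_rate(history)
cav = coefficient_of_age_variation(history)
```

Under these assumed definitions, a page that changes at perfectly regular intervals yields a CAV of 0, while irregular change intervals push it upward, which is one way to read the abstract's claim that pages do not change periodically.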