• Title/Summary/Keyword: 웹수집기 성능 비교

Search Result 1, Processing Time 0.019 seconds

Comparison of Web Crawler Performance for Web Record Management (원격수집 방식의 웹기록물 관리를 위한 웹수집기 성능 비교 연구)

  • Chang, Jinho;Kwon, Hyuksang;Lee, Kyumo;Choi, Dong Joon
    • The Korean Journal of Archival Studies
    • /
    • no.74
    • /
    • pp.155-186
    • /
    • 2022
  • As of 2022, the number of Internet sites for public institutions registered on the 'Government 24' website (www.gov.kr) of the Ministry of the Interior and Safety is 17,000. The direct transfer takes a lot of human and material resources and time between the records-producing institution and the records-management institution that manages websites as records. In addition, it is practically difficult for records management institutions to migrate and operate various software and application technologies required to run each website. A method of automatically collecting websites from a remote location using web crawler software is used domestically and abroad to overcome these practical limitations. This study compared the performance of the web crawler required to collect and manage public Internet websites as records remotely. The most suitable web crawler was selected through a step-by-step review of several web crawlers from previous studies and other literature. Several public agency websites were applied to compare the actual performance of the crawlers in the evaluation process. The study provides empirical and specific performance comparison information for organizations that need to choose a web crawler.