Acknowledgement
This paper was supported by Wonkwang University in 2021.