Browse > Article
http://dx.doi.org/10.6109/jkiice.2015.19.3.552

Implement on Search Machine using Open Source Framework  

Song, Hyun-Ok (Department of Computer Engineering, Paichai University)
Kim, A-Yong (Department of Computer Engineering, Paichai University)
Jung, Hoe-Kyung (Department of Computer Engineering, Paichai University)
Abstract
IT technology development and smart appliances due to the increased use of a lot of data on production and consumption has become in the internet. Because this is why importance of information retrieval technology although the growing becoming aware of the difficult techniques to access the required of lot a background knowledge on information retrieval technology. However, the Lucene due to emerge provide to background can implement on search engine by using the Lucene of lack background knowledge for search technology. In this paper, suggest to implement on search engine by using the developed a framework on Lucene-based. Suggest a frameworks are use in the search engines on have guarantee in server environment support on distributed processing and distributed storage, and high availability by using the Hadoop and Nutch, Solr, Zookeeper.
Keywords
Hadoop; Lucene; Nutch; Search Engine; Solr; YARN;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Hee-Seok Park, "Effective Travel Information Search on the Internet Search Engine" Korea Academic Society of Tourism Management, Vol. 15, No. 1, pp. 212-231, 2010.
2 Heydon, Allan, and Marc Najork, "Mercator: A scalable, extensible web crawler." World Wide Web 2.4, pp.219-229, 1999.   DOI
3 Shkapenyuk, Vladislav, and Torsten Suel, "Design and implementation of a high-performance distributed web crawler." IEEE 18th International Conference on, 2002.
4 Apache Hadoop, http://hadoop.apache.org/, 2014.
5 Dean, Jeffrey, and Ghemawat. Sanjay, "MapReduce: simplified data processing on large clusters." Communications of the ACM 51.1, pp.107-113, 2008.   DOI
6 Vavilapalli, Vinod Kumar, et al., "Apache hadoop yarn: Yet another resource negotiator." ACM Proceedings of the 4th annual Symposium on Cloud Computing, 2013.
7 Apache Nutch, http://nutch.apache.org/, 2014.
8 Apacje Lucene, http://lucene.apache.org/core/, 2014.
9 Apache Solr Reference Guide Covering Apache Solr 4.8, https://archive.apache.org/dist/lucene/solr/ref-guide/apache-solr-ref-guide-4.8.pdf, 2014.
10 Apache Zookeeper, http://zookeeper.apache.org/, 2014.
11 Hunt, Patrick, et al., "ZooKeeper: Wait-free Coordination for Internet-scale Systems." USENIX Annual Technical Conference, Vol.8, 2010.