Design and Implementation of a User-based Collaborative Filtering Application using Apache Mahout - based on MongoDB -

  • Lee, Junho (Dept. of Computer Science, Soonchunhyang University)
  • Joo, Kyungsoo (Dept. of Computer Software Engineering, Soonchunhyang University)
  • Received : 2018.02.19
  • Accepted : 2018.04.03
  • Published : 2018.04.30

Abstract

The volume of available information has grown rapidly in recent years, making it hard for users to find the information that is relevant to them. Recommendation systems are one way to help individuals make decisions amid this much information. Among the many recommendation methods, collaborative filtering is a representative one. In this paper, we design and implement a movie recommendation system that uses the user-based collaborative filtering of Apache Mahout, based on MongoDB. The Pearson correlation coefficient is used to measure the similarity between users. For performance evaluation, we measure precision and recall on the MovieLens 100k dataset.
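
The approach summarized above (Pearson similarity between users, recommendation from similar users, evaluation by precision and recall) can be illustrated with a minimal Python sketch. It is not the Mahout or MongoDB implementation described in the paper: the toy `ratings` dictionary, the function names `pearson`, `recommend`, and `precision_recall`, and the parameters `k` and `n` are all assumptions made for illustration.

```python
from math import sqrt

# Hypothetical toy data (not MovieLens): user -> {movie: rating}.
ratings = {
    "u1": {"m1": 5, "m2": 3, "m3": 4},
    "u2": {"m1": 4, "m2": 2, "m3": 3, "m4": 5, "m5": 1},
    "u3": {"m1": 1, "m2": 5, "m3": 2, "m4": 1, "m5": 5},
}


def pearson(ratings_a, ratings_b):
    """Pearson correlation over co-rated items; 0.0 when it is undefined."""
    common = sorted(set(ratings_a) & set(ratings_b))
    if len(common) < 2:
        return 0.0
    xs = [ratings_a[i] for i in common]
    ys = [ratings_b[i] for i in common]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def recommend(ratings, user, k=2, n=3):
    """Rank unseen items by the similarity-weighted ratings of the k nearest users."""
    sims = [(pearson(ratings[user], ratings[other]), other)
            for other in ratings if other != user]
    # Keep only positively correlated users, most similar first.
    neighbors = sorted((s for s in sims if s[0] > 0), reverse=True)[:k]
    num, den = {}, {}
    for sim, other in neighbors:
        for item, rating in ratings[other].items():
            if item in ratings[user]:
                continue  # skip items the target user has already rated
            num[item] = num.get(item, 0.0) + sim * rating
            den[item] = den.get(item, 0.0) + sim
    scores = {item: num[item] / den[item] for item in num}
    return sorted(scores, key=lambda item: (-scores[item], item))[:n]


def precision_recall(recommended, relevant):
    """Precision and recall of a recommendation list against a relevant set."""
    hits = len(set(recommended) & set(relevant))
    precision = hits / len(recommended) if recommended else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


top = recommend(ratings, "u1", k=2, n=2)
print(top, precision_recall(top, {"m4"}))
```

On this toy data, `u2` correlates perfectly with `u1` while `u3` correlates negatively and is excluded, so the sketch recommends `m4` ahead of `m5`. If only `m4` is treated as relevant, precision is 0.5 and recall is 1.0.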
