Improving Performance of Jaccard Coefficient for Collaborative Filtering

  • Lee, Soojung (Dept. of Computer Education, Gyeongin National University of Education)
  • Received : 2016.09.20
  • Accepted : 2016.11.04
  • Published : 2016.11.30

Abstract

In recommender systems based on collaborative filtering, similarity measurement is critical because it determines which users are selected as recommenders. Data sparsity is a fundamental problem in collaborative filtering systems, and it is partly addressed by combining the Jaccard coefficient with traditional similarity measures. This study proposes a new coefficient that improves on the Jaccard coefficient by compensating for its drawbacks. We conducted experiments on datasets with various characteristics to analyze performance. Compared with Pearson correlation, the similarity metric most widely used to date, the proposed coefficient performed competitively on a dense dataset and much better on a sparser dataset. Compared with the Jaccard coefficient, the proposed coefficient's advantage grew as the dataset became denser. Overall, the proposed coefficient demonstrated the best prediction and recommendation performance among the metrics tested.
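For orientation, the two baseline metrics named in the abstract can be sketched as follows. This is a minimal illustration of the standard Jaccard coefficient and Pearson correlation between two users, not the coefficient proposed in the paper, which the abstract does not define. The function names, the dictionary representation of ratings, the use of co-rated item means in Pearson, and the simple product used to combine the two are all assumptions made for this example.

```python
from math import sqrt


def jaccard(ratings_u, ratings_v):
    """Jaccard coefficient: co-rated items divided by items rated by either user."""
    items_u, items_v = set(ratings_u), set(ratings_v)
    union = items_u | items_v
    if not union:
        return 0.0
    return len(items_u & items_v) / len(union)


def pearson(ratings_u, ratings_v):
    """Pearson correlation computed over co-rated items only (an assumption;
    some systems use each user's overall mean rating instead)."""
    common = sorted(set(ratings_u) & set(ratings_v))
    if len(common) < 2:
        return 0.0  # correlation is undefined with fewer than two shared items
    mean_u = sum(ratings_u[i] for i in common) / len(common)
    mean_v = sum(ratings_v[i] for i in common) / len(common)
    num = sum((ratings_u[i] - mean_u) * (ratings_v[i] - mean_v) for i in common)
    den_u = sqrt(sum((ratings_u[i] - mean_u) ** 2 for i in common))
    den_v = sqrt(sum((ratings_v[i] - mean_v) ** 2 for i in common))
    if den_u == 0 or den_v == 0:
        return 0.0  # one user gave identical ratings to all shared items
    return num / (den_u * den_v)


def jaccard_weighted_pearson(ratings_u, ratings_v):
    """One common way to combine the two: scale Pearson by the Jaccard overlap.
    Hypothetical helper for illustration, not the paper's proposed metric."""
    return jaccard(ratings_u, ratings_v) * pearson(ratings_u, ratings_v)


# Hypothetical ratings: item id -> rating on a 1-5 scale
alice = {"a": 5, "b": 3, "c": 4}
bob = {"a": 4, "b": 2, "c": 5, "d": 1}

print(jaccard(alice, bob))   # 3 shared items out of 4 rated in total
print(pearson(alice, bob))
print(jaccard_weighted_pearson(alice, bob))
```

Note that the Jaccard coefficient ignores the rating values themselves and counts only which items were rated. This is one commonly cited limitation of the plain coefficient when it is used on its own.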
