Browse > Article
http://dx.doi.org/10.9708/jksci.2016.21.11.121

Improving Performance of Jaccard Coefficient for Collaborative Filtering  

Lee, Soojung (Dept. of Computer Education, Gyeongin National University of Education)
Abstract
In recommender systems based on collaborative filtering, measuring similarity is very critical for determining the range of recommenders. Data sparsity problem is fundamental in collaborative filtering systems, which is partly solved by Jaccard coefficient combined with traditional similarity measures. This study proposes a new coefficient for improving performance of Jaccard coefficient by compensating for its drawbacks. We conducted experiments using datasets of various characteristics for performance analysis. As a result of comparison between the proposed and the similarity metric of Pearson correlation widely used up to date, it is found that the two metrics yielded competitive performance on a dense dataset while the proposed showed much better performance on a sparser dataset. Also, the result of comparing the proposed with Jaccard coefficient showed that the proposed yielded far better performance as the dataset is denser. Overall, the proposed coefficient demonstrated the best prediction and recommendation performance among the experimented metrics.
Keywords
Collaborative Filtering; Recommender System; Similarity Measure; Jaccard Coefficient;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 D. Jannach, Z. Karakaya, and F. Gedikli, "Accuracy Improvements for Multi-criteria Recommender Systems," Proc. of the ACM Conf. Electronic Commerce, pp. 674-689, 2012.
2 X. Su and T.M. Khoshgoftaar, "A Survey of Collaborative Filtering Techniques," Advances in Artificial Intelligence, 2009.
3 A. Bellogin and A. P. de Vries, "Understanding Similarity Metrics in Neighbour-based Recommender Systems," Proceedings of the 2013 Conference on the Theory of Information Retrieval, 2013.
4 H. Ahn, "A New Similarity Measure for Collaborative Filtering to Alleviate the New User Cold-starting Problem," Information Sciences, Vol. 178, No. 1, pp. 37-51, Jan. 2008.   DOI
5 S. Lee, "Performance Analysis of Similarity Reflecting Jaccard Index for Solving Data Sparsity in Collaborative Filtering," The Journal of Korean Association of Computer Education, Vol. 19, No. 4, pp. 59-66, July 2016.
6 H. Liu, Z. Hu, A. Mian, H. Tian, and X. Zhu, "A New User Similarity Model to Improve the Accuracy of Collaborative Filtering," Knowledge-Based Systems, Vol. 56, pp. 156-166, Jan. 2014.   DOI
7 K. G. Saranya, G. S. Sadasivam, and M. Chandralekha, "Performance Comparison of Different Similarity Measures for Collaborative Filtering Technique," Indian Journal of Science and Technology, Vol. 9, No. 29, Aug. 2016.
8 J. Bobadilla, F. Ortega, A. Hernando, and J. Bernal, "A Collaborative Filtering Approach to Mitigate the New User Cold Start Problem," Knowledge-Based Systems, Vol. 26, pp. 225-238, Feb. 2012.   DOI
9 G. Koutrica, B. Bercovitz, and H. Garcia, "FlexRecs: Expressing and Combining Flexible Recommendations," Proc. of the ACM SIGMOD Int'l Conf. on Management of Data, pp. 745-758, 2009.
10 J. Bobadilla, F. Serradilla, and J. Bernal. "A New Collaborative Filtering Metric That Improves the Behavior of Recommender Systems," Knowledge-Based Systems, Vol. 23, pp. 520-528, Aug. 2010.   DOI
11 H-F. Sun, et al., "JacUOD: A New Similarity Measurement for Collaborative Filtering," Journal of Computer Science and Technology, Vol. 27, No. 6, pp. 1252-1260, Nov. 2012.   DOI
12 B. Jeong, J. Lee, and H. Cho, "Improving Memory-based Collaborative Filtering via Similarity Updating and Prediction Modulation," Information Sciences, Vol. 180, No. 5, pp. 602-612, Mar. 2010.   DOI
13 M. Gao, Z. Wu, and F. Jiang, "Userrank for Item-based Collaborative Filtering Recommendation," Information Processing Letters, Vol. 111, pp. 440-446, Apr. 2011.   DOI