A Prediction System of User Preferences for Newly Released Items Based on Words

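  • 최윤석 (School of Computer Science and Engineering, Seoul National University);
  • 문병로 (School of Computer Science and Engineering, Seoul National University)
  • Published: 2006.04.01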

Abstract

Collaborative filtering (CF) systems are widely used for recommendation because they are easy to implement and perform well. However, they suffer from several problems, including data sparsity, the inability to recommend newly released items (the first-rater problem), and the lack of any explanation for their recommendations, and many studies have tried to address these problems. The effect of the sparsity problem lessens as user data accumulate, but the first-rater problem is inherent to CF, so a number of studies have proposed combining CF with content-based (CB) methods. CF systems are also black boxes that give no account of how a recommendation was produced. In this paper we present a content-based prediction system built on preference words, which exposes the reasoning behind a recommendation. The system predicts a user's rating of a newly released movie and presents the preference words on which the prediction is based. We also propose a semiotic network-based method to prevent the performance degradation caused by the attribute mismatching problem in existing content-based prediction systems. For experimental comparison, we used the EachMovie database and movie promotional data from IMDb.
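As a rough illustration of the idea in the abstract, the sketch below predicts a rating for an unseen movie from the words in its description and returns the words that most influenced the prediction. It is not the authors' method: the scoring rule (the mean of per-word rating offsets) and the names `build_word_profile` and `predict_with_reasons` are assumptions made for this example, and the semiotic network step is not modeled.

```python
from collections import defaultdict


def build_word_profile(rated_items):
    """Build a per-user word profile from (words, rating) pairs.

    Returns the user's mean rating and a dict mapping each word to the
    average amount by which items containing it deviate from that mean.
    Hypothetical helper for illustration only.
    """
    mean = sum(rating for _, rating in rated_items) / len(rated_items)
    totals = defaultdict(float)
    counts = defaultdict(int)
    for words, rating in rated_items:
        for word in set(words):
            totals[word] += rating - mean
            counts[word] += 1
    profile = {word: totals[word] / counts[word] for word in totals}
    return mean, profile


def predict_with_reasons(mean, profile, new_words, top_k=3):
    """Predict a rating for a new item and list the most influential words.

    Words absent from the profile are ignored; if none match, the user's
    mean rating is returned with no explanation (the mismatch case).
    """
    known = [word for word in set(new_words) if word in profile]
    if not known:
        return mean, []
    score = mean + sum(profile[word] for word in known) / len(known)
    # Strongest offsets first; ties broken alphabetically for determinism.
    reasons = sorted(known, key=lambda w: (-abs(profile[w]), w))[:top_k]
    return score, reasons


history = [
    ({"space", "alien", "war"}, 5),
    ({"space", "romance"}, 4),
    ({"romance", "wedding"}, 1),
]
user_mean, word_profile = build_word_profile(history)
rating, why = predict_with_reasons(
    user_mean, word_profile, {"space", "alien", "comedy"}
)
print(round(rating, 2), why)
```

Because the new movie needs only a textual description and no ratings from other users, a scheme of this kind can score an item on the day it is released. When none of its words appear in the user's history, the sketch can only fall back to the user's mean rating, which is the kind of mismatch the abstract's semiotic network is meant to reduce.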
