Browse > Article
http://dx.doi.org/10.9708/jksci.2021.26.06.115

Methodology for Search Intent-based Document Recommendation  

Lee, Donghoon (Graduate School of Business IT, Kookmin University)
Kim, Namgyu (Graduate School of Business IT, Kookmin University)
Abstract
It is not an easy task for a user to find the correct documents that a user really wanted at once from a vast amount of the search results. For this reason, various methods of recommending documents by taking the user's preferences into consideration based on the user's document browsing history have been proposed. However, the document recommendation methodology based on the document browsing history also has a limitation that only the information the user has viewed is utilized, but the intent of the user searching for the document is not fully utilized. Therefore, we propose a document recommendation method based on the user's search intent that utilizes information on "Why" the user reads the document, instead of the information on "Who" reads the document. In order to confirm the feasibility of the proposed methodology, an experiment was conducted by analyzing 239,438 actual user's search history of one of the most popular e-commerce platform companies in Korea. As a result, our methodology showed superior performance compared to the existing content-based or simple browsing history-based recommendation model.
Keywords
Document Recommendation; Search Intent; Text Mining; TF-IDF; User Access Log;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 J. Seo, T. Shon, J. Seo, and J. Moon, "A study on the filtering of spam e-mail using n-Gram indexing and support vector machine," Journal of the Korea Institute of Information Security & Cryptology, Vol. 14, No. 2, pp. 23-33, Apr. 2004.   DOI
2 J. Macqueen, "Some methods for classification and analysis of multivariate observations," Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, pp. 281-297, Berkeley, USA, Jun. 1967.
3 G. Salton and M. J. McGill, "Introduction to modern information retrieval," McGraw-Hill, pp. 1-448, 1983.
4 J. L. Herlocker, J. A. Konstan, L. G. Terveen, and J. T. Riedl, "Evaluating Collaborative Filtering Recommender Systems," ACM Transactions on Information Systems, Vol. 22, No. 1, pp. 5-53, Jan. 2004. DOI: 10.1145/963770.963772   DOI
5 O. Park and H. Park, "A Study on the International Research Trends in Electronic Records Management: InterPARES 3 and ITrust Achievements," Journal of Korean Society of Archives and Records Management, Vol. 16, No. 1, pp. 89-120, Feb. 2016. DOI: 10.14404/JKSARM.2016.16.1.089   DOI
6 C. H. Cai, A. W. C. Fu, C. H. Cheng, and W. W. Kwong, "Mining association rules with weighted items," Proceedings. IDEAS'98. International Database Engineering and Applications Symposium, pp. 68-77, Cardiff, UK, Jul. 1998. DOI: 10.1109/IDEAS.1998.694360   DOI
7 T. Shin, K. Chang, and Y. Park, "Customer Recommendation Using Customer Preference Estimation Model and Collaborative Filtering," Korea Intelligent Information Systems Society, Vol. 12, No. 4, pp. 1-14, Dec. 2006.
8 R. Agrawal, T. Imielinski, and A. Swami, "Mining association rules between sets of items in large databases," Proceedings of the 1993 ACM SIGMOD international conference on Management of data, pp. 207-216, New York, NY, USA, Jan. 1993. DOI: 10.1145/170035.170072   DOI
9 D. Lee, "A Regression-Model-based Method for Combining Interestingness Measures of Association Rule Mining," Journal of Intelligence and Information Systems, Vol. 23, No. 6, pp.127-141, Mar. 2017. DOI: 10.13088/JIIS.2017.23.1.127   DOI
10 C. D. Manning, P. Raghavan, and H. Schutze, "Introduction to Information Retrieval," Cambridge University Press, pp. 1-506, 2008.
11 R. Baeza-Yates and B. Ribeiro-Neto, "Modern Inormation Retrieval: The concepts and technology behind search (2nd. ed.)," Addison-Wesley Publishing Company, pp. 1-913, 2011.
12 H. Choi and H. Varian, "Predicting the present with google trends," Econ. Record, Vol. 88, No. 1, pp. 2-9, Jun. 2012. DOI: 10.1111/j.1475-4932.2012.00809.x   DOI
13 J. Kim, J. Suh, D. Ahn, and Y. Cho, "A Personalized Recommendation Methodology based on Collaborative Filtering," Journal of Intelligence and Information Systems, Vol. 8, No. 2, pp. 139-157, Dec. 2002.
14 D. Lee, "A Study on the Improvement of Recommendation Accuracy by Using Category Association Rule Mining," Journal of Intelligence and Information Systems, Vol. 26, No. 2, pp. 27-42, Jun. 2020. DOI: 10.13088/JIIS.2020.26.2.027   DOI
15 Y. Cho and J. Kim, "Application of Web usage mining and product taxonomy to collaborative recommendations in e-commerce," Expert Systems with Applications, Vol. 26, No. 2, pp. 233-246, Feb. 2004. DOI: 10.1016/s0957-4174(03)00138-6   DOI
16 N. Kim, D. Lee, H. Choi, and W. X. S. Wong, "Investigations on Techniques and Applications of Text Analytics," The Journal of Korean Institute of Communications and Information Sciences, Vol. 42, No. 2, pp. 471-492, Feb. 2017. DOI: 10.7840/kics.2017.42.2.471   DOI
17 D. D. Lee and H. S. Seung, "Learning the parts of objects by non-negative matrix factorization," Nature, Vol. 401, No. 6755, pp. 788-791, Oct. 1999. DOI: 10.1038/44565   DOI
18 P. Resnick, N. Iacovou, M. Suchak, P. Bergstrom, and J. Riedl, "GroupLens: an open architecture for collaborative filtering of netnews," Proceedings of the 1994 ACM conference on Computer supported cooperative work, pp.175-186, New York, NY, USA, Oct. 1994. DOI: 10.1145/192844.192905   DOI
19 D. M. Blei, A. Y. Ng, and M. I. Jordan, "Latent dirichlet allocation," The Journal of Machine Learning Research, Vol. 3, pp. 993-1022, Jan. 2003.
20 A. Lee, K. Choi, and G. Kim, "LDA Topic Modeling and Recommendation of Similar Patent Document Using Word2vec," Information Systems Review, Vol. 22, No. 1, pp. 17-31, Feb. 2020.   DOI
21 J. Kim, J. Byun, D. Sun, T. Kim, and Y. Kim, "A Model for Measuring the R&D Project Similarity using Patent Information," Journal of the Korea Institute of Information and Communication Engineering, Vol. 18, No. 5, pp. 1013-1021, May. 2014. DOI: 10.6109/JKIICE.2014.18.5.1013   DOI
22 Y. Bai and S. Park, "LEXAI : Legal Document Similarity Analysis Service using Explainable AI," Journal of Computing Science and Engineering, Vol. 47, No. 11, pp. 1061-1070, Nov. 2020. DOI: 10.5626/JOK.2020.47.11.1061   DOI
23 Y. Yoo, J. Kim, B. Sohn, and J. Jung, "Evaluation of Collaborative Filtering Methods for Developing Online Music Contents Recommendation System," The Transactions of The Korean Institute of Electrical Engineers, Vol. 66, No. 7, pp. 1083-1091, Jul. 2017. DOI: 10.5370/KIEE.2017.66.7.1083   DOI
24 H. Lee and J. Kim, "Issue Keyword Extraction Method Using Document Similarity Method-Focused on Internet Articles -," Asia-pacific Journal of Multimedia services convergent with Art, Humanities, and Sociology, Vol. 7, No. 8, pp. 383-391, Aug. 2017. DOI: 10.35873/ajmahs.2017.7.8.035   DOI
25 J. Konstan, B. Miller, D. Maltz, J. Herlocker, L. Gordon, and J. Riedi, "GroupLens: applying collaborative filtering to Usenet news," Communications of the ACM, Vol. 40, No. 3, pp. 77-87, Mar. 1997. DOI: 10.1145/245108.245126   DOI
26 S. Lee, Y. Cho, J. Lee, and D. Yu, "Comparative study of recommender systems using movie rating data," Journal of the Korean Data And Information Science Sociaty, Vol. 31, No. 6, pp. 975-991, Nov. 2020. DOI: 10.7465/jkdi.2020.31.6.975   DOI
27 D. Reinsel, J. Gantz, and J. Rydning, Data Age 2025: The Evolution of Data to Life-Critica, https://www.import.io/wp-content/uploads/2017/04/Seagate-WP-DataAge2025-March-2017.pdf
28 Y. Cho, J. Kim, and S. Kim, "A personalized recommender system based on web usage mining and decision tree induction," Expert Systems with Applications, Vol. 23, No. 3, pp. 329-342, Oct. 2002. DOI: 10.1016/s0957-4174(02)00052-0   DOI
29 Y. Wu and A. Chen, "Index structures of user profiles for efficient web page filtering services," Proceedings 20th IEEE International Conference on Distributed Computing Systems, p. 644-651, Taipei, Taiwan, Apr. 2000. DOI: 10.1109/ICDCS.2000.840981   DOI
30 D. Goldberg, D. Nichols, B. M. Oki, and D. Terry, "Using collaborative filtering to weave an information tapestry," Communications of the ACM, Vol. 35, No. 12, pp. 61-70. Dec. 1992. DOI: 10.1145/138859.138867   DOI
31 G. Salton, A. Wong, and C. S. Yang, "A vector space model for automatic indexing," Communications of the ACM, Vol. 18, No. 11, pp. 613-620, Nov. 1975. DOI: 10.1145/361219.361220   DOI
32 W. Seo, H. Park, and J. Yoon, "An exploratory study on the korean national R&D trends using co-word analysis," Journal of Information Technology Applications & Management, Vol. 19, No. 4, pp. 1-18, Dec. 2012. DOI: 10.21219/JITAM.2012.19.4.001   DOI
33 G. Salton, "The SMART Retrieval System-Experiments in Automatic Document Processing," Prentice Hall, pp. 1-556, 1971.
34 H. Hotelling, "Analysis of a complex of statistical variables into principal components," Journal of Educational Psychology, Vol. 24, No. 6, pp. 417-441, Sep. 1933. DOI: 10.1037/h0071325   DOI
35 G. W. Stewart, "On the early history of the singular value decomposition," SIAM Review, Vol. 35, No. 4, pp. 551-566, Dec. 1993. DOI: 10.1137/1035134   DOI
36 V. Vapnik, "Estimation of Dependences Based on Empirical Data," Springer Verlag, pp. 1-523, 1982.
37 H. Choi and E. Hwang, "Emotion-based Music Recommendation System based on Twitter Document Analysis," KIISE Transactions on Computing Practices, Vol. 18, No. 11, pp. 762-767, Nov. 2012.
38 J. Son, S. Kim, H. Kim, and S. Cho, "Review and Analysis of Recommender Systems," Journal of Korean Institute of Industrial Engineers, Vol. 41, No. 2, pp. 185-208, Apr. 2015.   DOI
39 K. Pearson, "On lines and planes of closest fit to systems of point in space," The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science Series 6, Vol. 2, No. 11, pp. 559-572, Nov. 1901. DOI: 10.1080/14786440109462720   DOI