http://dx.doi.org/10.13088/jiis.2014.20.2.123

A Methodology for Extracting Shopping-Related Keywords by Analyzing Internet Navigation Patterns  

Kim, Mingyu (Graduate School of Business IT, Kookmin University)
Kim, Namgyu (Graduate School of Business IT, Kookmin University)
Jung, Inhwan (Department of Computer Engineering, Hansung University)
Publication Information
Journal of Intelligence and Information Systems, v.20, no.2, 2014, pp. 123-136
Abstract
Online shopping has continued to grow as the Internet and a variety of smart mobile devices have become more prevalent. This growth has led to the creation of many Internet shopping malls. Consequently, competition among online retailers has become increasingly fierce, and many Internet shopping malls are making significant efforts to attract online users to their sites. One such effort is keyword marketing, whereby a retail site pays a fee to expose its link to potential customers when they enter a specific keyword on an Internet portal site. The price of each keyword is generally estimated from the keyword's frequency of appearance. However, it is widely accepted that keyword prices cannot be based solely on frequency, because many keywords appear frequently but have little relationship to shopping. This implies that it is unreasonable for an online shopping mall to spend a great deal on a keyword simply because people use it frequently. Therefore, from the perspective of shopping malls, a specialized process is required to extract meaningful keywords. Further, the demand for automating this extraction process is increasing because of the drive to improve online sales performance. In this study, we propose a methodology that automatically extracts only shopping-related keywords from the entire set of search keywords used on portal sites. We define a shopping-related keyword as a keyword that is used directly before shopping behavior. In other words, from the entire set of search keywords, we extract only those whose search results page leads directly to a shopping-related page. The rankings of the extracted keywords are then compared with the rankings of the entire set of search keywords. Two types of data are used in the experiment: web browsing history from July 1, 2012 to June 30, 2013, and site information.
The experimental dataset was obtained from a website ranking site and from the largest portal site in Korea. The original sample dataset contains 150 million transaction logs. First, portal sites are selected, and the search keywords used on those sites are extracted; this requires only simple parsing. The extracted keywords are then ranked by frequency. The experiment uses approximately 3.9 million search results from Korea's largest search portal site, from which a total of 344,822 search keywords were extracted. Next, using the web browsing history and site information, the shopping-related keywords were selected from the entire set of search keywords, yielding 4,709 shopping-related keywords. For performance evaluation, we compared the hit ratio of the entire set of search keywords with that of the shopping-related keywords. To do so, we extracted 80,298 search keywords from several Internet shopping malls and chose the top 1,000 as a set of true shopping keywords. We then measured the precision, recall, and F-score of the entire set of keywords and of the shopping-related keywords, where the F-score is the harmonic mean of precision and recall. The precision, recall, and F-score of the shopping-related keywords derived by the proposed methodology were higher than those of the entire set of keywords. This study thus proposes a scheme that obtains shopping-related keywords in a relatively simple manner: shopping-related keywords can be extracted simply by examining transactions whose next visit is a shopping mall. The resulting shopping-related keyword set is expected to be a useful asset for the many shopping malls that participate in keyword marketing. Moreover, the proposed methodology can readily be applied to constructing keyword sets for other specialized areas, not only shopping.
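The extraction and evaluation steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the host names, the `query` parameter name, and the representation of browsing history as time-ordered lists of URLs are all assumptions made for illustration, and the abstract does not specify how the candidate sets were sized when the hit ratios were computed.

```python
from collections import Counter
from urllib.parse import urlparse, parse_qs

# Hypothetical inputs: none of these names or values come from the paper.
PORTAL_HOSTS = {"search.portal.example"}           # assumed portal search host
SHOPPING_HOSTS = {"mall.example", "shop.example"}  # assumed "site information"
QUERY_PARAM = "query"                              # assumed search parameter name


def search_keyword(url):
    """Return the normalized search keyword of a portal search URL, else None."""
    parts = urlparse(url)
    if parts.hostname not in PORTAL_HOSTS:
        return None
    values = parse_qs(parts.query).get(QUERY_PARAM)
    return values[0].strip().lower() if values else None


def extract_keywords(sessions):
    """Count all search keywords, and those whose next visit is a shopping site."""
    all_kw, shop_kw = Counter(), Counter()
    for urls in sessions:  # each session is a time-ordered list of visited URLs
        for i, url in enumerate(urls):
            kw = search_keyword(url)
            if not kw:
                continue
            all_kw[kw] += 1
            next_is_shop = (
                i + 1 < len(urls)
                and urlparse(urls[i + 1]).hostname in SHOPPING_HOSTS
            )
            if next_is_shop:
                shop_kw[kw] += 1
    return all_kw, shop_kw


def precision_recall_f(candidates, truth):
    """Precision, recall, and F-score (harmonic mean) of a candidate keyword set."""
    candidates, truth = set(candidates), set(truth)
    hits = len(candidates & truth)
    precision = hits / len(candidates) if candidates else 0.0
    recall = hits / len(truth) if truth else 0.0
    f_score = 2 * precision * recall / (precision + recall) if hits else 0.0
    return precision, recall, f_score


# Toy browsing history (invented data, for illustration only).
sessions = [
    ["https://search.portal.example/s?query=running+shoes", "https://mall.example/item/1"],
    ["https://search.portal.example/s?query=weather", "https://news.example/today"],
    ["https://search.portal.example/s?query=Running+Shoes", "https://shop.example/p/9"],
]
all_kw, shop_kw = extract_keywords(sessions)
true_shopping = {"running shoes", "laptop"}  # stand-in for the top-1,000 set
print(all_kw.most_common())                       # frequency ranking of all keywords
print(precision_recall_f(all_kw, true_shopping))
print(precision_recall_f(shop_kw, true_shopping))
```

In this toy data, "weather" is a search keyword but is never followed by a shopping visit, so it is counted in the full set and excluded from the shopping-related set, which is the filtering effect the methodology relies on.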
Keywords
Internet Shopping; Keyword Evaluation; Keyword Marketing; Search Keyword Extraction