Browse > Article
http://dx.doi.org/10.13088/jiis.2019.25.1.109

Color-related Query Processing for Intelligent E-Commerce Search  

Hong, Jung A (School of Business, Hanyang University)
Koo, Kyo Jung (School of Finance, Hanyang University)
Cha, Ji Won (School of Business, Hanyang University)
Seo, Ah Jeong (School of Computer Software, Hanyang University)
Yeo, Un Yeong (School of Business Informatics, Hanyang University)
Kim, Jong Woo (School of Business, Hanyang University)
Publication Information
Journal of Intelligence and Information Systems / v.25, no.1, 2019 , pp. 109-125 More about this Journal
Abstract
As interest on intelligent search engines increases, various studies have been conducted to extract and utilize the features related to products intelligencely. In particular, when users search for goods in e-commerce search engines, the 'color' of a product is an important feature that describes the product. Therefore, it is necessary to deal with the synonyms of color terms in order to produce accurate results to user's color-related queries. Previous studies have suggested dictionary-based approach to process synonyms for color features. However, the dictionary-based approach has a limitation that it cannot handle unregistered color-related terms in user queries. In order to overcome the limitation of the conventional methods, this research proposes a model which extracts RGB values from an internet search engine in real time, and outputs similar color names based on designated color information. At first, a color term dictionary was constructed which includes color names and R, G, B values of each color from Korean color standard digital palette program and the Wikipedia color list for the basic color search. The dictionary has been made more robust by adding 138 color names converted from English color names to foreign words in Korean, and with corresponding RGB values. Therefore, the fininal color dictionary includes a total of 671 color names and corresponding RGB values. The method proposed in this research starts by searching for a specific color which a user searched for. Then, the presence of the searched color in the built-in color dictionary is checked. If there exists the color in the dictionary, the RGB values of the color in the dictioanry are used as reference values of the retrieved color. If the searched color does not exist in the dictionary, the top-5 Google image search results of the searched color are crawled and average RGB values are extracted in certain middle area of each image. To extract the RGB values in images, a variety of different ways was attempted since there are limits to simply obtain the average of the RGB values of the center area of images. As a result, clustering RGB values in image's certain area and making average value of the cluster with the highest density as the reference values showed the best performance. Based on the reference RGB values of the searched color, the RGB values of all the colors in the color dictionary constructed aforetime are compared. Then a color list is created with colors within the range of ${\pm}50$ for each R value, G value, and B value. Finally, using the Euclidean distance between the above results and the reference RGB values of the searched color, the color with the highest similarity from up to five colors becomes the final outcome. In order to evaluate the usefulness of the proposed method, we performed an experiment. In the experiment, 300 color names and corresponding color RGB values by the questionnaires were obtained. They are used to compare the RGB values obtained from four different methods including the proposed method. The average euclidean distance of CIE-Lab using our method was about 13.85, which showed a relatively low distance compared to 3088 for the case using synonym dictionary only and 30.38 for the case using the dictionary with Korean synonym website WordNet. The case which didn't use clustering method of the proposed method showed 13.88 of average euclidean distance, which implies the DBSCAN clustering of the proposed method can reduce the Euclidean distance. This research suggests a new color synonym processing method based on RGB values that combines the dictionary method with the real time synonym processing method for new color names. This method enables to get rid of the limit of the dictionary-based approach which is a conventional synonym processing method. This research can contribute to improve the intelligence of e-commerce search systems especially on the color searching feature.
Keywords
E-Commerce; Color; Image Crawling; RGB; Synonym;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Hwang M.-N., Lee S., Cho M., Kim S.-Y., Choi S.-P., Jung H. (2012). Ontology Construction of Technological Knowledge for R&D Trend Analysis, Journal of the Korea Contents Association, 12(12), 35-45   DOI
2 Kim S., Kim G. (2012). Ontology-based User Customized Search Service Considering User Intention, Journal of Intelligence and Information Systems, 18(4), 129-143   DOI
3 Kim T., Yang J., Lee J., Son J., Jeong Y. (2005). Efficient production of Ontology for Intelligent E-Commerce. Journal of Intelligence and Information Systems, 273-279.
4 Lei Y., Uren V., Motta E. (2006) SemSearch: A Search Engine for the Semantic Web. In: Staab S., Svatek V. (eds) Managing Knowledge in a World of Networks. EKAW 2006. Lecture Notes in Computer Science, vol 4248. Springer, Berlin, Heidelberg.
5 Lin S., Hanrahan P. 2013. Modeling how people extract color themes from images. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '13). ACM, New York, NY, USA, 3101-3110. DOI: https://doi.org/10.1145/2470654.2466424   DOI
6 Mahama S., A. T., Dossa, A. S., Gouton, P. (2016). Choice of distance metrics for RGB color image analysis. Electronic Imaging, 2016(20), 1-4.   DOI
7 Naver Corp. (2007). Patent No.10-2007-0115690. Seoul: Republic of Korea Patent and Trademark Office
8 Qu, Y., Cheng, G. (2011). Falcons concept search: A practical search engine for web ontologies. IEEE Transactions on Systems, Man, and Cybernetics-Part A: Systems and Humans, 41(4), 810-816.   DOI
9 Rose D., Levinson D. 2004. Understanding user goals in web search. In Proceedings of the 13th international conference on World Wide Web (WWW '04). ACM, New York, NY, USA, 13-19.
10 Sudeepthi, G., Anuradha, G., Babu, M. S. P. (2012). A survey on semantic web search engine. International Journal of Computer Science Issues (IJCSI), 9(2), 241.
11 Tran T., Cimiano P., Rudolph S., Studer R. (2007) Ontology-Based Interpretation of Keywords for Semantic Search. In: Aberer K. et al. (eds) The Semantic Web. ISWC 2007, ASWC 2007. Lecture Notes in ComputerScience, vol 4825. Springer, Berlin, Heidelberg.
12 Turban E., Outland J., King D., Lee J.K., Liang TP., Turban D.C. (2018) Intelligent (Smart) E-Commerce. In: Electronic Commerce 2018. Springer Texts in Business and Economics. Springer, Cham.
13 Wikipedia color name chart. https://ko.wikipedia.org/wiki/%EC%83%89_%EB%AA%A9%EB%A1%9D (accessed September 2018).
14 Woo S., Kim K., Kim C. (2005). User Category - Based Intelligent E-Commerce Meta - Search Engine. Journal of Intelligence and Information Systems, 346-355.
15 WordNet. http://www.wordnet.co.kr/ (accessed September 2018).
16 Cho Y., Kim Y. (2011). Color Expression by Information Extraction. Proceedings of KIIT Summer Conference, 618-620.
17 Apple Inc. (2015). Patent No.10-2015-7004968. Washington, DC: U.S. Patent and Trademark Office
18 Apple Inc. (2017). Patent No.10-2017-0069606. Washington, DC: U.S. Patent and Trademark Office
19 Cao Y., Ju T., Xu J., Hu S‐M. (2016). Extracting Sharp Features from RGB‐D Images. Computer Graphics Forum. 36. 10.1111/cgf.13069.   DOI
20 Chakraborty S., Nagwani N-K., Dey L. (2014). Performance comparison of incremental k-means and incremental dbscan algorithms. arXiv preprint arXiv:1406.4751.
21 ClickZ Intelligence. "Seven Ways Artificial Intelligence Can Be Used for Marketing." ClickZ, May 31, 2013.clickz.com/seven-ways-artificial-intelligence-can-be-used-for-marketing/96572,(accessed September 2018).
22 Davis, B. "15 Examples of Artificial Intelligence in Marketing." Econsultancy, April 19, 2016. econsultancy.com/blog/67745-15-examples-of-artificial-intelligence-in-marketing (accessed September 2018).
23 Ester M., Kriegel H. P., Sander J., Xu X. (1996, August). A density-based algorithm for discovering clusters in large spatial databases with noise. In Kdd (Vol. 96, No. 34, pp. 226-231).
24 Google Image Search https://www.google.co.kr/imghp?hl=ko&tab=wi&authuser=0 (accessed September 2018).
25 Google LLC. (2017). U.S. Patent No.10-2017-7031186. Washington, DC: U.S. Patent and Trademark Office
26 Gomez-Perez A., Fernandez-Lopez M., Corcho O. (2006) Ontological Engineering: With Examples From the Areas of Knowledge Management, e-Commerce and the Semantic Web. Springer, London. https://doi.org/10.1007/b97353.
27 HTML chart. https://html-color-codes.info/ (accessed September 2018).