http://dx.doi.org/10.13088/jiis.2021.27.3.139

A Two-Stage Learning Method of CNN and K-means RGB Cluster for Sentiment Classification of Images  

Kim, Jeongtae (Department of Mathematics, The Catholic University of Korea)
Park, Eunbi (Department of Mathematics, The Catholic University of Korea)
Han, Kiwoong (Department of Industrial Management Engineering, Korea University)
Lee, Junghyun (Department of Information & Industrial Engineering, The Catholic University of Korea)
Lee, Hong Joo (Department of Business Administration, The Catholic University of Korea)
Publication Information
Journal of Intelligence and Information Systems / v.27, no.3, 2021, pp. 139-156
Abstract
The main reason for using a deep learning model in image classification is that it can extract features from each region of an image and take the relationships between regions into account. However, a CNN model may be poorly suited to emotional image data, where such regional features are often absent. To address the difficulty of classifying emotion images, researchers propose new CNN-based architectures tailored to emotion images every year. Studies on the relationship between color and human emotion have also shown that different colors induce different emotions. Among deep learning studies, some have applied color information to image emotion classification, and adding an image's color information improves classification accuracy compared with training the model on the image alone. This study proposes two methods that increase accuracy by adjusting the result value after the model has classified an image's emotion. Both methods modify the result value using statistics based on the colors in the picture. The most frequent two-color combinations are first found across all training data; at test time, the most frequent two-color combination in each test image is found, and the result value is corrected according to how that combination is distributed in the training data. The correction weights the model's output using expressions based on the log function and the exponential function. Two image datasets were used: Emotion6, which is labeled with six emotions, and Artphoto, which is labeled with eight categories. The DenseNet169, MnasNet, ResNet101, ResNet152, and VGG19 architectures were used as CNN models, and performance was compared before and after applying the two-stage learning to each model.
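As a rough illustration of the correction step, the sketch below rescales CNN class probabilities by a log/exp expression of how often the test image's two-color combination appears in each class of the training data. The formula, the `reweight` function, the `alpha` parameter, and all numbers are assumptions made for illustration; the abstract does not state the study's actual equations.

```python
import math


def reweight(probs, combo_share, alpha=1.0):
    """Rescale CNN class probabilities by a log-based color prior.

    probs:       CNN softmax outputs, one per emotion class
    combo_share: per class, the share of training images of that class whose
                 dominant two-color combination matches the test image
    alpha:       hypothetical strength parameter (0 leaves probs unchanged)
    """
    # exp(alpha * log(1 + share)) >= 1, so a frequent combination boosts a class.
    scores = [p * math.exp(alpha * math.log1p(s))
              for p, s in zip(probs, combo_share)]
    total = sum(scores)
    return [s / total for s in scores]  # renormalize so the scores sum to 1


cnn_probs = [0.40, 0.35, 0.25]  # made-up CNN output for three classes
share = [0.05, 0.30, 0.10]      # made-up color-combination shares
adjusted = reweight(cnn_probs, share)
print(adjusted.index(max(adjusted)))  # index of the class chosen after correction
```

With these made-up inputs the CNN alone would pick class 0, while the corrected scores pick class 1, because the image's color combination is far more common among training images of class 1.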
Inspired by color psychology, which deals with the relationship between colors and emotions, we studied how to improve the accuracy of an image sentiment classifier by modifying its result values based on color. Sixteen colors were used: red, orange, yellow, green, blue, indigo, purple, turquoise, pink, magenta, brown, gray, silver, gold, white, and black, each of which carries its own meaning. Using Scikit-learn's K-means clustering, the seven colors that dominate an image are identified. The RGB coordinates of these colors are then compared with the RGB coordinates of the 16 reference colors, and each extracted color is converted to the closest reference color. If combinations of three or more colors are used, too many distinct combinations occur and the distribution becomes scattered, so each combination has little influence on the result value. To avoid this problem, two-color combinations were used to weight the model's output. Before training, the most frequent color combination was found for every training image, and the distribution of color combinations for each class was stored in a Python dictionary for use during testing. During testing, the most frequent two-color combination in each test image is found; we then check how that combination is distributed in the training data and correct the result accordingly. We devised several equations to weight the model's result value based on the extracted colors. The dataset was randomly split 80:20, with 20% held out as a test set. The remaining 80% was divided into five folds for 5-fold cross-validation, so the model was trained five times, each time with a different validation fold. Finally, performance was measured on the held-out test set.
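The color-extraction steps described above could be sketched roughly as follows, using scikit-learn's `KMeans` and NumPy. The RGB anchor values in `PALETTE` are common web-color values chosen here as an assumption (the study's reference values are not given in this abstract), and the helper names, the synthetic images, and the pairing of images with the labels "joy" and "sadness" are hypothetical.

```python
from collections import Counter, defaultdict

import numpy as np
from sklearn.cluster import KMeans

# Assumed RGB anchors for the 16 named colors (common web-color values).
PALETTE = {
    "red": (255, 0, 0), "orange": (255, 165, 0), "yellow": (255, 255, 0),
    "green": (0, 128, 0), "blue": (0, 0, 255), "indigo": (75, 0, 130),
    "purple": (128, 0, 128), "turquoise": (64, 224, 208),
    "pink": (255, 192, 203), "magenta": (255, 0, 255), "brown": (165, 42, 42),
    "gray": (128, 128, 128), "silver": (192, 192, 192), "gold": (255, 215, 0),
    "white": (255, 255, 255), "black": (0, 0, 0),
}
NAMES = list(PALETTE)
ANCHORS = np.array([PALETTE[n] for n in NAMES], dtype=float)


def nearest_color(rgb):
    """Return the palette name closest to rgb in Euclidean RGB distance."""
    distances = np.linalg.norm(ANCHORS - np.asarray(rgb, dtype=float), axis=1)
    return NAMES[int(np.argmin(distances))]


def top_two_colors(image, n_clusters=7, seed=0):
    """Cluster pixels into n_clusters, name each centroid, and return the two
    named colors covering the most pixels as a sorted tuple."""
    pixels = np.asarray(image, dtype=float).reshape(-1, 3)
    km = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed).fit(pixels)
    coverage = Counter()
    for label, count in zip(*np.unique(km.labels_, return_counts=True)):
        coverage[nearest_color(km.cluster_centers_[label])] += int(count)
    return tuple(sorted(name for name, _ in coverage.most_common(2)))


def combo_distribution(images, labels):
    """Map each class label to the relative frequency of its color combinations."""
    counts = defaultdict(Counter)
    for image, label in zip(images, labels):
        counts[label][top_two_colors(image)] += 1
    return {
        label: {combo: n / sum(c.values()) for combo, n in c.items()}
        for label, c in counts.items()
    }


def synthetic_image(rng, parts):
    """Build an (N, 3) pixel array from (rgb, pixel_count) parts plus small noise."""
    base = np.vstack([np.tile(rgb, (n, 1)) for rgb, n in parts]).astype(float)
    noise = rng.integers(-10, 11, size=base.shape)
    return np.clip(base + noise, 0, 255)


rng = np.random.default_rng(0)
warm = synthetic_image(rng, [((255, 0, 0), 240), ((255, 165, 0), 120), ((255, 255, 255), 40)])
cool = synthetic_image(rng, [((0, 0, 255), 240), ((64, 224, 208), 120), ((0, 0, 0), 40)])

dist = combo_distribution([warm, cool], ["joy", "sadness"])
print(dist)
```

Sorting the two color names makes ("red", "orange") and ("orange", "red") count as the same combination, which keeps the per-class dictionary from fragmenting further.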
Adam was used as the optimizer, and the learning rate was set to 0.01. Training ran for up to 20 epochs and was stopped early if the validation loss did not decrease for five consecutive epochs. With early stopping, the model with the best validation loss was loaded. Classification accuracy was higher when the information extracted from color properties was used together with the CNN architecture than when the CNN architecture was used alone.
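The evaluation protocol and stopping rule can be sketched as below, assuming scikit-learn's `train_test_split` and `KFold` for the 80:20 split and the five folds. The abstract does not name a deep learning framework, so the optimizer call (Adam, learning rate 0.01) and the actual model training are left out; the function `train_with_early_stopping` and the loss curve are hypothetical stand-ins.

```python
import numpy as np
from sklearn.model_selection import KFold, train_test_split


def train_with_early_stopping(val_losses, max_epochs=20, patience=5):
    """Walk through per-epoch validation losses and stop once the loss has
    not improved for `patience` consecutive epochs.

    Returns (best_epoch, best_loss, epochs_run). A real loop would also save
    the model weights at best_epoch and reload them at the end.
    """
    best_loss, best_epoch, stale, epochs_run = float("inf"), -1, 0, 0
    for epoch, loss in enumerate(val_losses[:max_epochs]):
        epochs_run = epoch + 1
        if loss < best_loss:
            best_loss, best_epoch, stale = loss, epoch, 0
        else:
            stale += 1
            if stale >= patience:
                break
    return best_epoch, best_loss, epochs_run


indices = np.arange(100)  # stand-in for 100 image indices
train_val_idx, test_idx = train_test_split(indices, test_size=0.2, random_state=0)

folds = KFold(n_splits=5, shuffle=True, random_state=0)
for fold, (train_pos, val_pos) in enumerate(folds.split(train_val_idx)):
    train_idx, val_idx = train_val_idx[train_pos], train_val_idx[val_pos]
    # Made-up loss curve: improves for four epochs, then plateaus.
    losses = [1.0, 0.8, 0.7, 0.65] + [0.66] * 16
    best_epoch, best_loss, epochs_run = train_with_early_stopping(losses)
    print(fold, len(train_idx), len(val_idx), best_epoch, epochs_run)
```

With the made-up curve, the best validation loss occurs at epoch index 3, and training stops after nine epochs, once five consecutive epochs have failed to improve on it.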
Keywords
Sentiment Analysis of Image; Sense of Color; CNN; Two-stage learning