http://dx.doi.org/10.3745/KTCCS.2020.9.1.17

CNN Architecture Predicting Movie Rating from Audience's Reviews Written in Korean  

Kim, Hyungchan (School of Computer Science and Engineering, Korea University of Technology and Education)
Oh, Heung-Seon (School of Computer Science and Engineering, Korea University of Technology and Education)
Kim, Duksu (School of Computer Science and Engineering, Korea University of Technology and Education)
Publication Information
KIPS Transactions on Computer and Communication Systems, Vol.9, No.1, pp.17-24, 2020
Abstract
In this paper, we present a movie rating prediction architecture based on a convolutional neural network (CNN). Our prediction architecture extends TextCNN, a popular CNN-based architecture for sentence classification, in three aspects. First, character embeddings are used to cover the many variants of words, since reviews are short and often not linguistically well-formed. Second, an attention mechanism (squeeze-and-excitation) is adopted to focus on important features. Third, a scoring function is proposed to convert the output of an activation function into a review score within a fixed range (1-10). We evaluated our prediction architecture on a movie review dataset and achieved a lower MSE (3.3841) than an existing method, demonstrating the superiority of our movie rating prediction architecture.
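The following is a minimal sketch, in PyTorch, of the architecture described above: TextCNN over character embeddings, a squeeze-and-excitation step applied to the pooled convolution features, and a scoring head that maps the network output into the 1-10 rating range. Layer sizes, kernel widths, the class names, and the sigmoid-based scoring function are illustrative assumptions rather than the exact configuration used in the paper.

import torch
import torch.nn as nn
import torch.nn.functional as F

class SEBlock(nn.Module):
    # Squeeze-and-excitation style attention over the pooled filter outputs.
    def __init__(self, channels, reduction=4):
        super().__init__()
        self.fc1 = nn.Linear(channels, channels // reduction)
        self.fc2 = nn.Linear(channels // reduction, channels)

    def forward(self, x):                              # x: (batch, channels)
        s = F.relu(self.fc1(x))                        # squeeze to a bottleneck
        s = torch.sigmoid(self.fc2(s))                 # per-channel attention weights
        return x * s                                   # reweight the features

class CharTextCNNRating(nn.Module):
    def __init__(self, num_chars, emb_dim=64, num_filters=128, kernel_sizes=(2, 3, 4, 5)):
        super().__init__()
        self.embed = nn.Embedding(num_chars, emb_dim, padding_idx=0)
        self.convs = nn.ModuleList(
            [nn.Conv1d(emb_dim, num_filters, k) for k in kernel_sizes])
        self.se = SEBlock(num_filters * len(kernel_sizes))
        self.out = nn.Linear(num_filters * len(kernel_sizes), 1)

    def forward(self, char_ids):                       # char_ids: (batch, seq_len)
        x = self.embed(char_ids).transpose(1, 2)       # (batch, emb_dim, seq_len)
        # Convolution + max-over-time pooling for each kernel size (TextCNN).
        feats = [F.relu(conv(x)).max(dim=2).values for conv in self.convs]
        h = self.se(torch.cat(feats, dim=1))           # attention over pooled features
        # Scoring function: squash the activation into the 1-10 rating range.
        return 1.0 + 9.0 * torch.sigmoid(self.out(h)).squeeze(1)

# Training would minimize the MSE between predicted and true ratings, e.g.:
# loss = F.mse_loss(model(batch_char_ids), batch_ratings)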
Keywords
NLP; CNN; Movie Rating; Un-Normalized Text Data
References
1 A. Krizhevsky, I. Sutskever, and G. E. Hinton, "Imagenet classification with deep convolutional neural networks," Advances in Neural Information Processing Systems, pp.1097-1105, 2012.
2 O. Abdel-Hamid, A. Mohamed, H. Jiang, L. Deng, G. Penn, and D. Yu, "Convolutional neural networks for speech recognition," IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol.22, No.10, pp.1533-1545, 2014.
3 R. Collobert and J. Weston, "A unified architecture for natural language processing: Deep neural networks with multitask learning," In Proceedings of the 25th International Conference on Machine Learning, pp.160-167, 2008.
4 Y. Kim, "Convolutional Neural Networks for Sentence Classification," arXiv preprint arXiv:1408.5882, 2014.
5 K. Simonyan and A. Zisserman, "Very Deep Convolutional Networks for Large-Scale Image Recognition," arXiv preprint arXiv:1409.1556, 2014.
6 A. Conneau, H. Schwenk, L. Barrault, and Y. Lecun, "Very Deep Convolutional Networks for Text Classification," arXiv preprint arXiv:1606.01781, 2016.
7 H. T. Le, C. Cerisara, and A. Denis, "Do convolutional networks need to be deep for text classification?," arXiv preprint arXiv:1707.04108, 2017.
8 C. M. Chang, J. Cho, H. Liu, R. K. Wagner, H. Shu, A. Zhou, C. S. Cheuk, and A. Muse, "Changing models across cultures: Associations of phonological awareness and morphological structure awareness with vocabulary and word recognition in second graders from Beijing, Hong Kong, Korea, and the United States," Journal of Experimental Child Psychology, Vol.92, No.2, pp.140-160, 2005.
9 S. Petrov, D. Das, and R. McDonald, "A universal part-of-speech tagset," arXiv preprint arXiv:1104.2086, 2011.
10 Mecab-ko morphological analyzer [internet], http://eunjeon.blogspot.com/.
11 Twitter morphological analyzer [internet], https://openkoreantext.org.
12 KoNLPy benchmark performance among the libraries [internet], http://konlpy.org/en/latest/morph/#comparison-between-pos-tagging-classes.
13 J. Hu, L. Shen, S. Albanie, G. Sun, and E. Wu, "Squeeze-and-Excitation Networks," arXiv preprint arXiv:1709.01507, 2017.
14 A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L. Kaiser, and I. Polosukhin, "Attention Is All You Need," Advances in Neural Information Processing Systems, pp.5998-6008, 2017.
15 J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding," arXiv preprint arXiv:1810.04805, 2018.
16 X. Yu, A. Falenska, and N. T. Vu, "A general-purpose tagger with convolutional neural networks," arXiv preprint arXiv:1706.01723, 2017.
17 D. P. Kingma and J. Ba, "Adam: A Method for Stochastic Optimization," arXiv preprint arXiv:1412.6980, 2014.
18 Hannanum morphological analyzer [internet], http://semanticweb.kaist.ac.kr/research/hannanum/index.html
19 M. D. Zeiler, "Adadelta: an adaptive learning rate method," arXiv preprint arXiv:1212.5701, 2012.