[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.3837/tiis.2021.10.002

Extracting and Clustering of Story Events from a Story Corpus

Yu, Hye-Yeon (Department of Electrical and Computer Engineering, Sungkyunkwan University)
Cheong, Yun-Gyung (Department of AI, Sungkyunkwan University)
Bae, Byung-Chull (School of Games, Hongik University)

Publication Information

KSII Transactions on Internet and Information Systems (TIIS) / v.15, no.10, 2021 , pp. 3498-3512 More about this Journal

Abstract

This article describes how events that make up text stories can be represented and extracted. We also address the results from our simple experiment on extracting and clustering events in terms of emotions, under the assumption that different emotional events can be associated with the classified clusters. Each emotion cluster is based on Plutchik's eight basic emotion model, and the attributes of the NLTK-VADER are used for the classification criterion. While comparisons of the results with human raters show less accuracy for certain emotion types, emotion types such as joy and sadness show relatively high accuracy. The evaluation results with NRC Word Emotion Association Lexicon (aka EmoLex) show high accuracy values (more than 90% accuracy in anger, disgust, fear, and surprise), though precision and recall values are relatively low.

Keywords

Event Representation; Emotional Event; Event Clustering; Event Extraction; Sentiment Analysis of Story Event;

Citations & Related Records

Reference

1	S. Bird, E. Klein, and E. Loper, Natural Language Processing with Python: Analyzing Text with the Natural Language Toolkit, Beijing: O'Reilly, 2009.
2	A. Tozzo, D. Jovanovic, and M. Amer, "Neural event extraction from movies description," in Proc. of the First Workshop on Storytelling, pp. 60-66, 2018.
3	B.-C. Bae, Y.-G. Cheong, and D. Vella, "Modeling foreshadowing in narrative comprehension for sentimental readers," in Proc. of Interactive Storytelling on Interactive Digital Storytelling, Cham: Springer International Publishing, pp. 1-12, 2013.
4	S. B. Chatman, Story and Discourse: Narrative Structure in Fiction and Film, Ithaca, NY: Cornell University Press, 1980.
5	K. Pichotta and R. J. Mooney, "Using sentence-level LSTM language models for script inference," in Proc. of the 54th Annual Meeting of the Association for Computational Linguistics, pp. 279-289, 2016.
6	G. Prince, A Dictionary of Narratology, Lincoln, NE: University of Nebraska Press, 2003.
7	Y. Kim, M. Kang, and S. R. Jeong, "Text mining and sentiment analysis for predicting box office success," KSII Trans. Internet Inf. Syst., vol. 12, no. 8, pp. 4090-4102, 2018. DOI
8	H.-Y. Yu, S. Park, Y.-G. Cheong, M.-H. Kim, and B.-C. Bae, "Emotion-based story event clustering," in Proc. of Interactive Storytelling on Interactive Digital Storytelling, Springer, pp. 348-353, 2019.
9	M. Bradley and P. J. Lang, "Measuring Emotion: The Self-Assessment Manikin and the Semantic Differential," J. Behav. Ther. Exp. Psychiatry, vol. 25, pp. 49-59, 1994. DOI
10	C. Manning, M. Surdeanu, J. Bauer, J. Finkel, S. Bethard, and D. McClosky, "The Stanford CoreNLP natural language processing toolkit," in Proc. of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pp. 55-60, 2014.
11	J. A. Russell, "A circumplex model of affect," Journal of personality and social psychology, 39(6), 1161-1178, 1980. DOI
12	P. Ekman, "An argument for basic emotions," Cogn. Emot., vol. 6, no. 3-4, pp. 169-200, 1992. DOI
13	Z. Hu, E. Rahimtoroghi, L. Munishkina, R. Swanson, and M. A. Walker, "Unsupervised induction of contingent event pairs from film scenes," in Proc. of the 2013 Conference on Empirical Methods in Natural Language Processing, pp. 369-379, 2013.
14	Del Corro, L. and Gemulla, R., "ClausIE: clause-based open information extraction," in Proc. of the 22nd international conference on World Wide Web (WWW '13). Association for Computing Machinery, New York, NY, USA, pp. 355-366, 2013.
15	M. A. Walker, G. I. Lin, and J. E. Sawyer, "An Annotated Corpus of Film Dialogue for Learning and Characterizing Character Style," in Proc. of the 8th Int. Conf. Lang. Resour. Eval., 1373-1378, 2012.
16	J. L. Fleiss, "Measuring nominal scale agreement among many raters," Psychol. Bull., vol. 76, no. 5, pp. 378-382, 1971. DOI
17	S. M. Mohammad and P. D. Turney, "Crowdsourcing a Word-Emotion Association Lexicon," Computational Intelligence, 29(3), 436-465, 2013. DOI
18	N. Chambers and D. Jurafsky, "Unsupervised learning of narrative event chains," in Proc. of the 46th Annual Meeting of the Association for Computational Linguistics, June 15-20, 2008.
19	J. A. Russell, "Core affect and the psychological construction of emotion," Psychol. Rev., vol. 110, no. 1, pp. 145-172, 2003. DOI
20	D. Bamman, B. O'Connor, and N. A. Smith, "Learning latent personas of film characters," in Proc. of the 51st Annual Meeting of the Association for Computational Linguistics, pp. 352-361, 2013.
21	M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. H. Witten, "The WEKA data mining software: An update," SIGKDD, vol. 11, no. 1, pp. 10-18, 2009. DOI
22	L. J. Martin et al., "Event Representations for Automated Story Generation with Deep Neural Nets," in Proc. of the Thirty-Second AAAI Conference on Artificial Intelligence, New Orleans, Louisiana, USA, February 2-7, 2018.
23	J. Song, K. T. Kim, B. Lee, S. Kim, and H. Y. Youn, "A novel classification approach based on Naive Bayes for Twitter sentiment analysis," KSII Trans. Internet Inf. Syst., vol. 11, no. 6, pp. 2996-3011, 2017. DOI
24	K. Oatley, "A taxonomy of the emotions of literary response and a theory of identification in fictional narrative," Poetics, vol. 23, no. 1-2, pp. 53-74, 1995. DOI
25	K. Pichotta and R. J. Mooney, "Learning statistical scripts with LSTM recurrent neural networks," in Proc. of the 30th AAAI Conf. Artif. Intell. AAAI 2016, pp. 2800-2806, 2016.
26	N. Chambers and D. Jurafsky, "Unsupervised learning of narrative schemas and their participants," in Proc. of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2, pp. 602-610, 2009.
27	N. Mostafazadeh et al., "A corpus and cloze evaluation for deeper understanding of commonsense stories," in Proc. of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 839-849, 2016.
28	R. Plutchik, "The nature of emotions," American Scientist, 89(4), 344-350, Jul 2001. DOI
29	K. Pichotta and R. Mooney, "Statistical script learning with multi-argument events," in Proc. of the 14th Conference of the European Chapter of the Association for Computational Linguistics, pp. 220-229, 2014.