Browse > Article
http://dx.doi.org/10.14695/KJSOS.2020.23.2.61

Greeting, Function, and Music: How Users Chat with Voice Assistants  

Wang, Ji (Graduate School of Comprehensive Human Sciences, University of Tsukuba)
Zhang, Han (Graduate School of Comprehensive Human Sciences, University of Tsukuba)
Zhang, Cen (Graduate School of Comprehensive Human Sciences, University of Tsukuba)
Xiao, Junjun (NetEase Hangzhou Network Co. Ltd)
Lee, Seung Hee (Faculty of Art and Design, University of Tsukuba)
Publication Information
Science of Emotion and Sensibility / v.23, no.2, 2020 , pp. 61-74 More about this Journal
Abstract
Voice user interface has become a commercially viable and extensive interaction mechanism with the development of voice assistants. Despite the popularity of voice assistants, the academic community does not utterly understand about what, when, and how users chat with them. Chatting with a voice assistant is crucial as it defines how a user will seek the help of the assistant in the future. This study aims to cover the essence and construct of conversational AI, to develop a classification method to deal with user utterances, and, most importantly, to understand about what, when, and how Chinese users chat with voice assistants. We collected user utterances from the real conventional database of a commercial voice assistant, NetEase Sing in China. We also identified different utterance categories on the basis of previous studies and real usage conditions and annotated the utterances with 17 labels. Furthermore, we found that the three top reasons for the usage of voice assistants in China are the following: (1) greeting, (2) function, and (3) music. Chinese users like to interact with voice assistants at night from 7 PM to 10 PM, and they are polite toward the assistants. The whole percentage of negative feedback utterances is less than 6%, which is considerably low. These findings appear to be useful in voice interaction designs for intelligent hardware.
Keywords
Voice Interaction Design; Chat; Chinese User; Intelligent Hardware;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 Akasaki, S., & Kaji, N. (2017). Chat detection in an intelligent assistant: Combining task-oriented and non-task-oriented spoken dialogue systems. arXiv preprint arXiv:1705.00746. DOI: 10.18653/v1/p17-1120.
2 Ammari, T., Kaye, J., Tsai, J. Y., & Bentley, F. (2019). Music, Search, and IoT: How People (Really) Use Voice Assistants. ACM Transactions on Computer-Human Interaction (TOCHI), 26(3), 1-28. DOI: 10.1145/3311956.
3 Austin, J. L. (1962). How to do things with words Oxford University Press. DOI: 10.1093/acprof.oso/9780198245537.001.0001.
4 Akasaki, S., & Kaji, N. (2017). Chat detection in an intelligent assistant: Combining task-oriented and non-task-oriented spoken dialogue systems. arXiv preprint arXiv:1705.00746. DOI: 10.18653/v1/p17-1120.
5 Canalys. (2019). China overtakes US in fast growing smart speaker market. https://www.canalys.com/newsroom/china-overtakes-us-in-fast-growing-smart-speaker-market. DOI: 10.31857/s0869-5873897745-754-12467.
6 Cheepen, C. (1988). The predictability of informal conversation. Pinter Pub Ltd. DOI: 10.2307/414637.
7 Cho, Yu Suk., Eom, Kimin., & Joo, Hyo Min. (2009). The effect of the human voice that is consistent with context and the mechanical meloy on user's subjective experience in mobile phones. In 2009 Korean Society for Emontion and Sensibility (pp.531-544). DOI: 10.31274.
8 Duquette, A., Michaud, F., & Mercier, H. (2008). Exploring the use of a mobile robot as an imitation agent with children with low-functioning autism. Autonomous Robots, 24(2), 147-157. DOI: 10.1007/s10514-007-9056-5.Ehrenbrink   DOI
9 P., Osman, S., & Moller, S. (2017, November). Google now is for the extraverted, Cortana for the introverted: investigating the influence of personality on IPA preference. In Proceedings of the 29th Australian Conference on Computer-Human Interaction (pp. 257-265). DOI: 10.1145/3152771.3152799.
10 Hoy, M. B. (2018). Alexa, Siri, Cortana, and more: an introduction to voice assistants. Medical Reference Services Quarterly, 37(1), 81-88. DOI: 10.1080/02763869.   DOI
11 Nass, C. I., & Brave, S. (2005). Wired for speech: How voice activates and advances the human-computer relationship (p. 9). Cambridge, MA: MIT press. DOI: 10.1162/coli.2006.32.3.451.
12 Moldovan, C., Rus, V., & Graesser, A. C. (2011). Automated Speech Act Classification For Online Chat. MAICS, 710, 23-29. DOI: 10.1007/978-3-642-67758-8_3.
13 Price, R. (2016). Microsoft is deleting its AI chatbot's incredibly racist tweets. Business Insider.
14 Volokhin, S., & Agichtein, E. (2018, March). Understanding music listening intents during daily activities with implications for contextual music recommendation. In Proceedings of the 2018 Conference on Human Information Interaction & Retrieval (pp. 313-316). DOI: 10.1145/3176349.3176885.
15 Sato-Shimokawara, E., Shinoda, Y., Takatani, T., Lee, H., Wada, K., & Yamaguchi, T. (2016, August). Analysis of category estimation for cloud based chat robot. In 2016 25th IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN) (pp. 308-311). IEEE. DOI: 10.1109/roman.2016.7745147.
16 Tay, B. T., Low, S. C., Ko, K. H., & Park, T. (2016). Types of humor that robots can play. Computers in Human Behavior, 60, 19-28. DOI: 10.1016/j.cnb.2011.08.011.   DOI
17 Verma Shubham (2018). Why so serious? Amazon's Alexa is 'laughing' at night and scaring users; here's what's happening. https://www.financialexpress.com/industry/technology/why-so-serious-amazons-alexa-is-laughing-at-night-and-scaring-users-heres-whats-happening/1091784/
18 Weizenbaum, J. (1966). ELIZA-a computer program for the study of natural language communication between man and machine. Communications of the ACM, 9(1), 36-45. DOI: 10.1145/365153.365168.   DOI
19 Yoo, Cho-Rong., Kim Song-Hyun., & Kim, Jin-Woo. (2020). A Comparative Study of the Use of Intelligent Personal Assistant Services Experiences: Siri, Google Assistant, Bixby. Science of Emotion and Sensibility, 23(1) 69-78. DOI: 10.14695/KJSOS.2020.23.1.69   DOI
20 Moon, Y., & Nass, C. (1996). How “real” are computer personalities? Psychological responses to personality types in human-computer interaction. Communication Research, 23(6), 651-674. DOI: 10.1177/009365096023006002.   DOI