• Title/Summary/Keyword: voice interface

The Effect of Interjection in Conversational Interaction with the AI Agent: In the Context of Self-Driving Car (인공지능 에이전트 대화형 인터랙션에서의 감탄사 효과: 자율주행 맥락에서)

  • Lee, Sooji;Seo, Jeeyoon;Choi, Junho
    • The Journal of the Convergence on Culture Technology / v.8 no.1 / pp.551-563 / 2022
  • This study examines how user experience is affected when an embodied agent in a self-driving car expresses emotion by using interjections. An experiment was designed with two factors: the inclusion of interjections in the agent's conversational feedback (with interjections vs. without interjections) and the type of conversation (task-oriented vs. social-oriented). The online experiment used four video clips of conversation scenarios as treatments and measured intimacy, likability, trust, social presence, perceived anthropomorphism, and future intention to use. The results showed a main effect of interjection use on social presence in both conversation types. In the task-oriented conversation, trust and future intention to use were higher when the agent did not use interjections than when it talked with emotional expressions. In the context of conversation with an AI agent in a self-driving car, adding emotional expression through interjections enhanced only social presence and had no effect on the other user experience factors.

Analysis of the utility of intelligent speakers in the Internet of Things environment (사물인터넷 환경에서 지능형 스피커의 활용성 분석)

  • Lee, Seong-Hoon;Lee, Dong-Woo
    • Journal of Internet of Things and Convergence / v.8 no.3 / pp.41-46 / 2022
  • The smart home in the Internet of Things (IoT) environment aims to provide an optimal living environment for users by connecting all devices in the home. In such an environment, artificial intelligence speakers are being used as a way to manage and control all devices. The speaker's function is changing from simple music playback to that of an interface that controls and manages all devices in the smart home space. This study examines the market status and usability of artificial intelligence speakers in the US and Korea, the leading countries in this field. The main target companies were Amazon, Google, and Apple in the US, and Kakao, SKT, and KT in Korea. In addition, based on the reactions of domestic users to artificial intelligence speakers, the study derives the major problems and describes directions for improvement.

Development of Electrical Sequence Control Safety Module Circuit Using Artificial Intelligence Controller (인공지능 컨트롤러를 이용한 전기 시퀀스 제어 안전 모듈 회로 개발)

  • Hong Yong Kim
    • Journal of the Society of Disaster Information / v.18 no.4 / pp.699-705 / 2022
  • Purpose: Sequence control is widely applied in manufacturing, distribution, construction, and automation in the medical industry. With the advance of the Fourth Industrial Revolution, artificial intelligence convergence technology is becoming an important factor in the control field. In particular, facilities in which microprocessors and artificial intelligence are fused with existing systems must be evaluated for safety and innovation, and reliable equipment must be developed; this study therefore aims to develop equipment for educational purposes and to drive the development of the field. Method: The self-developed all-in-one artificial intelligence controller module is a device that combines artificial intelligence capabilities with existing sequence and PLC control circuits. The performance evaluation items for this equipment were its ability to recognize motion, voice, text, color, and so on, and the stability and reliability of the circuit. Conclusion: After the sequence and PLC circuits were designed, the integrated artificial intelligence controller module satisfied all performance evaluation items, and no problems were found in the safety or reliability of the circuit.

Understanding how agent control based on social status affects user experience factors in multi-user autonomous driving environments (다중 사용자 자율 주행 운전 환경에서 사회적 지위에 따른 에이전트의 제어권이 사용자 경험 요소에 미치는 영향)

  • JiYeon Kim;JuHye Ha;ChangHoon Oh
    • The Journal of the Convergence on Culture Technology / v.9 no.1 / pp.735-745 / 2023
  • This study examines how agent control based on a driver's social status affects user experience factors in a multi-user self-driving vehicle environment. We conducted a user study in which participants viewed four scenarios (route changing/parking x accepting/declining a fellow passenger's command) and answered a survey, followed by a post-hoc interview. Results showed that the route-changing scenario and the scenario of accepting a passenger's command each had higher usefulness (convenience, effectiveness, efficiency) than their counterparts. Regardless of the car owner's social status, participants rated AI agents more positively when they met their goals effectively. They also stressed that vehicle owners should always be in control of their agents. This study can provide guidelines for designing future autonomous driving scenarios in which an agent interacts with a driver and passengers.

Applying Social Strategies for Breakdown Situations of Conversational Agents: A Case Study using Forewarning and Apology (대화형 에이전트의 오류 상황에서 사회적 전략 적용: 사전 양해와 사과를 이용한 사례 연구)

  • Lee, Yoomi;Park, Sunjeong;Suk, Hyeon-Jeong
    • Science of Emotion and Sensibility / v.21 no.1 / pp.59-70 / 2018
  • With breakthroughs in speech recognition technology, conversational agents have become pervasive through smartphones and smart speakers. The accuracy of speech recognition has reached a human level, but the technology still shows limitations in understanding the underlying meaning or intention of words and in understanding long conversations. Accordingly, users experience various errors when interacting with conversational agents, which may negatively affect the user experience. In addition, for smart speakers that use voice as the main interface, the lack of system feedback and transparency has been reported as the main issue for users. Therefore, there is a strong need for research on how users can better understand the capabilities of conversational agents and how negative emotions in error situations can be mitigated. In this study, we applied two social strategies, "forewarning" and "apology", to a conversational agent and investigated how these strategies affect users' perceptions of the agent in breakdown situations. For the study, we created a series of demo videos of a user interacting with a conversational agent. After watching the demo videos, participants evaluated how much they liked and trusted the agent through an online survey. Responses from a total of 104 respondents were analyzed, and the results were contrary to our expectations based on the literature review. Forewarning gave users a negative impression, especially of the agent's reliability, and apology in a breakdown situation did not affect users' perceptions. In the in-depth interviews that followed, participants explained that they perceived the smart speaker as a machine rather than a human-like object and that, for this reason, the social strategies did not work. These results show that social strategies should be applied according to the perceptions that users have of agents.

Handover Scheme between WiFi and Mobile WiMax (WiFi와 mobile WiMax간 핸드오버 방안)

  • Park, Seung-Kyun
    • The Journal of the Korea Contents Association / v.11 no.1 / pp.34-41 / 2011
  • At present, wireless internet access is available anytime and anywhere through the 3G network, mobile WiMAX, and WiFi. In an environment with such varied networks, users should be able to select specific networks depending on the situation, and mobility support is needed both between homogeneous networks and between heterogeneous networks. Given this situation, many proposals have been presented to link 3G, which has the largest service area among these networks, with mobile WiMAX (IEEE 802.16e) or with WiFi (IEEE 802.11). Recently, however, the service area of WiFi and mobile WiMAX has expanded rapidly as wireless internet use and data volume have grown with the advent of netbooks, e-books, and smartphones. In particular, the availability of real-time applications such as internet telephony has relatively shrunk the share of the 3G mobile communication network, which provides conventional voice service, and enlarged the share of wireless internet access networks such as WiFi and mobile WiMAX. This paper proposes a handover scheme based on PMIPv6 that supports mobility between WiFi and mobile WiMAX and minimizes handover delay. In this scheme, the mobile node has a dual-stack structure composed of two interfaces, WiFi and mobile WiMAX. Since WiFi does not support mobility, it is suggested that the mobile node be able to handle handover signaling between gateways in the case of handover between homogeneous networks. In an analytic evaluation against current handovers between homogeneous networks, the proposed scheme was shown to reduce handover, transmission, and signaling overhead.

The Implementation of a PC GUI for a Multimedia Tele-Medical System based on ATM / B-ISDN (ATM/B-ISDN 통신망 기반의 멀티미디어 원격의료 정보시스템을 위한 PC용 GUI 구현)

  • 정연기;김영탁
    • Journal of Korea Multimedia Society / v.1 no.1 / pp.45-55 / 1998
  • In a tele-medical system, a broadband network for multimedia telecommunication and multimedia terminal equipment for remote access to tele-medical information are essential. In particular, the tele-medical terminal equipment should provide a multimedia GUI environment so that the tele-medical system can support a similar medical process. In this paper, we present a multimedia GUI (Graphical User Interface) for a Multimedia Tele-Medical System (TeleMedi_GUI) based on ATM/B-ISDN. In the tele-medical system, one workstation serves as the multimedia data server and supports multiple client terminals connected through the ATM network. The client terminals are multimedia personal computers and provide a remote access environment for the tele-medical database. We also developed remote access protocols between the clients and the server for accessing the multimedia medical information on the multimedia server. Using TeleMedi_GUI, doctors can examine and treat patients efficiently with image data such as X-ray/CT and voice data such as the S-ray diagnosis. The results of this paper can be applied to the following areas: 1) the implementation of an advanced medical service system interconnecting small-scale health centers and general hospitals, and 2) the development of a fully computerized medical information system within a hospital.

The Audience Behavior-based Emotion Prediction Model for Personalized Service (고객 맞춤형 서비스를 위한 관객 행동 기반 감정예측모형)

  • Ryoo, Eun Chung;Ahn, Hyunchul;Kim, Jae Kyeong
    • Journal of Intelligence and Information Systems / v.19 no.2 / pp.73-85 / 2013
  • In today's information society, knowledge services that use information to create value are becoming increasingly important. In addition, with the development of IT, it has become easy to collect and use information, and many companies in a variety of industries actively use customer information for marketing. Since the start of the 21st century, companies have also actively used culture and the arts to manage their corporate image and to support marketing closely linked to their commercial interests. However, it is difficult for companies to attract or maintain consumers' interest through their technology. For that reason, performing cultural activities has become a trend as a tool of differentiation among many firms, and many firms have used the customer's experience as a new marketing strategy in order to respond effectively to a competitive market. Accordingly, the need is rapidly emerging for personalized services that provide people with a new experience based on personal profile information containing the characteristics of the individual. Personalized service using a customer's individual profile information, such as language, symbols, behavior, and emotions, is therefore very important today. Through it, we can assess the interaction between people and content and maximize the customer's experience and satisfaction. There are various related works that provide customer-centered services; in particular, emotion recognition research has been emerging recently. Existing studies have mostly performed emotion recognition using bio-signals, and most studies address voice and face, which show great emotional changes. However, predicting people's emotions remains difficult because of limitations of equipment and service environments. In this paper, we therefore develop an emotion prediction model based on a vision-based interface to overcome the existing limitations. Emotion recognition based on people's gestures and postures has been studied by several researchers. This paper develops a model that recognizes people's emotional states from body gesture and posture using the difference image method, and we identify the optimal validated model for predicting four kinds of emotions. The proposed model is intended to automatically determine and predict four human emotions (sadness, surprise, joy, and disgust). To build the model, an event booth was installed in the KOCCA lobby, and we showed visitors suitable stimulus movies to collect their body gestures and postures as their emotions changed. We then extracted body movements using the difference image method and refined the data to build the proposed model with a neural network. The proposed model used three types of time-frame sets (20 frames, 30 frames, and 40 frames), and we adopted the model with the best performance compared with the others. Before building the three kinds of models, the entire data set of 97 samples was divided into learning, test, and validation sets. The proposed model was constructed using an artificial neural network. We used the back-propagation algorithm as the learning method, with the learning rate set to 10% and the momentum rate set to 10%. The sigmoid function was used as the transfer function, and we designed a three-layer perceptron neural network with one hidden layer and four output nodes. Based on the test data set, learning was stopped when it reached 50,000 after reaching the minimum error, in order to explore the point of learning. Finally, we computed each model's accuracy and found the best model for predicting each emotion. The results showed prediction accuracies of 100% for sadness and 96% for joy in the 20-frame model, and 88% for surprise and 98% for disgust in the 30-frame model. The findings of our research are expected to be useful in providing an effective algorithm for personalized services in various industries such as advertising, exhibitions, and performances.
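
The difference image method named in the abstract above can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the grayscale list-of-lists frame format, and the threshold of 30 are all assumptions chosen for illustration. It marks a pixel as "moving" when its intensity changes by more than the threshold between two consecutive frames, then reports the fraction of moving pixels per frame pair.

```python
def difference_image(prev_frame, next_frame, threshold=30):
    """Binary motion mask: 1 where the absolute pixel change exceeds threshold.

    Frames are assumed (hypothetically) to be equal-sized 2D lists of
    grayscale intensities in the range 0-255.
    """
    return [
        [1 if abs(b - a) > threshold else 0 for a, b in zip(row_prev, row_next)]
        for row_prev, row_next in zip(prev_frame, next_frame)
    ]


def motion_amount(frames, threshold=30):
    """Fraction of changed pixels between each pair of consecutive frames."""
    amounts = []
    for prev_frame, next_frame in zip(frames, frames[1:]):
        mask = difference_image(prev_frame, next_frame, threshold)
        changed = sum(sum(row) for row in mask)
        total = sum(len(row) for row in mask)
        amounts.append(changed / total)
    return amounts


# Three tiny 2x2 frames: one pixel changes, then two more change.
frames = [
    [[10, 10], [10, 10]],
    [[10, 200], [10, 10]],
    [[10, 200], [200, 200]],
]
print(motion_amount(frames))  # [0.25, 0.5]
```

A sequence of such per-frame motion values over a 20-, 30-, or 40-frame window is one plausible kind of input feature for a classifier like the one the abstract describes, though the abstract does not specify the exact features used.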