• Title/Summary/Keyword: Conversation models

Search Result 32, Processing Time 0.024 seconds

Contextual Modeling in Context-Aware Conversation Systems

  • Quoc-Dai Luong Tran;Dinh-Hong Vu;Anh-Cuong Le;Ashwin Ittoo
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.17 no.5
    • /
    • pp.1396-1412
    • /
    • 2023
  • Conversation modeling is an important and challenging task in the field of natural language processing because it is a key component promoting the development of automated humanmachine conversation. Most recent research concerning conversation modeling focuses only on the current utterance (considered as the current question) to generate a response, and thus fails to capture the conversation's logic from its beginning. Some studies concatenate the current question with previous conversation sentences and use it as input for response generation. Another approach is to use an encoder to store all previous utterances. Each time a new question is encountered, the encoder is updated and used to generate the response. Our approach in this paper differs from previous studies in that we explicitly separate the encoding of the question from the encoding of its context. This results in different encoding models for the question and the context, capturing the specificity of each. In this way, we have access to the entire context when generating the response. To this end, we propose a deep neural network-based model, called the Context Model, to encode previous utterances' information and combine it with the current question. This approach satisfies the need for context information while keeping the different roles of the current question and its context separate while generating a response. We investigate two approaches for representing the context: Long short-term memory and Convolutional neural network. Experiments show that our Context Model outperforms a baseline model on both ConvAI2 Dataset and a collected dataset of conversational English.

Development of Deep Learning Models for Multi-class Sentiment Analysis (딥러닝 기반의 다범주 감성분석 모델 개발)

  • Syaekhoni, M. Alex;Seo, Sang Hyun;Kwon, Young S.
    • Journal of Information Technology Services
    • /
    • v.16 no.4
    • /
    • pp.149-160
    • /
    • 2017
  • Sentiment analysis is the process of determining whether a piece of document, text or conversation is positive, negative, neural or other emotion. Sentiment analysis has been applied for several real-world applications, such as chatbot. In the last five years, the practical use of the chatbot has been prevailing in many field of industry. In the chatbot applications, to recognize the user emotion, sentiment analysis must be performed in advance in order to understand the intent of speakers. The specific emotion is more than describing positive or negative sentences. In light of this context, we propose deep learning models for conducting multi-class sentiment analysis for identifying speaker's emotion which is categorized to be joy, fear, guilt, sad, shame, disgust, and anger. Thus, we develop convolutional neural network (CNN), long short term memory (LSTM), and multi-layer neural network models, as deep neural networks models, for detecting emotion in a sentence. In addition, word embedding process was also applied in our research. In our experiments, we have found that long short term memory (LSTM) model performs best compared to convolutional neural networks and multi-layer neural networks. Moreover, we also show the practical applicability of the deep learning models to the sentiment analysis for chatbot.

A Study on the Semantic Network Analysis for Exploring the Generative AI ChatGPT Paradigm in Tourism Section (관광분야 생성형 AI ChatGPT 패러다임 탐색을 위한 의미연결망 연구)

  • Han Jangheon
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.19 no.4
    • /
    • pp.87-96
    • /
    • 2023
  • ChatGPT, a leader in generative AI, can use natural expressions like humans based on large-scale language models (LLM). The ability to grasp the context of the language and provide more specific answers by algorithms is excellent. It also has high-quality conversation capabilities that have significantly developed from past Chatbot services to the level of human conversation. In addition, it is expected to change the operation method of the tourism industry and improve the service by utilizing ChatGPT, a generative AI in the tourism sector. This study was conducted to explore ChatGPT trends and paradigms in tourism. The results of the study are as follows. First, keywords such as tourism, utilization, creation, technology, service, travel, holding, education, development, news, digital, future, and chatbot were widespread. Second, unlike other keywords, service, education, and Mokpo City data confirmed the results of a high degree of centrality. Third, due to CONCOR analysis, eight keyword clusters highly relevant to ChatGPT in the tourism sector emerged.

A study on Korean multi-turn response generation using generative and retrieval model (생성 모델과 검색 모델을 이용한 한국어 멀티턴 응답 생성 연구)

  • Lee, Hodong;Lee, Jongmin;Seo, Jaehyung;Jang, Yoonna;Lim, Heuiseok
    • Journal of the Korea Convergence Society
    • /
    • v.13 no.1
    • /
    • pp.13-21
    • /
    • 2022
  • Recent deep learning-based research shows excellent performance in most natural language processing (NLP) fields with pre-trained language models. In particular, the auto-encoder-based language model proves its excellent performance and usefulness in various fields of Korean language understanding. However, the decoder-based Korean generative model even suffers from generating simple sentences. Also, there is few detailed research and data for the field of conversation where generative models are most commonly utilized. Therefore, this paper constructs multi-turn dialogue data for a Korean generative model. In addition, we compare and analyze the performance by improving the dialogue ability of the generative model through transfer learning. In addition, we propose a method of supplementing the insufficient dialogue generation ability of the model by extracting recommended response candidates from external knowledge information through a retrival model.

Demand Analysis for Community-based Tourism Using Count Data Models (가산자료모형을 이용한 지역사회기반형 관광수요 분석)

  • Yun, Hee-Jeong
    • The Korean Journal of Community Living Science
    • /
    • v.22 no.2
    • /
    • pp.247-255
    • /
    • 2011
  • This study analyzed the demand for a community-based tourism site using a poisson model, a negative binominal model, a truncated poisson model and a truncated negative binominal model as count data models. For these reasons, questionnaire surveys were conducted into 5 community-based tourism sites in Chuncheon city with 406 tourists, and was analyzed using the STATA program. The fitness levels of four models were significant(p=0.0000) using a likelihood ratio test. The study results suggest that the demand of community-based tourism sites for visiting tourists was influenced by a pre-visiting experience, recognition of sustainable tourism, visitation of downtown, purchase of souvenir or farm produce, conversation with regional residents, regional harmony, preservation of natural resources and sex within the poisson and truncated poisson models. However, the variables of visitation of downtown, preservation of natural resources and sex were not significant within the negative binominal model and the visitation of downtown and preservation of natural resources were not significant within the truncated negative binominal model. The results of the visiting demand of community-based tourism sites can provide information for sustainable regional development strategies.

A Multi-agent Architecture for Coordination of Supply Chains with Local Information Sharing (지역적 정보 공유를 활용하는 멀티 에이전트 시스템 기반의 공급사슬 관리 아키텍쳐)

  • Ahn, Hyung-Jun;Park, Sung-Joo
    • Asia pacific journal of information systems
    • /
    • v.14 no.4
    • /
    • pp.49-70
    • /
    • 2004
  • Multi-agent technology is being regarded as one of the promising technologies for today's supply chain management because of its desirable features such as autonomy, intelligence, and collaboration. This paper suggests a multi-agent system architecture with which companies can improve the efficiency of their supply chains by collaborative operation. Reflecting the practical difficulties of collaboration in complex supply chains, the architecture allows agent systems to share information with only neighboring companies for the coordinated operation. The suggested architecture is elaborated with a collaboration model based on Petri-net, conversation models for communication, and internal behavior models of each agent. A simulation experiment was performed for the evaluation of the suggested architecture. The result implies that when the estimation of market demand is higher than a certain level, the suggested architecture can be beneficial.

Public Diplomacy, Propaganda, or What? China's Communication Practices in the South China Sea Dispute on Twitter

  • Nip, Joyce Y.M.;Sun, Chao
    • Journal of Public Diplomacy
    • /
    • v.2 no.1
    • /
    • pp.43-68
    • /
    • 2022
  • Multiple modes of communication on social media can contribute to public diplomacy in informing, conversing, and networking with members of foreign publics. However, manipulative behaviours on social media, prevalent especially in high tension contexts, create disruptions to authentic communication in what could be grey/black propaganda or information warfare. This study reviews existing literature about models of public diplomacy to guide an empirical study of China's communication in the #SouthChinaSea conversation on Twitter. It uses computational methods to identify, record, and analyze one-way, two-way, and network communication of China's actors. It employs manual qualitative research to determine the nature of China's actors. On that basis, it assesses China's Twitter communication in the issue against various models of public diplomacy.

Re-visitation Choice Impacts of Consideration on Sustainable Tourism Development - Using Logit and Probit Models - (지속가능한 관광개발 의식이 지역 재방문 선택에 미치는 영향 - 로짓모형과 프로빗모형을 활용하여 -)

  • Shin, Sang-Hyun;Yun, Hee-Jeong
    • Journal of Korean Society of Rural Planning
    • /
    • v.17 no.1
    • /
    • pp.59-65
    • /
    • 2011
  • Re-visitation have an effect on dependent variables of regional tourism demand model. This study focused on the re-visitation impacts of consideration on sustainable tourism development of tourists as a new factors of tourism. Based on literature reviews, 11 variables were selected, a questionnaire survey was given to 406 tourists divided into 5 tourism sites at Chuncheon city, and logit model and probit model were used for analysis. The fitness levels of two models were very significant(p=0.0000). The study results suggest that the likelihood of the rural tourist to make a return visit is influenced by recognition of sustainable tourism, purchase of souvenir and farm produce, visitation of regional shops, conversation with regional residents, residents' participation on development, age and marriage. The results of such re-visitation demand can provide information for regional development strategies. The approach to re-visitation research impacts of consideration on sustainable tourism development is expected to become a useful foundation in studying on sustainable regional development.

Trends in Deep-neural-network-based Dialogue Systems (심층 신경망 기반 대화처리 기술 동향)

  • Kwon, O.W.;Hong, T.G.;Huang, J.X.;Roh, Y.H.;Choi, S.K.;Kim, H.Y.;Kim, Y.K.;Lee, Y.K.
    • Electronics and Telecommunications Trends
    • /
    • v.34 no.4
    • /
    • pp.55-64
    • /
    • 2019
  • In this study, we introduce trends in neural-network-based deep learning research applied to dialogue systems. Recently, end-to-end trainable goal-oriented dialogue systems using long short-term memory, sequence-to-sequence models, among others, have been studied to overcome the difficulties of domain adaptation and error recognition and recovery in traditional pipeline goal-oriented dialogue systems. In addition, some research has been conducted on applying reinforcement learning to end-to-end trainable goal-oriented dialogue systems to learn dialogue strategies that do not appear in training corpora. Recent neural network models for end-to-end trainable chit-chat systems have been improved using dialogue context as well as personal and topic information to produce a more natural human conversation. Unlike previous studies that have applied different approaches to goal-oriented dialogue systems and chit-chat systems respectively, recent studies have attempted to apply end-to-end trainable approaches based on deep neural networks in common to them. Acquiring dialogue corpora for training is now necessary. Therefore, future research will focus on easily and cheaply acquiring dialogue corpora and training with small annotated dialogue corpora and/or large raw dialogues.

Error Analysis of Recent Conversational Agent-based Commercialization Education Platform (최신 대화형 에이전트 기반 상용화 교육 플랫폼 오류 분석)

  • Lee, Seungjun;Park, Chanjun;Seo, Jaehyung;Lim, Heuiseok
    • Journal of the Korea Convergence Society
    • /
    • v.13 no.3
    • /
    • pp.11-22
    • /
    • 2022
  • Recently, research and development using various Artificial Intelligence (AI) technologies are being conducted in the field of education. Among the AI in Education (AIEd), conversational agents are not limited by time and space, and can learn more effectively by combining them with various AI technologies such as voice recognition and translation. This paper conducted a trend analysis on platforms that have a large number of users and used conversational agents for English learning among commercialized application. Currently commercialized educational platforms using conversational agent through trend analysis has several limitations and problems. To analyze specific problems and limitations, a comparative experiment was conducted with the latest pre-trained large-capacity dialogue model. Sensibleness and Specificity Average (SSA) human evaluation was conducted to evaluate conversational human-likeness. Based on the experiment, this paper propose the need for trained with large-capacity parameters dialogue models, educational data, and information retrieval functions for effective English conversation learning.