• Title/Summary/Keyword: Text analysis

Search Result 3,350, Processing Time 0.029 seconds

Identifying Social Relationships using Text Analysis for Social Chatbots (소셜챗봇 구축에 필요한 관계성 추론을 위한 텍스트마이닝 방법)

  • Kim, Jeonghun;Kwon, Ohbyung
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.4
    • /
    • pp.85-110
    • /
    • 2018
  • A chatbot is an interactive assistant that utilizes many communication modes: voice, images, video, or text. It is an artificial intelligence-based application that responds to users' needs or solves problems during user-friendly conversation. However, the current version of the chatbot is focused on understanding and performing tasks requested by the user; its ability to generate personalized conversation suitable for relationship-building is limited. Recognizing the need to build a relationship and making suitable conversation is more important for social chatbots who require social skills similar to those of problem-solving chatbots like the intelligent personal assistant. The purpose of this study is to propose a text analysis method that evaluates relationships between chatbots and users based on content input by the user and adapted to the communication situation, enabling the chatbot to conduct suitable conversations. To evaluate the performance of this method, we examined learning and verified the results using actual SNS conversation records. The results of the analysis will aid in implementation of the social chatbot, as this method yields excellent results even when the private profile information of the user is excluded for privacy reasons.

Rural Tourism Image and Major Activity Space in Gochang County Shown in Social Data - Focusing on the Keyword 'Gochang-gun Travel' - (소셜데이터에 나타난 고창군의 농촌관광 이미지와 주요 활동공간 - '고창군 여행' 키워드를 중심으로 -)

  • Kim, Young-Jin;Son, Gwangryul;Lee, Dongchae;Son, Yong-hoon
    • Journal of Korean Society of Rural Planning
    • /
    • v.27 no.3
    • /
    • pp.103-116
    • /
    • 2021
  • In this study, the characteristics of rural tourism image perceived by urban residents were analyzed through text analysis of blog data. In order to examine the images related to rural tourism, blog data written with the keyword "Gochang-gun travel" was used. LDA topic analysis, one of the text mining techniques, was used for the analysis. In the tourism image of Gochang-gun, 9 topics were derived, and 112 major places appeared. This was divided into 3 main activities and 5 object spaces through the review of keywords and the original text of blog data. As a result of the analysis, the traditional main resources of the region, Seonun mountain, Seonun temple, and Gochang-eup fortress, formed topic. On the other hand, world heritage such as dolmen and Ungok wetland did not appear as topic. In particular, the farms operated by the private sector form individual topics, and the theme farm can be seen as an important resource for tourism in Gochang-gun. Also, through the distribution of place keywords, it was possible to understand the characteristics of travel by region and the usage behavior of visitors. In the case of Gochang-gun, there was a phenomenon in which visitors were biased by region. This seems to be the result of Gochang-gun seeking to vitalize local tourism focusing on natural, ecological, and scenic resources. It is necessary to establish a plan for balanced regional development and develop other types of tourism resources. This study is different in that it identified the types and characteristics of rural tourism images in the region perceived by visitors, and the status of tourism at the regional level.

Using Text Mining and Social Network Analysis to Identify Determinant Characteristics Affecting Consumers' Evaluation of Clothing Fit (텍스트 마이닝과 소셜 네트워크 분석 기법을 활용한 소비자의 의복 맞음새(Fit)평가에 영향을 미치는 특성)

  • Soo Hyun Hwang;Juyeon Park
    • Science of Emotion and Sensibility
    • /
    • v.26 no.1
    • /
    • pp.101-114
    • /
    • 2023
  • This research aimed to recognize the determinant characteristics affecting consumers' clothing fit evaluation by employing text mining and social network analysis. For this aim, we first extracted text data linked to clothing fit from 2,000 consumer reviews collected from social network services and conducted semantic network examination and CONCOR analysis. As a result, we reported that "pants" and "skirts" were the most commonly associated clothing items with consumers' clothing fit evaluation. And the length of clothing was most commonly investigated. Then, the "waist" and "hip" were the most critical body parts affecting consumers' perception of clothing fit. Further, the four keywords including "wide," "large," "short," and "long" were the most employed ones in consumer reviews when evaluating clothing fit. This study is meaningful in that it specifically recognized the structural relationship and semantic meanings of keywords relevant to consumers' evaluation of clothing fit, which could bring empirical reference information for advanced clothing fit.

Analysis on the Trends of Research Themes of the Korean Dance Using Text Mining (텍스트 마이닝을 활용한 한국무용 연구주제 동향 분석)

  • Kim, Woo-Kyung;Yoo, Ji-Young
    • Journal of Korea Entertainment Industry Association
    • /
    • v.13 no.5
    • /
    • pp.215-228
    • /
    • 2019
  • The purpose of this study is to analyze the trends of research themes of the Korean dance in recent 20 years using text mining. The study has analyzed 3,047 words in 1,468 academic papers posted in the Research & Information Services Section(RISS). TEXTOM, a big data analysis solution, has been used to refine and analyse data, and the keyword analysis and topic modeling have been adopted during the text-mining process to come up with meaningful results. First, the theme of studies has shifted from the structure of the basic Korean dance moves to the use and transmission of the Korean dance. Second, those who participate in studies of the Korean dance have changed from middle-aged women to elderly women. Third, studies on dance records have been inactivated. Fourth, studies on Choi Seung-hee have consistently been a subject of interest. Fifth, the focus of studies has turned from the Korean creative dance to the Korean traditional dance. Sixth, there are no iconic research themes that would lead the academic trends with no clear boundaries of research themes.

A Trend Analysis and Policy proposal for the Work Permit System through Text Mining: Focusing on Text Mining and Social Network analysis (텍스트마이닝을 통한 고용허가제 트렌드 분석과 정책 제안 : 텍스트마이닝과 소셜네트워크 분석을 중심으로)

  • Ha, Jae-Been;Lee, Do-Eun
    • Journal of Convergence for Information Technology
    • /
    • v.11 no.9
    • /
    • pp.17-27
    • /
    • 2021
  • The aim of this research was to identify the issue of the work permit system and consciousness of the people on the system, and to suggest some ideas on the government policies on it. To achieve the aim of research, this research used text mining based on social data. This research collected 1,453,272 texts from 6,217 units of online documents which contained 'work permit system' from January to December, 2020 using Textom, and did text-mining and social network analysis. This research extracted 100 key words frequently mentioned from the analyses of data top-level key word frequency, and degree centrality analysis, and constituted job problem, importance of policy process, competitiveness in the respect of industries, and improvement of living conditions of foreign workers as major key words. In addition, through semantic network analysis, this research figured out major awareness like 'employment policy', and various kinds of ambient awareness like 'international cooperation', 'workers' human rights', 'law', 'recruitment of foreigners', 'corporate competitiveness', 'immigrant culture' and 'foreign workforce management'. Finally, this research suggested some ideas worth considering in establishing government policies on the work permit system and doing related researches.

Big Data Analysis on the Perception of Home Training According to the Implementation of COVID-19 Social Distancing

  • Hyun-Chang Keum;Kyung-Won Byun
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.15 no.3
    • /
    • pp.211-218
    • /
    • 2023
  • Due to the implementation of COVID-19 distancing, interest and users in 'home training' are rapidly increasing. Therefore, the purpose of this study is to identify the perception of 'home training' through big data analysis on social media channels and provide basic data to related business sector. Social media channels collected big data from various news and social content provided on Naver and Google sites. Data for three years from March 22, 2020 were collected based on the time when COVID-19 distancing was implemented in Korea. The collected data included 4,000 Naver blogs, 2,673 news, 4,000 cafes, 3,989 knowledge IN, and 953 Google channel news. These data analyzed TF and TF-IDF through text mining, and through this, semantic network analysis was conducted on 70 keywords, big data analysis programs such as Textom and Ucinet were used for social big data analysis, and NetDraw was used for visualization. As a result of text mining analysis, 'home training' was found the most frequently in relation to TF with 4,045 times. The next order is 'exercise', 'Homt', 'house', 'apparatus', 'recommendation', and 'diet'. Regarding TF-IDF, the main keywords are 'exercise', 'apparatus', 'home', 'house', 'diet', 'recommendation', and 'mat'. Based on these results, 70 keywords with high frequency were extracted, and then semantic indicators and centrality analysis were conducted. Finally, through CONCOR analysis, it was clustered into 'purchase cluster', 'equipment cluster', 'diet cluster', and 'execute method cluster'. For the results of these four clusters, basic data on the 'home training' business sector were presented based on consumers' main perception of 'home training' and analysis of the meaning network.

A Rule-Based Analysis from Raw Korean Text to Morphologically Annotated Corpora

  • Lee, Ki-Yong;Markus Schulze
    • Language and Information
    • /
    • v.6 no.2
    • /
    • pp.105-128
    • /
    • 2002
  • Morphologically annotated corpora are the basis for many tasks of computational linguistics. Most current approaches use statistically driven methods of morphological analysis, that provide just POS-tags. While this is sufficient for some applications, a rule-based full morphological analysis also yielding lemmatization and segmentation is needed for many others. This work thus aims at 〔1〕 introducing a rule-based Korean morphological analyzer called Kormoran based on the principle of linearity that prohibits any combination of left-to-right or right-to-left analysis or backtracking and then at 〔2〕 showing how it on be used as a POS-tagger by adopting an ordinary technique of preprocessing and also by filtering out irrelevant morpho-syntactic information in analyzed feature structures. It is shown that, besides providing a basis for subsequent syntactic or semantic processing, full morphological analyzers like Kormoran have the greater power of resolving ambiguities than simple POS-taggers. The focus of our present analysis is on Korean text.

  • PDF

A Study on the Integration Between Smart Mobility Technology and Information Communication Technology (ICT) Using Patent Analysis

  • Alkaabi, Khaled Sulaiman Khalfan Sulaiman;Yu, Jiwon
    • Journal of the Korea Society of Computer and Information
    • /
    • v.24 no.6
    • /
    • pp.89-97
    • /
    • 2019
  • This study proposes a method for investigating current patents related to information communication technology and smart mobility to provide insights into future technology trends. The method is based on text mining clustering analysis. The method consists of two stages, which are data preparation and clustering analysis, respectively. In the first stage, tokenizing, filtering, stemming, and feature selection are implemented to transform the data into a usable format (structured data) and to extract useful information for the next stage. In the second stage, the structured data is partitioned into groups. The K-medoids algorithm is selected over the K-means algorithm for this analysis owing to its advantages in dealing with noise and outliers. The results of the analysis indicate that most current patents focus mainly on smart connectivity and smart guide systems, which play a major role in the development of smart mobility.

A Study on Gamification Consumer Perception Analysis Using Big Data

  • Se-won Jeon;Youn Ju Ahn;Gi-Hwan Ryu
    • International Journal of Advanced Culture Technology
    • /
    • v.11 no.3
    • /
    • pp.332-337
    • /
    • 2023
  • The purpose of the study was to analyze consumers' perceptions of gamification. Based on the analyzed data, we would like to provide data by systematically organizing the concept, game elements, and mechanisms of gamification. Recently, gamification can be easily found around medical care, corporate marketing, and education. This study collected keywords from social media portal sites Naver, Daum, and Google from 2018 to 2023 using TEXTOM, a social media analysis tool. In this study, data were analyzed using text mining, semantic network analysis, and CONCOR analysis methods. Based on the collected data, we looked at the relevance and clusters related to gamification. The clusters were divided into a total of four clusters: 'Awareness of Gamification', 'Gamification Program', 'Future Technology of Gamification', and 'Use of Gamification'. Through social media analysis, we want to investigate and identify consumers' perceptions of gamification use, and check market and consumer perceptions to make up for the shortcomings. Through this, we intend to develop a plan to utilize gamification.

Analysis of Laughter Therapy Trend Using Text Network Analysis and Topic Modeling

  • LEE, Do-Young
    • Journal of Wellbeing Management and Applied Psychology
    • /
    • v.5 no.4
    • /
    • pp.33-37
    • /
    • 2022
  • Purpose: This study aims to understand the trend and central concept of domestic researches on laughter therapy. For the analysis, this study used total 72 theses verified by inputting the keyword 'laughter therapy' from 2007 to 2021. Research design, data and methodology: This study performed the development and analysis of keyword co-occurrence network, analyzed the types of researches through topic modeling, and verified the visualized word cloud and sociogram. The keyword data that was cleaned through preprocessing, was analyzed in the method of centrality analysis and topic modeling through the 1-mode matrix conversion process by using the NetMiner (version 4.4) Program. Results: The keywords that most appeared for last 14 years were laughter therapy, depression, the elderly, and stress. The five topics analyzed in thesis data from 2007 to 2021 were therapy, cognitive behavior, quality of life, stress, and the elderly. Conclusions: This study understood the flow and trend of research topics of domestic laughter therapy for last 14 years, and there should be continuous researches on laughter therapy, which reflects the flow of time in the future.