• Title/Summary/Keyword: Topic Modeling Analysis

Brand Personality of Global Automakers through Text Mining

  • Kim, Sungkuk
    • Journal of Korea Trade / v.25 no.2 / pp.22-45 / 2021
  • Purpose - This study aims to identify new attributes by analyzing reviews written by customers of global automakers and to examine the influence of these attributes on satisfaction ratings in the U.S. automobile sales market. Customer responses were drawn from J.D. Power, which the study describes as the largest online review site in the USA. Design/methodology - Automobile customer reviews are valid data for analyzing an automaker's brand personality. This study collected 2,998 survey responses on automobile companies in the U.S. automobile sales market and analyzed them using keyword analysis, topic modeling, and multiple regression analysis. Findings - Using topic modeling, the author analyzed 2,998 responses about U.S. automobile brands. Topic 1 (Competence), Topic 5 (Sincerity), and Topic 6 (Prestige) had positive effects, and Topic 2 (Sophistication) had a negative effect, on overall customer responses. Topic 4 (Conspicuousness) had no statistically significant effect. Topics 1, 5, and 6 also indicate which factors matter in the purchase decision. The study contributes by identifying a new personality attribute. These findings will help global automakers better understand how Topics 1, 5, and 6 affect car purchases. Originality/value - In contrast to the traditional approach to brand analysis based on questionnaire surveys, this study analyzed customer reviews using text mining. The study is timely because it applies big data analysis so that customers' direct responses can be identified in the future.

Research Trend Analysis on Smart healthcare by using Topic Modeling and Ego Network Analysis (토픽모델링과 에고 네트워크 분석을 활용한 스마트 헬스케어 연구동향 분석)

  • Yoon, Jee-Eun;Suh, Chang-Jin
    • Journal of Digital Contents Society / v.19 no.5 / pp.981-993 / 2018
  • Smart healthcare is the convergence of ICT and healthcare services, and interdisciplinary research on it has been actively conducted in various fields. The objective of this study is to investigate trends in smart healthcare research using topic modeling and ego network analysis. Text analysis, frequency analysis, topic modeling, word cloud analysis, and ego network analysis were conducted on the abstracts of 2,690 articles indexed in Scopus from 2001 to April 2018. Topic modeling yielded eight topics: "AI in healthcare", "Smart hospital", "Healthcare platform", "Blockchain in healthcare", "Smart health data", "Mobile healthcare", "Wellness care", and "Cognitive healthcare". To examine the topic modeling results more deeply, we conducted word cloud and ego network analyses for the eight topics. This study aims to identify trends in smart healthcare research and suggest implications for establishing future research directions.
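
As a rough illustration of the ego network step, the sketch below builds a keyword co-occurrence graph and extracts the 1-step ego network of one keyword. The abstract does not say how the network was constructed, so the co-occurrence weighting, the function names `cooccurrence_graph` and `ego_network`, and the toy keyword lists are all assumptions made for illustration.

```python
from collections import defaultdict
from itertools import combinations


def cooccurrence_graph(docs):
    """Undirected keyword graph; edge weight = number of documents
    in which both keywords appear."""
    graph = defaultdict(lambda: defaultdict(int))
    for keywords in docs:
        for a, b in combinations(sorted(set(keywords)), 2):
            graph[a][b] += 1
            graph[b][a] += 1
    return graph


def ego_network(graph, ego):
    """Edges among the ego keyword and its direct neighbours."""
    nodes = {ego} | set(graph[ego])
    edges = {}
    for a in nodes:
        for b, weight in graph[a].items():
            # Keep each undirected edge once, as an alphabetically ordered pair.
            if b in nodes and a < b:
                edges[(a, b)] = weight
    return edges


# Hypothetical keyword lists, one per abstract (not data from the study).
docs = [
    ["blockchain", "health", "data"],
    ["mobile", "health", "app"],
    ["blockchain", "data", "security"],
]
graph = cooccurrence_graph(docs)
ego_edges = ego_network(graph, "health")
```

Here "security" never co-occurs with "health", so it falls outside the ego network, while the tie between "blockchain" and "data" is kept because both are neighbours of the ego.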

Topic Modeling Analysis of Franchise Research Trends Using LDA Algorithm (LDA 알고리즘을 이용한 프랜차이즈 연구 동향에 대한 토픽모델링 분석)

  • YANG, Hoe-Chang
    • The Korean Journal of Franchise Management / v.12 no.4 / pp.13-23 / 2021
  • Purpose: This study aimed to derive clues that would help the franchise industry overcome difficulties such as legal regulations and demands for social responsibility, and continue to develop, by analyzing trends in franchise-related research published in Korea. Research design, data and methodology: Abstracts were collected by searching ScienceON for 'franchise' in papers published in domestic academic journals from 1994 to June 2021. Keywords were extracted from the abstracts of 1,110 valid papers, and, after preprocessing, keyword analysis, TF-IDF analysis, and topic modeling with the LDA algorithm were carried out using R, along with a trend analysis by year group of the top 20 words by TF-IDF. Results: The keyword analysis showed that businesses and brands were the main subjects of franchise-related research, that interest in service and satisfaction was considerable, and that food and coffee were the most prominently studied industries. The TF-IDF calculation ranked brand, satisfaction, franchisor, and coffee at the top. LDA-based topic modeling derived a total of 12 topics, including "growth strategy", which were visualized with LDAvis. However, Topic 1 (growth strategy) and Topic 9 (organizational culture), Topic 4 (consumption experience) and Topic 6 (contribution and loyalty), and Topic 7 (brand image) and Topic 10 (commercial area) overlapped significantly. Finally, the trend analysis of the top 20 TF-IDF keywords indicated that 10 keywords, such as quality, brand, food, and trust, would be used more overall. Conclusions: The results confirmed the direction of research interest in the franchise industry and showed that clues for continuous growth need to be sought through research in more diverse fields. Suggesting a technique that can compensate for the problems of topic trend analysis was also considered an important finding. The results therefore offer researchers significant insights into selecting research topics, and practitioners insights into future changes in franchising.
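
The TF-IDF ranking mentioned above can be sketched in a few lines. The study ran its analysis in R and does not state which TF-IDF variant it used, so the weighting below (relative term frequency times log(N / document frequency)), the function name `tf_idf`, and the toy token lists are assumptions for illustration only.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: score} dict per document.
    tf = term count / document length; idf = log(N / document frequency)."""
    n_docs = len(docs)
    doc_freq = Counter()
    for tokens in docs:
        doc_freq.update(set(tokens))  # count each term once per document
    scores = []
    for tokens in docs:
        counts = Counter(tokens)
        total = len(tokens)
        scores.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return scores


# Hypothetical preprocessed abstracts (not data from the study).
docs = [
    ["brand", "satisfaction", "brand"],
    ["coffee", "brand"],
    ["coffee", "franchisor", "trust"],
]
scores = tf_idf(docs)
# Terms of the first document, highest TF-IDF first.
top_terms = sorted(scores[0], key=scores[0].get, reverse=True)
```

With this weighting, a term that occurs in every document gets a score of zero, which is why frequent but uninformative words drop out of a TF-IDF ranking.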

Keywords and Topic Analysis of Social Issues on Twitter Based on Text Mining and Topic Modeling (텍스트 마이닝과 토픽 모델링을 기반으로 한 트위터에 나타난 사회적 이슈의 키워드 및 주제 분석)

  • Kwak, Soo Jeong;Kim, Hyon Hee
    • KIPS Transactions on Software and Data Engineering / v.8 no.1 / pp.13-18 / 2019
  • In this study, we investigate important keywords for social issues and the relationships among them, and analyze topics to find the subjects of those issues. In particular, we collected Twitter data with the keyword 'metoo', which has recently attracted much attention, and performed keyword analysis and topic modeling. First, we preprocessed the Twitter data, identified important keywords, and analyzed the relatedness of the keywords. Then, topic modeling was performed to find subjects related to 'metoo'. Our experimental results show that the relatedness of keywords and the subjects of social issues on Twitter are well identified by keyword analysis and topic modeling.

Analysis of Global Media Reporting Trends for K-fashion -Applying Dynamic Topic Modeling- (K 패션에 대한 글로벌 미디어 보도 경향 분석 -다이내믹 토픽 모델링(Dynamic Topic Modeling)의 적용-)

  • Hyosun An;Jiyoung Kim
    • Journal of the Korean Society of Clothing and Textiles / v.46 no.6 / pp.1004-1022 / 2022
  • This study seeks to investigate K-fashion's external image by examining the trends in global media reporting. It applies Dynamic Topic Modeling (DTM), which captures the evolution of topics in a sequentially organized corpus of documents; the analysis consists of text preprocessing, determination of the number of topics, and a time-series analysis of the probability distribution of words within topics. The data set comprised 551 online media articles on 'Korean fashion' or 'K-fashion' published on Google News between 2010 and 2021. The analysis identifies seven topics: 'brand look and style,' 'lifestyle,' 'traditional style,' 'Seoul Fashion Week (SFW) event,' 'model size,' 'K-pop,' and 'fashion market,' as well as annual topic proportion trends. It also explores annual word changes within each topic and indicates increasing and decreasing word patterns. In most topics, the probability of the word 'brand' is confirmed to be increasing, while 'digital,' 'platform,' and 'virtual' newly appear in the 'SFW event' topic. Moreover, this study confirms the transition of each K-fashion topic over the past 12 years, along with various factors related to Hallyu content, traditional culture, government support, and digital technology innovation.

Analysis of Social Media Contents about Broadcast Media through Topic Modeling (토픽 모델링을 이용한 방송미디어 관련 소셜 미디어 콘텐츠 분석)

  • Park, Sangun
    • Journal of Information Technology Services / v.15 no.2 / pp.81-92 / 2016
  • Numerous people share their TV experiences with other viewers on social media such as personal blogs and Twitter. This means that broadcast media, especially TV, affect responses on social media; those responses in turn affect broadcast ratings. Social TV has tried to exploit this relationship in marketing activities such as advertising by analyzing TV-related social behavior. However, most such efforts used only the volume of social media responses. This study uses topic modeling to analyze the subjects of social media responses to specific TV dramas, and the relationship between changes in popular topics and the dramas' viewer ratings over specified periods. Five representative Korean dramas of 2014 were selected, and blog content about the dramas, including viewer ratings, was collected from naver.com, the representative portal in South Korea. The proposed analysis framework consists of three steps: blog crawling, topic modeling, and topic trend analysis. The topic trend analysis yielded several implications. First, there were specific topics about dramas on social media. Second, the topics had meaningful relationships with viewer ratings. Last, the topics of dramas with higher viewer ratings differed from those of dramas with lower ratings.

Analysis of Laughter Therapy Trend Using Text Network Analysis and Topic Modeling

  • LEE, Do-Young
    • Journal of Wellbeing Management and Applied Psychology / v.5 no.4 / pp.33-37 / 2022
  • Purpose: This study aims to understand the trends and central concepts of domestic research on laughter therapy. The analysis used a total of 72 theses retrieved with the keyword 'laughter therapy' and published from 2007 to 2021. Research design, data and methodology: This study built and analyzed a keyword co-occurrence network, analyzed the types of research through topic modeling, and examined the resulting word cloud and sociogram visualizations. The keyword data, cleaned through preprocessing, were converted to a 1-mode matrix and analyzed by centrality analysis and topic modeling using the NetMiner program (version 4.4). Results: The keywords that appeared most often over the last 14 years were laughter therapy, depression, the elderly, and stress. The five topics identified in the thesis data from 2007 to 2021 were therapy, cognitive behavior, quality of life, stress, and the elderly. Conclusions: This study traced the flow and trends of domestic research topics on laughter therapy over the last 14 years; research on laughter therapy should continue in a way that reflects changes over time.
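
The 1-mode conversion and centrality analysis described above were done in NetMiner; the sketch below only illustrates the general idea in plain Python. It assumes a binary document-by-keyword (2-mode) matrix, converts it to a keyword-by-keyword (1-mode) co-occurrence matrix, and computes degree centrality. The matrix values and the helper names `one_mode` and `degree_centrality` are hypothetical, and the study may have used other centrality measures.

```python
def one_mode(incidence):
    """Convert a 2-mode document x keyword 0/1 matrix into a 1-mode
    keyword x keyword co-occurrence matrix (A^T A with a zero diagonal)."""
    n_keywords = len(incidence[0])
    co = [[0] * n_keywords for _ in range(n_keywords)]
    for row in incidence:
        for i in range(n_keywords):
            for j in range(n_keywords):
                if i != j and row[i] and row[j]:
                    co[i][j] += 1
    return co


def degree_centrality(co):
    """Share of the other keywords that each keyword co-occurs with."""
    n = len(co)
    return [sum(1 for weight in row if weight > 0) / (n - 1) for row in co]


# Hypothetical data: rows are theses, columns follow `keywords`.
keywords = ["laughter therapy", "depression", "elderly", "stress"]
incidence = [
    [1, 1, 1, 0],
    [1, 0, 0, 1],
    [1, 1, 0, 0],
]
co = one_mode(incidence)
centrality = dict(zip(keywords, degree_centrality(co)))
```

In this toy matrix "laughter therapy" co-occurs with every other keyword, so its degree centrality is 1.0, the maximum.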

Efficient Topic Modeling by Mapping Global and Local Topics (전역 토픽의 지역 매핑을 통한 효율적 토픽 모델링 방안)

  • Choi, Hochang;Kim, Namgyu
    • Journal of Intelligence and Information Systems / v.23 no.3 / pp.69-94 / 2017
  • Recently, growing demand for big data analysis has been driving the vigorous development of related technologies and tools. In addition, advances in IT and the increased penetration of smart devices are producing large amounts of data. As a result, data analysis technology is rapidly becoming popular, and attempts to gain insights through data analysis have been continuously increasing. This means that big data analysis will become more important in various industries for the foreseeable future. Big data analysis has generally been performed by a small number of experts and delivered to those who request the analysis. However, growing interest in big data analysis has stimulated computer programming education and the development of many data analysis programs. Accordingly, the entry barriers to big data analysis are gradually falling and data analysis technology is spreading; as a result, big data analysis is expected to be performed by the requesters themselves. Along with this, interest in various kinds of unstructured data is continually increasing, with particular attention on text data. The emergence of new web-based platforms and techniques has brought about the mass production of text data and active attempts to analyze it, and the results of text analysis have been used in various fields. Text mining is a concept that embraces various theories and techniques for text analysis. Of the many text mining techniques used for various research purposes, topic modeling is one of the most widely used and studied. Topic modeling is a technique that extracts the major issues from a large number of documents, identifies the documents that correspond to each issue, and provides the identified documents as a cluster. It is regarded as very useful because it reflects the semantic elements of documents. 
Traditional topic modeling is based on the distribution of key terms across the entire document set. Thus, the entire document set must be analyzed at once to identify the topic of each document. This makes the analysis take a long time when topic modeling is applied to a large number of documents. It also causes a scalability problem: processing time increases exponentially with the number of analysis objects. This problem is particularly noticeable when the documents are distributed across multiple systems or regions. To overcome these problems, a divide-and-conquer approach can be applied to topic modeling: a large number of documents is divided into sub-units, and topics are derived by repeating topic modeling on each unit. This method makes topic modeling on a large number of documents possible with limited system resources and can improve processing speed. It can also significantly reduce analysis time and cost, because documents can be analyzed at each location without being combined. Despite these advantages, the method has two major problems. First, the relationship between the local topics derived from each unit and the global topics derived from the entire document set is unclear: local topics can be identified in each document, but global topics cannot. Second, a method for measuring the accuracy of such a methodology needs to be established; that is, taking the global topics as the ideal answer, the deviation of a local topic from a global topic needs to be measured. Because of these difficulties, this approach has not been studied sufficiently compared with other topic modeling research. In this paper, we propose a topic modeling approach that solves these two problems. 
First, we divide the entire document cluster (global set) into sub-clusters (local sets) and generate a reduced global set (RGS) consisting of representative documents extracted from each local set. We address the first problem by mapping RGS topics to local topics. We then verify the accuracy of the proposed methodology by checking whether each document is assigned to the same topic in the global and local results. Using 24,000 news articles, we conduct experiments to evaluate the practical applicability of the proposed methodology. An additional experiment confirmed that the proposed methodology can provide results similar to topic modeling on the entire set. We also propose a reasonable method for comparing the results of the two approaches.
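
The abstract does not specify how RGS topics and local topics are mapped or how agreement is scored, so the following is only one plausible sketch. It assumes each topic is a sparse word-weight vector, maps every local topic to the RGS topic with the highest cosine similarity, and then measures the share of documents whose mapped local topic matches their global topic. The similarity measure, the function names, and all data are assumptions for illustration.

```python
import math


def cosine(p, q):
    """Cosine similarity between two sparse {word: weight} vectors."""
    dot = sum(weight * q.get(word, 0.0) for word, weight in p.items())
    norm_p = math.sqrt(sum(w * w for w in p.values()))
    norm_q = math.sqrt(sum(w * w for w in q.values()))
    return dot / (norm_p * norm_q) if norm_p and norm_q else 0.0


def map_topics(local_topics, rgs_topics):
    """Map each local topic id to the id of the most similar RGS topic."""
    return {
        local_id: max(rgs_topics, key=lambda g: cosine(local_topic, rgs_topics[g]))
        for local_id, local_topic in local_topics.items()
    }


def agreement(doc_local, doc_global, mapping):
    """Share of documents whose mapped local topic equals their global topic."""
    hits = sum(1 for doc, local_id in doc_local.items()
               if mapping[local_id] == doc_global[doc])
    return hits / len(doc_local)


# Hypothetical topic-word weights and document assignments.
rgs_topics = {"G1": {"election": 0.6, "vote": 0.4},
              "G2": {"match": 0.7, "goal": 0.3}}
local_topics = {"L1": {"vote": 0.5, "election": 0.3, "poll": 0.2},
                "L2": {"goal": 0.6, "match": 0.4}}
doc_local = {"d1": "L1", "d2": "L2", "d3": "L1"}
doc_global = {"d1": "G1", "d2": "G2", "d3": "G2"}

mapping = map_topics(local_topics, rgs_topics)
score = agreement(doc_local, doc_global, mapping)
```

In this toy example two of the three documents land on the same topic in both results, so the agreement score is 2/3.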

Combining Ego-centric Network Analysis and Dynamic Citation Network Analysis to Topic Modeling for Characterizing Research Trends (자아 중심 네트워크 분석과 동적 인용 네트워크를 활용한 토픽모델링 기반 연구동향 분석에 관한 연구)

  • Yu, So-Young
    • Journal of the Korean Society for Information Management / v.32 no.1 / pp.153-169 / 2015
  • This study proposes and examines a combined approach that uses ego-centric network analysis and dynamic citation network analysis to refine the results of LDA-based topic modeling. Two datasets were constructed by collecting Web of Science bibliographic records on White LED, and topic modeling was performed with a different number of topics on each dataset. Top keywords assigned to multiple topics were re-assigned to one specific topic by applying an ego-centric network analysis algorithm. Topical cohesion was strongest, with the best values of the internal clustering evaluation indices, when topic modeling was run on the dataset extracted by SPLC network analysis with the number of topics that gave the lowest perplexity. The results also demonstrate the possibility of developing the suggested approach into a method for multi-faceted research trend detection.
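
Choosing the number of topics by lowest perplexity can be sketched as follows. The code uses the standard definition, perplexity = exp(-(total log-likelihood) / (number of tokens)), with the probability of a word in a document taken as the sum over topics of theta[d][k] * phi[k][w]. The candidate models, their parameter values, and the token lists are made up for illustration; in practice perplexity is usually computed on held-out documents with a fitted, smoothed model.

```python
import math


def perplexity(docs, theta, phi):
    """docs: list of token lists; theta[d][k]: share of topic k in doc d;
    phi[k][w]: probability of word w in topic k.
    Assumes every token has non-zero probability (a smoothed model)."""
    log_lik, n_tokens = 0.0, 0
    for d, tokens in enumerate(docs):
        for word in tokens:
            p = sum(theta[d][k] * phi[k].get(word, 0.0) for k in range(len(phi)))
            log_lik += math.log(p)
            n_tokens += 1
    return math.exp(-log_lik / n_tokens)


# Hypothetical tokenised records and two candidate models.
docs = [["led", "white", "led"], ["phosphor", "led"]]
candidates = {
    # One topic, uniform over a four-word vocabulary.
    1: ([[1.0], [1.0]],
        [{"led": 0.25, "white": 0.25, "phosphor": 0.25, "chip": 0.25}]),
    # Two topics, each document assigned fully to one of them.
    2: ([[1.0, 0.0], [0.0, 1.0]],
        [{"led": 0.6, "white": 0.3, "phosphor": 0.05, "chip": 0.05},
         {"led": 0.4, "phosphor": 0.5, "white": 0.05, "chip": 0.05}]),
}
scores = {k: perplexity(docs, theta, phi) for k, (theta, phi) in candidates.items()}
best_k = min(scores, key=scores.get)  # number of topics with the lowest perplexity
```

A uniform model over four words has a perplexity of exactly 4, which gives a baseline against which the better-fitting two-topic model scores lower.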

A Study on Research Trend Analysis and Topic Class Prediction of Digital Transformation using Text Mining

  • Lee, JeeYoung
    • International Journal of Advanced Smart Convergence / v.8 no.2 / pp.183-190 / 2019
  • In the era of the Fourth Industrial Revolution, digital transformation, which means changes in all industrial structures, politics, economics, and society as well as in IT, is an important issue. Because digital transformation is studied in various fields, it is difficult to know which research topics are being studied. Convergence research is possible because a research topic is studied across fields such as computer science and decision science, but the specific research status of each topic is hard to determine. In this study, eight research topics were derived by applying the topic modeling technique of text mining to the abstracts of academic literature, and the trend of each topic was analyzed. We also propose creating a Topic-Word Proportions Table during LDA-based topic modeling to predict the topic of new literature. The results are expected to contribute to advanced convergence research on digital transformation, to help identify the literature related to each research topic, and to support the design of new convergence research.
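
The abstract does not describe how the Topic-Word Proportions Table is used for prediction, so the sketch below shows just one simple possibility: score each topic by summing the table proportions of the words found in a new abstract and pick the topic with the highest score. The scoring rule, the function name `predict_topic`, the topic labels, and the proportions are all hypothetical.

```python
def predict_topic(tokens, topic_word):
    """Return (best_topic, scores), where a topic's score is the sum of
    its table proportions over the tokens of the new document."""
    scores = {
        topic: sum(words.get(token, 0.0) for token in tokens)
        for topic, words in topic_word.items()
    }
    return max(scores, key=scores.get), scores


# Hypothetical excerpt of a topic-word proportions table.
topic_word = {
    "smart factory": {"manufacturing": 0.05, "iot": 0.04, "sensor": 0.03},
    "digital strategy": {"strategy": 0.06, "business": 0.05, "iot": 0.01},
}
new_abstract = ["iot", "sensor", "manufacturing", "business"]
best_topic, scores = predict_topic(new_abstract, topic_word)
```

A lookup table of this kind lets new literature be classified without refitting the LDA model, at the cost of ignoring words that are absent from the table.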