• 제목/요약/키워드: Topic modeling

검색결과 810건 처리시간 0.02초

Topic Modeling Analysis of Social Media Marketing using BERTopic and LDA

  • YANG, Woo-Ryeong;YANG, Hoe-Chang
    • 산경연구논집
    • /
    • 제13권9호
    • /
    • pp.37-50
    • /
    • 2022
  • Purpose: The purpose of this study is to explore and compare research trends in Korea and overseas academic papers on social media marketing, and to present new academic perspectives for the future direction in Korea. Research design, data and methodology: We used English abstract of research paper (Korea's: 1,349, overseas': 5,036) for word frequency analysis, topic modeling, and trend analysis for each topic. Results: The results of word frequency and co-occurrence frequency analysis showed that Korea researches focused on the experiential values of users, and overseas researches focused on platforms and content. Next, 13 topics and 12 topics for Korea and overseas researches were derived from topic modeling. And, trend analysis showed that Korean studies were different from overseas in applying marketing methods to specific industries and they were interested in the short-term performance of social media marketing. Conclusions: We found that the long-term strategies of social media marketing and academic interest in the overall industry will necessary in the future researches. Also, data mining techniques will necessary to generate more general results by quantifying various phenomena in reality. Finally, we expected that continuous and various academic approaches for volatile social media is effective to derive practical implications.

슈퍼앱 리뷰 토픽모델링을 통한 서비스 강화 방안 연구 (Research on Service Enhancement Approach based on Super App Review Data using Topic Modeling)

  • 유제원;송지훈
    • 한국산업융합학회 논문집
    • /
    • 제27권2_2호
    • /
    • pp.343-356
    • /
    • 2024
  • Super app is an application that provides a variety of services in a unified interface within a single platform. With the acceleration of digital transformation, super apps are becoming more prevalent. This study aims to suggest service enhancement measures by analyzing the user review data before and after the transition to a super app. To this end, user review data from a payment-based super app(Shinhan Play) were collected and studied via topic modeling. Moreover, a matrix for assessing the importance and usefulness of topics is introduced, which relies on the eigenvector centrality of the inter-topic network obtained through topic modeling and the number of review recommendations. This allowed us to identify and categorize topics with high utility and impact. Prior to the transition, the factors contributing to user satisfaction included 'payment service,' 'additional service,' and 'improvement.' Following the transition, user satisfaction was associated with 'payment service' and 'integrated UX.' Conversely, dissatisfaction factors before the transition encompassed issues related to 'signup/installation,' 'payment error/response,' 'security authentication,' and 'security error.' Following the transition, user dissatisfaction arose from concerns regarding 'update/error response' and 'UX/UI.' The research results are expected to be used as a basis for establishing strategies to strengthen service competitiveness by making super app services more user-oriented.

LDA를 사용한 COVID-19 관련 국내 논문의 연구 토픽 분석 (Research Topic Analysis of the Domestic Papers Related to COVID-19 Using LDA)

  • 김은회;서유화
    • 한국정보전자통신기술학회논문지
    • /
    • 제15권5호
    • /
    • pp.423-432
    • /
    • 2022
  • 본 논문은 학술연구자들이 COVID-19 관련 논문의 전체적인 연구 동향을 파악할 수 있도록 한다. KCI 사이트에서 수집한 2020년 1월부터 2022년 7월까지 총 10,599편의 COVID-19 관련 논문 정보를 LDA 토픽 모델링으로 분석한 결과를 제시한다. 또한 학술연구자들이 자신의 관심 연구분야의 토픽을 쉽게 파악할 수 있도록 LDA 토픽 모델링의 결과를 주요 연구 카테고리별로 분석하고, 토픽별로 연구가 많이 이루어지는 세부 연구 카테고리 정보를 분석한다. 학술연구자들이 시간의 흐름에 따른 연구 토픽의 추세(trend)를 파악하는 것은 연구 동향을 파악하는데 매우 중요하다. 따라서 이를 위해 본 논문에서는 시계열 분해를 사용하여 토픽들의 추세(trend)를 분석하여 제시한다.

The Impact of Topic Distribution on Review Sentiment: A Comparative Study between South Korea and the U.S.

  • Cho, Mina;Hwang, Dugmee;Jeon, Seongmin
    • 한국벤처창업학회:학술대회논문집
    • /
    • 한국벤처창업학회 2022년도 춘계학술대회
    • /
    • pp.123-126
    • /
    • 2022
  • Online reviews offer valuable information to businesses by reflecting consumer experiences about their products and services. Two important aspects of online reviews are first, the topics consumers choose to address and second, the sentiments expressed in their reviews. Building upon previous literature that shows online reviews are context-dependent, we examine the impact of topic distribution on review sentiment in South Korea and the U.S. during pre-and post-pandemic periods. After performing topic modeling on Airbnb app review data, we measure the contribution of each topic on review sentiment using SHAP values. Our results indicate variations in topic distribution trends between 2018 and 2021. Also, the order and magnitude of topics' impact on review sentiment change between pre-and post-pandemic periods for both countries. This study can help businesses to understand how topics and sentiments associated with their products and services changed after pandemic, and also help them identify areas of improvement.

  • PDF

Impact of Topic Distribution on Review Sentiment: A Comparative Study between South Korea and the U.S.

  • Mina Cho;Dugmee Hwang;SeongMin Jeon
    • Asia pacific journal of information systems
    • /
    • 제32권3호
    • /
    • pp.514-536
    • /
    • 2022
  • Online reviews offer valuable information to businesses by reflecting consumer experiences about their products and services. Two crucial aspects of online reviews are the topics consumers choose to address, and the sentiments expressed in their reviews. Building upon previous literature that shows online reviews are context-dependent, we employ the Expectation-Confirmation Theory (ECT) to examine the impact of topic distribution on review sentiment in South Korea and the U.S. during pre- and post-pandemic periods. After applying a topic modeling to Airbnb app review data, we measure the contribution of each topic on review sentiment using SHAP values. Our results indicate variations in topic distribution trends between 2018 and 2021. In addition, the order and magnitude of topics' impact on review sentiment change between pre- and post-pandemic periods for both countries. This study can help businesses understand how topics and sentiments associated with their products and services changed after the pandemic and thus identify areas of improvement.

언어 자원과 토픽 모델의 순차 매칭을 이용한 유사 문장 계산 기반의 위키피디아 한국어-영어 병렬 말뭉치 구축 (Building a Korean-English Parallel Corpus by Measuring Sentence Similarities Using Sequential Matching of Language Resources and Topic Modeling)

  • 천주룡;고영중
    • 정보과학회 논문지
    • /
    • 제42권7호
    • /
    • pp.901-909
    • /
    • 2015
  • 본 논문은 위키피디아로부터 한국어-영어 간 병렬 말뭉치를 구축하기 위한 연구이다. 이를 위해, 언어 자원과 토픽모델의 순차 매칭 기반의 유사 문장 계산 방법을 제안한다. 먼저, 언어자원의 매칭은 위키피디아 제목으로 구성된 위키 사전, 숫자, 다음 온라인 사전을 단어 매칭에 순차적으로 적용하였다. 또한, 위키피디아의 특성을 활용하기 위해 위키 사전에서 추정한 번역 확률을 단어 매칭에 추가 적용하였다. 그리고 토픽모델로부터 추출한 단어 분포를 유사도 계산에 적용함으로써 정확도를 향상시켰다. 실험에서, 선행연구의 언어자원만을 선형 결합한 유사 문장 계산은 F1-score 48.4%, 언어자원과 모든 단어 분포를 고려한 토픽모델의 결합은 51.6%의 성능을 보였으나, 본 논문에서 제안한 언어자원에 번역 확률을 추가하여 순차 매칭을 적용한 방법은 58.3%로 9.9%의 성능 향상을 얻었고, 여기에 중요한 단어 분포를 고려한 토픽모델을 적용한 방법이 59.1%로 7.5%의 성능 향상을 얻었다.

Analysis of Secondary Battery Trends Using Topic Modeling: Focusing on Solid-State Batteries

  • Chunghyun Do;Yong Jin Kim
    • Asian Journal of Innovation and Policy
    • /
    • 제12권3호
    • /
    • pp.345-362
    • /
    • 2023
  • As the widespread adoption and proliferation of electric vehicles continue, the secondary battery market is experiencing rapid growth. However, lithium-ion batteries, which constitute a majority of secondary batteries, present high risks of fire and explosion. Solid-state batteries are thus garnering attention as the next-generation batteries since they eliminate fire hazards and significantly reduce the risk of explosions. Against this background, the study aimed to analyze research trends and provide insights by examining 2,927 domestic papers related to solid-state batteries over the past decade (2013-2022). Specifically, we used topic modeling to extract major keywords associated with solid-state batteries research and to explore the network characteristics across major topics. The changes in research on solid-state batteries were analyzed in-depth by calculating topic dominance by year. The findings provide an overview of the emerging trends in domestic solid-state battery research, and might serve as a valuable reference in shaping long-term research directions.

토픽모델링을 활용한 한국산업경영시스템학회지의 최근 연구주제 분석 (Recent Research Trend Analysis for the Journal of Society of Korea Industrial and Systems Engineering Using Topic Modeling)

  • 박동준;구평회;오형술;윤 민
    • 산업경영시스템학회지
    • /
    • 제46권3호
    • /
    • pp.170-185
    • /
    • 2023
  • The advent of big data has brought about the need for analytics. Natural language processing (NLP), a field of big data, has received a lot of attention. Topic modeling among NLP is widely applied to identify key topics in various academic journals. The Korean Society of Industrial and Systems Engineering (KSIE) has published academic journals since 1978. To enhance its status, it is imperative to recognize the diversity of research domains. We have already discovered eight major research topics for papers published by KSIE from 1978 to 1999. As a follow-up study, we aim to identify major topics of research papers published in KSIE from 2000 to 2022. We performed topic modeling on 1,742 research papers during this period by using LDA and BERTopic which has recently attracted attention. BERTopic outperformed LDA by providing a set of coherent topic keywords that can effectively distinguish 36 topics found out this study. In terms of visualization techniques, pyLDAvis presented better two-dimensional scatter plots for the intertopic distance map than BERTopic. However, BERTopic provided much more diverse visualization methods to explore the relevance of 36 topics. BERTopic was also able to classify hot and cold topics by presenting 'topic over time' graphs that can identify topic trends over time.

지방자치단체의 스마트시티 조례 분석: 토픽모델링을 활용하여 (Analysis of Municipal Ordinances for Smart Cities of Municipal Governments: Using Topic Modeling)

  • 서형준
    • 정보화정책
    • /
    • 제30권1호
    • /
    • pp.41-66
    • /
    • 2023
  • 본 연구는 72개 지자체의 74개 스마트시티 조례를 대상으로, 지자체 스마트시티 조례의 방향성을 확인하고자 토픽모델링을 활용하여 조례의 주요 키워드를 확인하고, 조례의 키워드에 따른 주제분류를 진행하였다. 분석결과 주요 키워드는 스마트도시위원회의 구성 및 운영에 관한 키워드가 조례 내에서 높은 빈도를 보였다. 조례에 대한 토픽모델링 Latent Dirichlet Allocation(LDA) 분석결과 관련 키워드에 따라 총 8개의 주제로 분류할 수 있었다. 구체적으로 주제-1(스마트시티 추진사항 보안), 주제-2(스마트시티 산업진흥), 주제-3(스마트시티 주민협의체 구성), 주제-4(스마트시티 추진체계 지원), 주제-5(개인정보 관리), 주제-6(스마트시티 데이터 활용), 주제-7(지능정보화 행정구현), 주제-8(스마트시티 홍보) 등으로, 주제의 비중은 주제-6, 주제-4, 주제-1 등의 순으로 나타났다. 권역별 주제분류는 수도권은 주제-5, 주제-6, 주제-8 의 비중이 높았고, 지방권은 주제-2, 주제-3, 주제-4의 비중이 높아 수도권은 스마트시티의 실질 운영 관련 주제가 높았고, 지방권은 스마트시티 추진을 위한 준비단계 관련 주제 비중이 높았다.

'좋아요'와 '싫어요'같은 간접적 사회적 정보의 방향과 강도는 온라인 뉴스 콘텐츠 댓글의 숙의의 질과 어떤 관련이 있는가? 토픽 모델링을 이용한 토픽 다양성 분석 (How Are the Direction and the Intensity of Indirect Social Information such as Likes and Dislikes Related to the Deliberative Quality of Online News Content Comments? A Topic Diversity Analysis Using Topic Modeling)

  • 민진영;이애리
    • 한국정보시스템학회지:정보시스템연구
    • /
    • 제30권4호
    • /
    • pp.303-327
    • /
    • 2021
  • Purpose The online comments on news content have become social information and are understood based on deliberative democracy. Although the related research has focused on the relationship between online comments and their deliberative quality, the social information provided by online comments consists of not only direct information such as comments themselves but also indirect information such as 'likes' and 'dislikes'. Therefore, the research on online comments and deliberative quality should study this direct and indirect information together, and the direction and the degree of the indirect information should be also considered with them. Design/methodology/approach This study distinguishes comments by the attached 'likes' and 'dislikes', identifies highly supported and highly unsupported comments by the intensity of 'likes' and 'dislikes', and investigates the relationship between their existence and the deliberative quality measured as the topic diversity. Then, we applied topic modeling to the 2,390 news articles and their 74,385 comments collected from five news sites. Findings The topic diversities of the supported and unsupported comments are related to the topic diversity of all comments but the degree of the relationship is higher in the case of supported comments. Furthermore, the existence of highly supported and unsupported comments is led to less diversity of all comments compared to the case where those comments are absent. Particularly, when only highly supported comments are present, topic diversity was lower than in the opposite case.