• Title/Summary/Keyword: Topic Information

Search Result 1,917, Processing Time 0.025 seconds

Analyzing Customer Experience in Hotel Services Using Topic Modeling

  • Nguyen, Van-Ho;Ho, Thanh
    • Journal of Information Processing Systems
    • /
    • v.17 no.3
    • /
    • pp.586-598
    • /
    • 2021
  • Nowadays, users' reviews and feedback on e-commerce sites stored in text create a huge source of information for analyzing customers' experience with goods and services provided by a business. In other words, collecting and analyzing this information is necessary to better understand customer needs. In this study, we first collected a corpus with 99,322 customers' comments and opinions in English. From this corpus we chose the best number of topics (K) using Perplexity and Coherence Score measurements as the input parameters for the model. Finally, we conducted an experiment using the latent Dirichlet allocation (LDA) topic model with K coefficients to explore the topic. The model results found hidden topics and keyword sets with high probability that are interesting to users. The application of empirical results from the model will support decision-making to help businesses improve products and services as well as business management and development in the field of hotel services.

Topic Modeling with Deep Learning-based Sentiment Filters (감정 딥러닝 필터를 활용한 토픽 모델링 방법론)

  • Choi, Byeong-Seol;Kim, Namgyu
    • The Journal of Information Systems
    • /
    • v.28 no.4
    • /
    • pp.271-291
    • /
    • 2019
  • Purpose The purpose of this study is to propose a methodology to derive positive keywords and negative keywords through deep learning to classify reviews into positive reviews and negative ones, and then refine the results of topic modeling using these keywords. Design/methodology/approach In this study, we extracted topic keywords by performing LDA-based topic modeling. At the same time, we performed attention-based deep learning to identify positive and negative keywords. Finally, we refined the topic keywords using these keywords as filters. Findings We collected and analyzed about 6,000 English reviews of Gyeongbokgung, a representative tourist attraction in Korea, from Tripadvisor, a representative travel site. Experimental results show that the proposed methodology properly identifies positive and negative keywords describing major topics.

On the Distribution of‘-(N)un’in Korean (‘-은/는’의 분포에 대하여)

  • 염재일
    • Language and Information
    • /
    • v.5 no.2
    • /
    • pp.57-74
    • /
    • 2001
  • In this paper, I propose syntactic, semantic and pragmatic restrictions on the distribution of the contrastive topic marker‘-(n)un’in Korean. A contrastive topic is associated with another focus. The association with focus is subject to syntactic islands. On the other hand, there is no syntactic restriction between a phrase attached with‘-(n)un’and a focused expression within the ‘-(n)un’phrase itself. In this area there is a semantic requirement that the alternatives generated by a focused expression be maintained up to the phrase attached with‘-(n)un’. Finally, when‘-(n)un’is used in an embedded clause, the whole sentence becomes natural when the contrastive topic introduced by‘-(n)un’and its alternative contrastive topic, which is presupposed by the contrastive topic marker, jointly constitute a more complex topic which is related to the whole context. And exclusiveness facilitates the formation of the whole complex context.

  • PDF

A Study on MARC Based Topic Map (Topic Map 기반의 MARC 적용 방안 연구)

  • Jang, Hwa-Su;Ko, Il-Ju
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2008.06a
    • /
    • pp.309-315
    • /
    • 2008
  • 문헌정보처리 표준화도구인 MARC는 포멧의 문제점과 다양한 웹자원 메타데이터 정보조직의 문제점으로 인하여 웹 기반의 XML표준 포멧의 도입을 시도하였고, MARCXML로 변환되어 시스템간 상호운용되고 있으나, MARCXML은 서지정보의 의미특성이나 메타데이터의 표현을 고려하지 않고 단순히 MARC 레코드의 표현을 XML 구조로 변환한 것일 뿐이다. 시맨틱의 핵심기술로 부각되고 있는 Topic Map은 XML기반의 표준기술언어인 ISO의 XTM을 이용해 정보와 지식의 분산 관리를 지원하는 기술이다. 학술정보자원에 대한 DB 구축 시 Topic Map언어인 XTM을 이용한다면 이미 개발된 여러 메타데이터 등을 한곳으로 통합하면서도 신축성과 확장성을 제공하는 것이 용이하게 된다. 하지만, 기존 시스템에서 새로운 Topic Map을 구축하는 것은 많은 비용과 시간이 소요되는 등 어려운 일이다. 본 연구에서는 기 구축된 학술DB로부터 Topic Map에서 재활용할 수 있는 요소들을 추출하기 위한 정보 소스로서 데이터베이스 스키마와 MARC에서 언급하는 메타데이터를 이용하는 것은, XML의 특징인 시스템간 상호운용성을 확보함과 동시에 기초 학문자료의 복잡한 관계의 개념구조, 자료유형 및 자료간의 의미적 상관관계 등을 표현에 있어 효율적인 개발방법임을 제안한다.

  • PDF

Non face-to-face News Articles Keyword Using Topic Modeling (토픽모델링을 이용한 비대면 신문 기사 키워드 분석)

  • Shin, Ari;Hwangbo, Jun Kwon
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.26 no.11
    • /
    • pp.1751-1754
    • /
    • 2022
  • The news articles collected with keyword "non face-to-face" were analyzed through topic modeling applied with LDA algorithm. In this study, collected articles were divided into two periods, period 1(the beginning of COVID-19 spread) and period 2(the end of COVID-19 spread), according to issued date of the articles. The articles of period 1 showed support for non-face-to-face treatment, smart library, the beginning of the online financial era, non-face-to-face entrance exam and employment, stock investment for main topic words. And the articles of period 2 showed conversion to non face-to-face classes, increasing unmanned stores, online finance, education industry, home treatment for main topic words. Also, further issues were discussed through visualization of topic words. These results provide evidence that education and unmanned business in non-face-to-face industries are growing.

Ego-centered Topic Citation Analysis on Folksonomy Research Documents (폭소노미 연구 문헌에 대한 자아 중심 주제 인용 분석)

  • Lee, Jae Yun
    • Journal of the Korean Society for information Management
    • /
    • v.29 no.4
    • /
    • pp.295-312
    • /
    • 2012
  • This research aims to present the ego-centered topic citation analysis, which is a new application of White's ego-centered citation analysis, for analyzing multilayered knowledge structure of a subject domain. An experimental topic citation analysis was carried out on the folksonomy research documents retrieved from Web of Science. Ego-centered topic citation analyses on folksonomy research domain were conducted in three stages: ego-documents set analysis, topic citation identity analysis, and topic citation image analysis. The results showed that the ego-centered topic citation analysis suggested in this study was successfully performed to illustrate the inner and the outer knowledge structures of folksonomy research domain.

Analysis of English abstracts in Journal of the Korean Data & Information Science Society using topic models and social network analysis (토픽 모형 및 사회연결망 분석을 이용한 한국데이터정보과학회지 영문초록 분석)

  • Kim, Gyuha;Park, Cheolyong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.1
    • /
    • pp.151-159
    • /
    • 2015
  • This article analyzes English abstracts of the articles published in Journal of the Korean Data & Information Science Society using text mining techniques. At first, term-document matrices are formed by various methods and then visualized by social network analysis. LDA (latent Dirichlet allocation) and CTM (correlated topic model) are also employed in order to extract topics from the abstracts. Performances of the topic models are compared via entropy for several numbers of topics and weighting methods to form term-document matrices.

A Study on the Application of Topic Modeling for the Book Report Text (독후감 텍스트의 토픽모델링 적용에 관한 탐색적 연구)

  • Lee, Soo-Sang
    • Journal of Korean Library and Information Science Society
    • /
    • v.47 no.4
    • /
    • pp.1-18
    • /
    • 2016
  • The purpose of this study is to explore application of topic modeling for topic analysis of book report. Topic modeling can be understood as one method of topic analysis. This analysis was conducted with texts in 23 book reports using LDA function of the "topicmodels" package provided by R. According to the result of topic modeling, 16 topics were extracted. The topic network was constructed by the relation between the topics and keywords, and the book report network was constructed by the relation between book report cases and topics. Next, Centrality analysis was conducted targeting the topic network and book report network. The result of this study is following these. First, 16 topics are shown as network which has one component. In other words, 16 topics are interrelated. Second, book report was divided into 2 groups, book reports with high centrality and book reports with low centrality. The former group has similarities with others, the latter group has differences with others in aspect of the topics of book reports. The result of topic modeling is useful to identify book reports' topics combining with network analysis.

Research Topic Analysis of the Domestic Papers Related to COVID-19 Using LDA (LDA를 사용한 COVID-19 관련 국내 논문의 연구 토픽 분석)

  • Kim, Eun-Hoe;Suh, Yu-Hwa
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.15 no.5
    • /
    • pp.423-432
    • /
    • 2022
  • This paper analyzes a total of 10,599 papers related to COVID-19 from January 2020 to July 2022 collected from the KCI site using LDA topic modeling so that academic researchers can understand the overall research trend. The results of LDA topic modeling are analyzed by major research categories so that academic researchers can easily figure out topics in their research fields. Then, the detailed research category information in which a lot of research is done by topic is analyzed. It is very important for academic researchers to understand the trend of research topics over time. Therefore, in this paper, the trend of topics is analyzed and presented using time series decomposition.

A Semantic Aspect-Based Vector Space Model to Identify the Event Evolution Relationship within Topics

  • Xi, Yaoyi;Li, Bicheng;Liu, Yang
    • Journal of Computing Science and Engineering
    • /
    • v.9 no.2
    • /
    • pp.73-82
    • /
    • 2015
  • Understanding how the topic evolves is an important and challenging task. A topic usually consists of multiple related events, and the accurate identification of event evolution relationship plays an important role in topic evolution analysis. Existing research has used the traditional vector space model to represent the event, which cannot be used to accurately compute the semantic similarity between events. This has led to poor performance in identifying event evolution relationship. This paper suggests constructing a semantic aspect-based vector space model to represent the event: First, use hierarchical Dirichlet process to mine the semantic aspects. Then, construct a semantic aspect-based vector space model according to these aspects. Finally, represent each event as a point and measure the semantic relatedness between events in the space. According to our evaluation experiments, the performance of our proposed technique is promising and significantly outperforms the baseline methods.