• Title/Summary/Keyword: 토픽분석

Search Result 660, Processing Time 0.024 seconds

An Experimental Study on Topic Distillation Using Web Site Structure (웹 사이트 구조를 이용한 토픽 검색 연구)

  • Lee, Jee-Suk;Chung, Yung-Mee
    • Journal of the Korean Society for information Management
    • /
    • v.24 no.3
    • /
    • pp.201-218
    • /
    • 2007
  • This study proposes a topic distillation algorithm that ranks the relevant sites selected from retrieved web pages, and evaluates the performance of the algorithm. The algorithm calculates the topic score of a site using its hierarchical structure. The TREC .GOV test collection and a set of TREC-2004 queries for topic distillation task are used for the experiment. The experimental results showed the algorithm returned at least 2 relevant sites in top ten retrieval results. We peformed an in-depth analysis of the relevant sites list provided by TREC-2004 to find out that the definition of topic distillation was not strictly applied in selecting relevant sites. When we re-evaluated the retrieved sites/sub-sites using the revised list of relevant sites, the performance of the proposed algorithm was improved significantly.

Topic modeling for automatic classification of learner question and answer in teaching-learning support system (교수-학습지원시스템에서 학습자 질의응답 자동분류를 위한 토픽 모델링)

  • Kim, Kyungrog;Song, Hye jin;Moon, Nammee
    • Journal of Digital Contents Society
    • /
    • v.18 no.2
    • /
    • pp.339-346
    • /
    • 2017
  • There is increasing interest in text analysis based on unstructured data such as articles and comments, questions and answers. This is because they can be used to identify, evaluate, predict, and recommend features from unstructured text data, which is the opinion of people. The same holds true for TEL, where the MOOC service has evolved to automate debating, questioning and answering services based on the teaching-learning support system in order to generate question topics and to automatically classify the topics relevant to new questions based on question and answer data accumulated in the system. Therefore, in this study, we propose topic modeling using LDA to automatically classify new query topics. The proposed method enables the generation of a dictionary of question topics and the automatic classification of topics relevant to new questions. Experimentation showed high automatic classification of over 0.7 in some queries. The more new queries were included in the various topics, the better the automatic classification results.

Research Trends in Korean Healing Facilities and Healing Programs Using LDA Topic Modeling (LDA 토픽모델링을 활용한 국내 치유시설과 치유프로그램 연구 동향)

  • Lee, Ju-Hong;Lee, Kyung-Jin;Sung, Jung-Han
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.51 no.3
    • /
    • pp.95-106
    • /
    • 2023
  • Korean healing research has developed over the past 20 years along with the growing social interest in healing. The field of healing research is diverse and includes legislated natural-based healing. In this study, abstracts of 2,202 academic journals, master's, and doctoral dissertations published in KCI and RISS were collected and analyzed. As for the research method, LDA topic modeling used to classify research topics, and time-series publication trends were examined. As a result of the study, it identified that the topic of Korean healing research was connected with 5 types and 4 mediators. The five were "Healing Tourism," "Mind and Art Healing," "Forest Therapy," "Healing Space," and "Youth Restoration and Healing," and the four mediators were "Forest," "Nature," "Culture", and "Education". In addition, only legalized healing studies extracted from Korean healing research and the topics were analyzed. As a result, legalized healing research classified into four. The four types were "Healing Spatial Environment Plan", "Healing Therapy Experiment", "Agricultural Education Experiential Healing", and "Healing Tourism Factor". Forest Therapy, which has the largest amount of research in legalized healing, Agro Healing and Garden Healing which operate similar programs through plants, and Marine Healing using marine resources also analyzed. As a result, topics that show the unique characteristics of individual healing studies and topics that are considered universal in all healing studies derived. This study is significant in that it identified the overall trend of research on Korean healing facilities and programs by utilizing LDA topic modeling.

Research Trends Investigation Using Text Mining Techniques: Focusing on Social Network Services (텍스트마이닝을 활용한 연구동향 분석: 소셜네트워크서비스를 중심으로)

  • Yoon, Hyejin;Kim, Chang-Sik;Kwahk, Kee-Young
    • Journal of Digital Contents Society
    • /
    • v.19 no.3
    • /
    • pp.513-519
    • /
    • 2018
  • The objective of this study was to examine the trends on social network services. The abstracts of 308 articles were extracted from web of science database published between 1994 and 2016. Time series analysis and topic modeling of text mining were implemented. The topic modeling results showed that the research topics were mainly 20 topics: trust, support, satisfaction model, organization governance, mobile system, internet marketing, college student effect, opinion diffusion, customer, information privacy, health care, web collaboration, method, learning effectiveness, knowledge, individual theory, child support, algorithm, media participation, and context system. The time series regression results indicated that trust, support satisfaction model, and remains of the topics were hot topics. This study also provided suggestions for future research.

A Convergence Study on the Topic and Sentiment of COVID19 Research in Korea Using Text Analysis (텍스트 분석을 이용한 코로나19 관련 국내 논문의 주제 및 감성에 관한 융합 연구)

  • Heo, Seong-Min;Yang, Ji-Yeon
    • Journal of the Korea Convergence Society
    • /
    • v.12 no.4
    • /
    • pp.31-42
    • /
    • 2021
  • The purpose of this study was to explore research topics and examine the trend in COVID19 related research papers. We identified eight topics using latent Dirichlet allocation and found acceptable validity in comparison with the structural topic model. The subtopics have been extracted using k-means clustering and plotted in PCA space. Additionally, we discovered the topics bearing negative tones and warning signs by sentiment analysis. The results flagged up the issues of the topics, Biomedical Related, International Dynamics and Psychological Impact. The findings could serve as a guideline for researchers who explore new research directions and policymakers who need to make decisions about which research projects to support.

'Korean Wave' News Analysis Using News Big Data ('한류' 경향에 관한 국내 언론 기사 빅데이터 분석 연구)

  • Hwang, Seo-I;Park, Jeong-Bae
    • Journal of Korea Entertainment Industry Association
    • /
    • v.14 no.5
    • /
    • pp.1-14
    • /
    • 2020
  • This study conducted a topic modeling and semantic network analysis of 'korean wave' and its meaning in Korean society from 2000 to 2019 by applying an agenda setting theory. For this purpose, a total of 197,992 newspaper articles which reported 'korean wave' issues were analyzed by applying topic modeling and semantic network analysis. As a result, first, the word 'korean wave' mainly appeared in korean-related regions in the korean press. culture and economy. second, a total of 9 topics related to korean wave issues appeared. This was followed by 'broadcast', 'export', 'domestic and foreign affairs', 'education', 'beauty and fashion', 'music and performance', 'tourism', 'media(platform)', and 'region'. Lastly, korean wave was mainly discussed at the cultural and economic ares. In addition, it was clustered into five characteristics: 'cultural hallyu', 'business hallyu', 'education', 'environment', and 'geography'.

An Analysis of Research Trends on Basic Academic Abilities in Mathematics with Frequency Analysis and Topic Modeling (빈도 분석 및 토픽모델링을 활용한 수학 교과에서 기초학력 관련 연구 동향 분석)

  • Cho, Mi Kyung
    • Communications of Mathematical Education
    • /
    • v.37 no.4
    • /
    • pp.615-633
    • /
    • 2023
  • This study analyzed Korean studies up to August 2023 to suggest the direction of future research on basic academic abilities in mathematics. For this purpose, frequency analysis and LDA-based topic modeling were conducted on the Korean abstracts of 197 domestic studies. The results showed that, first, 'academic achievement', 'impact', 'effect', and 'factors' were all ranked at the top of the TFs and TF-IDFs. Second, as a result of LDA-based topic modeling, five topics were identified: causes of basic academic abilities deficiency, learning status of math underachievers, teacher expertise in teaching math underachievers, supporting programs for math underachievers, and results of National Assessment of Educational Achievement. As a direction for future research, this study suggests focusing on the growth of math underachievers, systematizing the programs provided to students who need learning support in mathematics, and developing teacher expertise in teaching math underachievers.

Spatial analysis based on topic modeling using foreign tourist review data: Case of Daegu (외국인 관광객 리뷰데이터를 활용한 토픽모델링 기반의 공간분석: 대구광역시를 사례로)

  • Jung, Ji-Woo;Kim, Seo-Yun;Kim, Hyeon-Yu;Yoon, Ju-Hyeok;Jang, Won-Jun;Kim, Keun-Wook
    • Journal of Digital Convergence
    • /
    • v.19 no.8
    • /
    • pp.33-42
    • /
    • 2021
  • As smartphone-based tourism platforms have become active, policy establishment and service enhancement using review data are being made in various fields. In the case of the preceding studies using tourism review data, most of the studies centered on domestic tourists were conducted, and in the case of foreign tourist studies, studies were conducted only on data collected in some languages and text mining techniques. In this study, 3,515 review data written by foreigners were collected by designating the "Daegu attractions" keyword through the online review site. And LDA-based topic modeling was performed to derive tourism topics. The spatial approach through global and local spatial autocorrelation analysis for each topic can be said to be different from previous studies. As a result of the analysis, it was confirmed that there is a global spatial autocorrelation, and that tourist destinations mainly visited by foreigners are concentrated locally. In addition, hot spots have been drawn around Jung-gu in most of the topics. Based on the analysis results, it is expected to be used as a basic research for spatial analysis based on local government foreign tourism policy establishment and topic modeling. And The limitations of this study were also presented.

4차 산업혁명의 주요 이슈 분석

  • Jeon, Jeong-Hwan
    • Proceedings of the Korea Technology Innovation Society Conference
    • /
    • 2017.05a
    • /
    • pp.69-69
    • /
    • 2017
  • ${\Box}$ 연구목적: 4차 산업혁명의 주요 이슈 분석 ${\bullet}$ 4차 산업혁명시대에 인공지능, 자율주행, 무인운송, 3D 프린터, 스마트팩토리..등 다양한 이슈가 등장 ${\bullet}$ 어떠한 이슈들이 있는지 분석하고자 함 ${\Box}$ 연구방법론: 빅데이터 분석기법 중에서 토픽 모델링을 활용 ${\Box}$ 연구데이터: 2013년1월부터 2017년3월까지 4차 산업혁명 관련 신문 기사 활용.

  • PDF

토픽모델링을 활용한 부산항 항만안전성 이슈 동향에 관한 연구

  • 이정민;하도연;김율성
    • Proceedings of the Korean Institute of Navigation and Port Research Conference
    • /
    • 2023.11a
    • /
    • pp.66-67
    • /
    • 2023
  • 최근 들어, 현대사회는 예측이 불가능한 다양한 위험성들이 존재하여 글로벌 의존도가 높은 항만물류산업의 위험부담이 증가하고 있다. 이에 본 연구에서는 항만산업의 안전성에 영향을 미치는 요인을 알아보기 위해 과거부터 현재까지 국내 항만 안전성에 영향을 미친 이슈들을 시계열적으로 살펴보고자 하였다. 이를 위하여 국내를 대표하는 부산항의 항만 안전성과 관련된 뉴스 기사 텍스트 데이터를 활용하여 LDA 토픽모델링 분석을 진행하여 부산항 항만안전 주요 이슈들의 동향을 살펴보고자 하였다.

  • PDF