• Title/Summary/Keyword: topic modeling analysis

Search Result 672, Processing Time 0.025 seconds

An Ontology-Based Labeling of Influential Topics Using Topic Network Analysis

  • Kim, Hyon Hee;Rhee, Hey Young
    • Journal of Information Processing Systems
    • /
    • v.15 no.5
    • /
    • pp.1096-1107
    • /
    • 2019
  • In this paper, we present an ontology-based approach to labeling influential topics of scientific articles. First, to look for influential topics from scientific article, topic modeling is performed, and then social network analysis is applied to the selected topic models. Abstracts of research papers related to data mining published over the 20 years from 1995 to 2015 are collected and analyzed in this research. Second, to interpret and to explain selected influential topics, the UniDM ontology is constructed from Wikipedia and serves as concept hierarchies of topic models. Our experimental results show that the subjects of data management and queries are identified in the most interrelated topic among other topics, which is followed by that of recommender systems and text mining. Also, the subjects of recommender systems and context-aware systems belong to the most influential topic, and the subject of k-nearest neighbor classifier belongs to the closest topic to other topics. The proposed framework provides a general model for interpreting topics in topic models, which plays an important role in overcoming ambiguous and arbitrary interpretation of topics in topic modeling.

Research Trends on Doctor's Job Competencies in Korea Using Text Network Analysis (텍스트네트워크 분석을 활용한 국내 의사 직무역량 연구동향 분석)

  • Kim, Young Jon;Lee, Jea Woog;Yune, So Jung
    • Korean Medical Education Review
    • /
    • v.24 no.2
    • /
    • pp.93-102
    • /
    • 2022
  • We use the concept of the "doctor's role" as a guideline for developing medical education programs for medical students, residents, and doctors. Therefore, we should regularly reflect on the times and social needs to develop a clear sense of that role. The objective of the present study was to understand the knowledge structure related to doctor's job competencies in Korea. We analyzed research trends related to doctor's job competencies in Korea Citation Index journals using text network analysis through an integrative approach focusing on identifying social issues. We finally selected 1,354 research papers related to doctor's job competencies from 2011 to 2020, and we analyzed 2,627 words through data pre-processing with the NetMiner ver. 4.2 program (Cyram Inc., Seongnam, Korea). We conducted keyword centrality analysis, topic modeling, frequency analysis, and linear regression analysis using NetMiner ver. 4.2 (Cyram Inc.) and IBM SPSS ver. 23.0 (IBM Corp., Armonk, NY, USA). As a result of the study, words such as "family," "revision," and "rejection" appeared frequently. In topic modeling, we extracted five potential topics: "topic 1: Life and death in medical situations," "topic 2: Medical practice under the Medical Act," "topic 3: Medical malpractice and litigation," "topic 4: Medical professionalism," and "topic 5: Competency development education for medical students." Although there were no statistically significant changes in the research trends for each topic over time, it is nonetheless known that social changes could affect the demand for doctor's job competencies.

Seasonal analysis of Beach-related Issues using Local Newspaper Articles and Topic Modeling (지역신문기사 자료와 토픽모델링을 이용한 해변 관련 계절별 현안분석)

  • Yoo, Mu-Sang;Jeong, Su-Yeon;Kim, Geon-Hu;Sohn, Chul
    • Journal of the Korean Regional Science Association
    • /
    • v.34 no.4
    • /
    • pp.19-34
    • /
    • 2018
  • The purpose of this study is to analyze the seasonal issues using the local newspaper articles with the keyword beach from 2004 to 2017. Topic modeling and Time series regression analysis based on open source programs were performed for analysis. Topic modeling results showed 35 topics in spring, 47 topics in summer, 36 topics in autumn and 35 topics in winter. The common themes were 'beaches', 'festivals and events', 'accident and environmental issues', 'tourism', 'development and sale', 'administration and policy' and 'weather'. Time series regression analysis showed in the spring, 5 Hot-Topics and 2 Cold-Topic were found out of the 35 topics. In the summer, 6 Hot-Topics and 3 Cold-Topic were found out of the 47 topics. In the autumn, 4 Hot-Topics and 3 Cold-Topic were found out of the 36 topics. In the winter, 3 Hot-Topics and 3 Cold-Topic were found out of the 35 topics. And for each season, topics that do not fall into the Hot-Topic and Cold-Topic are classified as Neutral-Topic. In this study if seasonal uses are different such as beaches are deemed that seasonal topic modeling for analysis of regional issues will yield more useful results and enable detailed diagnosis.

Analysis of Descriptive Lecture Evaluation on Liberal Arts ICT utilization using Topic Modeling (토픽 모델링을 활용한 교양 ICT 활용과정 서술형 강의평가 분석)

  • Kim, HyoSook
    • Journal of Platform Technology
    • /
    • v.8 no.1
    • /
    • pp.33-40
    • /
    • 2020
  • The purpose of this study is to identify factors in selecting the elective ICT utilization lecture and to find positive and negative elements of the lecture through conducting topic modeling analysis of text mining of the narrative lecture evaluation. In order to do so, from pre-processing of data, keyword frequency analysis to wordcloud visualization and topic modeling analysis have been conducted from 'reasons of selecting the lecture,' 'improvements to be made on the lecture,' and 'what I liked about the lecture' categories regarding the ICT utilization lecture which was opened in the second semester of 2019 at M University. The analysis results show that students mostly registered for the ICT utilization lecture at M University to obtain a certificate and the fact being certified and taking the lecture can be done simultaneously is a positive element of taking the lecture. On the other hand, negative element included inconvenience of the classroom setting environment.

  • PDF

Analysis on Topic Trends and Topic Modeling of KSHSM Journal Papers using Text Mining (텍스트마이닝을 활용한 보건의료산업학회지의 토픽 모델링 및 토픽트렌드 분석)

  • Cho, Kyoung-Won;Bae, Sung-Kwon;Woo, Young-Woon
    • The Korean Journal of Health Service Management
    • /
    • v.11 no.4
    • /
    • pp.213-224
    • /
    • 2017
  • Objectives : The purpose of this study was to analyze representative topics and topic trends of papers in Korean Society and Health Service Management(KSHSM) Journal. Methods : We collected English abstracts and key words of 516 papers in KSHSM Journal from 2007 to 2017. We utilized Python web scraping programs for collecting the papers from Korea Citation Index web site, and RStudio software for topic analysis based on latent Dirichlet allocation algorithm. Results : 9 topics were decided as the best number of topics by perplexity analysis and the resultant 9 topics for all the papers were extracted using Gibbs sampling method. We could refine 9 topics to 5 topics by deep consideration of meanings of each topics and analysis of intertopic distance map. In topic trends analysis from 2007 to 2017, we could verify 'Health Management' and 'Hospital Service' were two representative topics, and 'Hospital Service' was prevalent topic by 2011, but the ratio of the two topics became to be similar from 2012. Conclusions : We discovered 5 topics were the best number of topics and the topic trends reflected the main issues of KSHSM Journal, such as name revision of the society in 2012.

Analysis of Consulting Research Trends Using Topic Modeling (토픽 모델링을 활용한 컨설팅 연구동향 분석)

  • Kim, Min Kwan;Lee, Yong;Han, Chang Hee
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.40 no.4
    • /
    • pp.46-54
    • /
    • 2017
  • 'Consulting', which is the main research topic of the knowledge service industry, is a field of study that is essential for the growth and development of companies and proliferation to specialized fields. However, it is difficult to grasp the current status of international research related to consulting, mainly on which topics are being studied, and what are the latest research topics. The purpose of this study is to analyze the research trends of academic research related to 'consulting' by applying quantitative analysis such as topic modeling and statistic analysis. In this study, we collected statistical data related to consulting in the Scopus DB of Elsevier, which is a representative academic database, and conducted a quantitative analysis on 15,888 documents. We scientifically analyzed the research trends related to consulting based on the bibliographic data of academic research published all over the world. Specifically, the trends of the number of articles published in the major countries including Korea, the author key word trend, and the research topic trend were compared by country and year. This study is significant in that it presents the result of quantitative analysis based on bibliographic data in the academic DB in order to scientifically analyze the trend of academic research related to consulting. Especially, it is meaningful that the traditional frequency-based quantitative bibliographic analysis method and the text mining (topic modeling) technique are used together and analyzed. The results of this study can be used as a tool to guide the direction of research in consulting field. It is expected that it will help to predict the promising field, changes and trends of consulting industry related research through the trend analysis.

Research Trend Analysis for Smart Grids Using Dynamic Topic Modeling (동적 토픽분석을 활용한 스마트그리드 연구동향 분석)

  • Na, Sang-Tae;Ahn, Joo-Eon;Jung, Min-Ho;Kim, Ja-Hee
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.66 no.4
    • /
    • pp.613-620
    • /
    • 2017
  • The power grid has been changed to a smart grid system to satisfy the growing need for power grid complexity, demand, reliability, security, and efficiency with a combination of existing power and ICT technology. This study analyzes the research trends in smart grid technology in the period since the introduction of the smart grid system and compares it with industrial trends to grasp the progress and characteristics of Smart Grid technology and look for ways to innovate the technology. To do this, we analyze the research trends using dynamic topic modeling, which is capable of time-series research topic analysis. Next, we compare the results of research trends with industrial trends analyzed by Gartner's experts to demonstrate that smart grid research is evolving to the level of industrialization. The results of this study are quantitative analysis through data mining, and it is expected that it will be used in many fields such as companies that want to participate in industry and government agencies that need to establish policies by showing more objective analysis results.

An analysis of indoor environment research trends in Korea using topic modeling : Case study on abstracts from the journal of the Korean society for indoor environment (토픽모델링을 활용한 실내환경 분야 연구동향 파악 : 실내환경학회지 초록 사례연구)

  • Jeon, Hyung Jin;Kim, Do Youn;Han, Kook Jin;Kim, Dong Woo;Son, Seung Woo;Lee, Cheol Min
    • Journal of odor and indoor environment
    • /
    • v.17 no.4
    • /
    • pp.322-329
    • /
    • 2018
  • The objective of this study is to identify the research trend in the field of indoor environment in Korea. We collected 419 papers published in the Journal of the Korean Society for indoor environment between 2004 and 2018, and attempted to produce datasets using a topic modeling technique, Latent Dirichlet Allocation(LDA). The result of topic modeling showed that 8 topics ("VOCs investigation", "Subway environment", "Building thermal environment", "School health", "Building particulate matter", "Asbestos risk", "Radon risk", "Air cleaner and treatment") could be extracted using Gibbs sampling method. In terms of topic trends, investigation of volatile organic compounds, subway environment, school health, and building particulate matter showed a decreasing tendency, while the building thermal environment, asbestos risk, radon risk, air cleaners, and air treatment showed an increasing tendency. The results of this topic modeling could help us to understand current trends related indoor environment, and provide valuable information in developing future research and policy frameworks.

Cancer Research Trends in Traditional Korean Medical Journals since 2000 - Topic Modeling Using Latent Dirichlet Allocation and Keyword Network Analysis (2000년 이후 국내 한의학 암 관련 연구 동향 분석 - Latent Dirichlet Allocation 기반 토픽 모델링 및 연관어 네트워크 분석)

  • Kyeore Bae
    • The Journal of Internal Korean Medicine
    • /
    • v.43 no.6
    • /
    • pp.1075-1088
    • /
    • 2022
  • Objectives: The aim of this study is to analyze cancer research trends in traditional Korean medical journals indexed in the Korea Citation Index since 2000. Methods: Cancer research papers published in traditional Korean medical journals were searched in databases from inception to October 2022. The numbers of publications by journal and by year were descriptively assessed. After natural language processing, topic modeling (based on Latent Dirichlet allocation) and keyword network analysis were conducted. Results: This research trend analysis involved 1,265 papers. Six topics were identified by topic modeling: case reports on symptom management, literature reviews, experiments on apoptosis, herbal extract treatments of breast carcinoma cell lines, anti-proliferative effects of herbal extracts, and anti-tumor effects. Keyword network analysis found that the effects of herbal medicine were assessed in clinical and experimental studies, while acupuncture was mainly mentioned in clinical reports. Conclusions: Cancer research papers in traditional Korean medical journals have contributed to evidence-based medicine. Further experimental studies are needed to elucidate the effects of on different hallmarks of cancer. Rigorous clinical studies are needed to support clinical guidelines.

Overseas Research Trends Related to 'Research Ethics' Using LDA Topic Modeling

  • YANG, Woo-Ryeong;YANG, Hoe-Chang
    • Journal of Research and Publication Ethics
    • /
    • v.3 no.1
    • /
    • pp.7-11
    • /
    • 2022
  • Purpose: The purpose of this study is to derive clues about the development direction of research ethics and areas of interest which has recently become a social issue in Korea by confirming overseas research trends. Research design, data and methodology: We collected 2,760 articles in scienceON, which including 'research ethics' in their paper. For analysis, frequency analysis, word clouding, keyword association analysis, and LDA topic modeling were used. Results: It was confirmed that many of the papers were published in medical, bio, pharmaceutical, and nursing journals and its interest has been continuously increasing. From word frequency analysis, many words of medical fields such as health, clinical, and patient was confirmed. From topic modeling, 7 topics were extracted such as ethical policy development and human clinical ethics. Conclusions: We founded that overseas research trends on research ethics are related to basic aspects than Korea. This means that a fundamental approach to ethics and the application of strict standards can become the basis for cultivating an overall ethical awareness. Therefore, academic discussions on the application of strict standards for publishing ethics and conducting researches in various fields where community awareness and social consensus are necessary for overall ethical awareness.