• Title/Summary/Keyword: topic modeling analysis

Search Result 694, Processing Time 0.025 seconds

Deep Learning Research Trend Analysis using Text Mining

  • Lee, Jee Young
    • International Journal of Advanced Culture Technology
    • /
    • v.7 no.4
    • /
    • pp.295-301
    • /
    • 2019
  • Since the third artificial intelligence boom was triggered by deep learning, it has been 10 years. It is time to analyze and discuss the research trends of deep learning for the stable development of AI. In this regard, this study systematically analyzes the trends of research on deep learning over the past 10 years. We collected research literature on deep learning and performed LDA based topic modeling analysis. We analyzed trends by topic over 10 years. We have also identified differences among the major research countries, China, the United States, South Korea, and United Kingdom. The results of this study will provide insights into research direction on deep learning in the future, and provide implications for the stable development strategy of deep learning.

An Exploratory Study of Health Inequality Discourse Using Korean Newspaper Articles: A Topic Modeling Approach

  • Kim, Jin-Hwan
    • Journal of Preventive Medicine and Public Health
    • /
    • v.52 no.6
    • /
    • pp.384-392
    • /
    • 2019
  • Objectives: This study aimed to explore the health inequality discourse in the Korean press by analyzing newspaper articles using a relatively new content analysis technique. Methods: This study used the search term "health inequality" to collect articles containing that term that were published between 2000 and 2018. The collected articles went through pre-processing and topic modeling, and the contents and temporal trends of the extracted topics were analyzed. Results: A total of 1038 articles were identified, and 5 topics were extracted. As the number of studies on health inequality has increased over the past 2 decades, so too has the number of news articles regarding health inequality. The extracted topics were public health policies, social inequalities in health, inequality as a social problem, healthcare policies, and regional health gaps. The total number of occurrences of each topic increased every year, and the trend observed for each theme was influenced by events related to its contents, such as elections. Finally, the frequency of appearance of each topic differed depending on the type of news source. Conclusions: The results of this study can be used as preliminary data for future attempts to address health inequality in Korea. To make addressing health inequality part of the public agenda, the media's perspective and discourse regarding health inequality should be monitored to facilitate further strategic action.

Topic Modeling of Korean Newspaper Articles on Aging via Latent Dirichlet Allocation

  • Lee, So Chung
    • Asian Journal for Public Opinion Research
    • /
    • v.10 no.1
    • /
    • pp.4-22
    • /
    • 2022
  • The purpose of this study is to explore the structure of social discourse on aging in Korea by analyzing newspaper articles on aging. The analysis is composed of three steps: first, data collection and preprocessing; second, identifying the latent topics; and third, observing yearly dynamics of topics. In total, 1,472 newspaper articles that included the word "aging" within the title were collected from 10 major newspapers between 2006 and 2019. The underlying topic structure was analyzed using Latent Dirichlet Allocation (LDA), a topic modeling method widely adopted by text mining academics and researchers. Seven latent topics were generated from the LDA model, defined as social issues, death, private insurance, economic growth, national debt, labor market innovation, and income security. The topic loadings demonstrated a clear increase in public interest on topics such as national debt and labor market innovation in recent years. This study concludes that media discourse on aging has shifted towards more productivity and efficiency related issues, requiring older people to be productive citizens. Such subjectivation connotes a decreased role of the government and society by shifting the responsibility to individuals not being able to adapt successfully as productive citizens within the labor market.

A Study on the Research Trends in Int'l Trade Using Topic modeling (토픽모델링을 활용한 무역분야 연구동향 분석)

  • Jee-Hoon Lee;Jung-Suk Kim
    • Korea Trade Review
    • /
    • v.45 no.3
    • /
    • pp.55-69
    • /
    • 2020
  • This study examines the research trends and knowledge structure of international trade studies using topic modeling method, which is one of the main methodologies of text mining. We collected and analyzed English abstracts of 1,868 papers of three Korean major journals in the area of international trade from 2003 to 2019. We used the Latent Dirichlet Allocation(LDA), an unsupervised machine learning algorithm to extract the latent topics from the large quantity of research abstracts. 20 topics are identified without any prior human judgement. The topics reveal topographical maps of research in international trade and are representative and meaningful in the sense that most of them correspond to previously established sub-topics in trade studies. Then we conducted a regression analysis on the document-topic distributions generated by LDA to identify hot and cold topics. We discovered 2 hot topics(internationalization capacity and performance of export companies, economic effect of trade) and 2 cold topics(exchange rate and current account, trade finance). Trade studies are characterized as a interdisciplinary study of three agendas(i.e. international economy, International Business, trade practice), and 20 topics identified can be grouped into these 3 agendas. From the estimated results of the study, we find that the Korean government's active pursuit of FTA and consequent necessity of capacity building in Korean export firms lie behind the popularity of topic selection by the Korean researchers in the area of int'l trade.

Analysis of Research Trends in Elementary Information Education in Korea using Topic Modeling (토픽 모델링을 활용한 국내 초등 정보교육 연구동향 분석)

  • Shim, Jaekwoun
    • Journal of The Korean Association of Information Education
    • /
    • v.25 no.2
    • /
    • pp.347-354
    • /
    • 2021
  • As interest in artificial intelligence education for elementary school students has recently increased, it is necessary to analyze the existing elementary information education research from a macroscopic point of view to understand the current situation and to provide implications for subsequent research. This study analyzed Journal of The Korean Association of Information Education for the purpose of looking at the research trend of elementary information education in Korea. For the data of the study, all papers published until 2020 in the first issue of the journal were selected, and 11 research topics were derived by modeling topics. As a result of the study, topic T1, the highest proportion, was analyzed to account for about 38%, and keywords such as education, research, analysis, elementary school, and information were derived according to the order of contribution to topic T1. As a result of regression analysis according to the year of the topic, it was found that the research trend is changing to computing thinking, software education, and artificial intelligence education. The significance of this study is that text data related to elementary information education is objectively clustered.

How Are the Direction and the Intensity of Indirect Social Information such as Likes and Dislikes Related to the Deliberative Quality of Online News Content Comments? A Topic Diversity Analysis Using Topic Modeling ('좋아요'와 '싫어요'같은 간접적 사회적 정보의 방향과 강도는 온라인 뉴스 콘텐츠 댓글의 숙의의 질과 어떤 관련이 있는가? 토픽 모델링을 이용한 토픽 다양성 분석)

  • Min, Jin Young;Lee, Ae Ri
    • The Journal of Information Systems
    • /
    • v.30 no.4
    • /
    • pp.303-327
    • /
    • 2021
  • Purpose The online comments on news content have become social information and are understood based on deliberative democracy. Although the related research has focused on the relationship between online comments and their deliberative quality, the social information provided by online comments consists of not only direct information such as comments themselves but also indirect information such as 'likes' and 'dislikes'. Therefore, the research on online comments and deliberative quality should study this direct and indirect information together, and the direction and the degree of the indirect information should be also considered with them. Design/methodology/approach This study distinguishes comments by the attached 'likes' and 'dislikes', identifies highly supported and highly unsupported comments by the intensity of 'likes' and 'dislikes', and investigates the relationship between their existence and the deliberative quality measured as the topic diversity. Then, we applied topic modeling to the 2,390 news articles and their 74,385 comments collected from five news sites. Findings The topic diversities of the supported and unsupported comments are related to the topic diversity of all comments but the degree of the relationship is higher in the case of supported comments. Furthermore, the existence of highly supported and unsupported comments is led to less diversity of all comments compared to the case where those comments are absent. Particularly, when only highly supported comments are present, topic diversity was lower than in the opposite case.

Active Senior Contents Trend Analysis using LDA Topic Modeling (LDA 토픽 모델링을 이용한 액티브 시니어 콘텐츠 트렌드 분석)

  • Lee, Dongwoo;Kim, Yoosin;Shin, Eunjung
    • Journal of Internet Computing and Services
    • /
    • v.22 no.5
    • /
    • pp.35-45
    • /
    • 2021
  • The purpose of this study is to understand the characteristics and trends of active senior. As the baby boom generation become the age of the elderly, they are more active than senior. These seniors are called active seniors, a new consumer group. Many countries and companies are also interested in providing relevant policies and services, but there is lack of researches on active senior trends. This study collects the 8,740 posts related to active seniors on social media from January 1st, 2018 to June 31st, 2021, and conducted keyword frequency analysis, TF-IDF analysis and LDA topic modeling. Through LDA topic modeling, topics are classified into 10 categories: lifestyle, benefits, shopping, government business, government education, health, society and economy, care industry, silver housing, leisure. The results of this study can be utilized as fundamental data to help understand the academic and industrial aspects of active senior.

Detection of Complaints of Non-Face-to-Face Work before and during COVID-19 by Using Topic Modeling and Sentiment Analysis (동적 토픽 모델링과 감성 분석을 이용한 COVID-19 구간별 비대면 근무 부정요인 검출에 관한 연구)

  • Lee, Sun Min;Chun, Se Jin;Park, Sang Un;Lee, Tae Wook;Kim, Woo Ju
    • The Journal of Information Systems
    • /
    • v.30 no.4
    • /
    • pp.277-301
    • /
    • 2021
  • Purpose The purpose of this study is to analyze the sentiment responses of the general public to non-face-to-face work using text mining methodology. As the number of non-face-to-face complaints is increasing over time, it is difficult to review and analyze in traditional methods such as surveys, and there is a limit to reflect real-time issues. Approach This study has proposed a method of the research model, first by collecting and cleansing the data related to non-face-to-face work among tweets posted on Twitter. Second, topics and keywords are extracted from tweets using LDA(Latent Dirichlet Allocation), a topic modeling technique, and changes for each section are analyzed through DTM(Dynamic Topic Modeling). Third, the complaints of non-face-to-face work are analyzed through the classification of positive and negative polarity in the COVID-19 section. Findings As a result of analyzing 1.54 million tweets related to non-face-to-face work, the number of IDs using non-face-to-face work-related words increased 7.2 times and the number of tweets increased 4.8 times after COVID-19. The top frequently used words related to non-face-to-face work appeared in the order of remote jobs, cybersecurity, technical jobs, productivity, and software. The words that have increased after the COVID-19 were concerned about lockdown and dismissal, and business transformation and also mentioned as to secure business continuity and virtual workplace. New Normal was newly mentioned as a new standard. Negative opinions found to be increased in the early stages of COVID-19 from 34% to 43%, and then stabilized again to 36% through non-face-to-face work sentiment analysis. The complaints were, policies such as strengthening cybersecurity, activating communication to improve work productivity, and diversifying work spaces.

A Study on Children's Images during the Liberation Period Using Topic Modeling: With a focus on The Children's News (토픽 모델링을 이용한 해방기 아동상 연구 - 「어린이신문」을 중심으로 -)

  • Jang, Seok-Eun;Lee, Hye-Eun
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.33 no.3
    • /
    • pp.157-178
    • /
    • 2022
  • This study explores children's images in The Children's News, a children's newspaper during the Liberation period. For this purpose, frequency analysis, topic modeling, and time series analysis were performed from the first issue of December 1, 1945 to the 86 issue of December 13, 1947, except for No. 34, which was not passed down. As a result of frequency analysis, keywords related to country, school, and family appeared frequently, and through topic modeling, children's images were observed in these topics, including children with patriotism, children with scientific literacy, children with artistic refinement, and children as social beings. The time series analysis results show that the percentage of patriotism-related topics was high during the early days of the Liberation period when The Children's News were published, but as the ratio of topics such as science and art gradually increased, it was confirmed that the image of children was diversified.

An Analysis of Research Trends on Basic Academic Abilities in Mathematics with Frequency Analysis and Topic Modeling (빈도 분석 및 토픽모델링을 활용한 수학 교과에서 기초학력 관련 연구 동향 분석)

  • Cho, Mi Kyung
    • Communications of Mathematical Education
    • /
    • v.37 no.4
    • /
    • pp.615-633
    • /
    • 2023
  • This study analyzed Korean studies up to August 2023 to suggest the direction of future research on basic academic abilities in mathematics. For this purpose, frequency analysis and LDA-based topic modeling were conducted on the Korean abstracts of 197 domestic studies. The results showed that, first, 'academic achievement', 'impact', 'effect', and 'factors' were all ranked at the top of the TFs and TF-IDFs. Second, as a result of LDA-based topic modeling, five topics were identified: causes of basic academic abilities deficiency, learning status of math underachievers, teacher expertise in teaching math underachievers, supporting programs for math underachievers, and results of National Assessment of Educational Achievement. As a direction for future research, this study suggests focusing on the growth of math underachievers, systematizing the programs provided to students who need learning support in mathematics, and developing teacher expertise in teaching math underachievers.