• Title/Summary/Keyword: LDA Topic Model

Search results: 109

The Research Features Analysis of Leisure and Recreation based on Co-authors Network and Topic Model (공저자 네트워크 및 토픽 모델링 기반 여가레크리에이션 학술 연구 특징 분석)

  • Park, SungGeon;Park, Kwang-Won;Kang, Hyun-Wook
    • 한국체육학회지인문사회과학편 / v.57 no.2 / pp.279-289 / 2018
  • The purpose of this study is to investigate the features of leisure and recreation scholarship in The Korean Journal of Physical Education, based on co-author network analysis and topic modeling using word clouds and Latent Dirichlet Allocation (LDA). The data collected for this study are 2,697 papers published online in The Korean Journal of Physical Education from January 2008 to March 2017. To explore research trends, the 369 related documents were analyzed with authors ordered as major author, corresponding author, co-author 1, co-author 2, and so on through co-author n. The co-author network analysis found that 451 researchers were linked in the research network; on average, researchers had 1.52 relationships, and the average distance between researchers was 2.33. Degree centrality was highest, in order, for Lee. K. M., Hwang. S. H., H., Lee. C. S., and closeness centrality was highest for Seo K. B., Han. J. H., Kim. K. J. Finally, betweenness centrality was highest, in order, for Lee. C. W. and Seo. K. B., who most actively connected the researchers of leisure-related academic papers. Future research needs discussion among scholars regarding the trends and direction of leisure research.

Topic Model Analysis of Research Themes and Trends in the Journal of Economic and Environmental Geology (기계학습 기반 토픽모델링을 이용한 학술지 "자원환경지질"의 연구주제 분류 및 연구동향 분석)

  • Kim, Taeyong;Park, Hyemin;Heo, Junyong;Yang, Minjune
    • Economic and Environmental Geology / v.54 no.3 / pp.353-364 / 2021
  • Since the mid-twentieth century, geology in South Korea has gradually evolved into an interdisciplinary field. The journal Economic and Environmental Geology (EEG) has a history of over 52 years and has published interdisciplinary articles based on geology. In this study, we performed a literature review using topic modeling based on Latent Dirichlet Allocation (LDA), an unsupervised machine learning model, to identify geological topics, historical trends (classic and emerging topics), and associations among topics by analyzing the titles, keywords, and abstracts of 2,571 publications in EEG from 1968 to 2020. The results identified 8 topics in EEG: 'petrology and geochemistry', 'hydrology and hydrogeology', 'economic geology', 'volcanology', 'soil contaminant and remediation', 'general and structural geology', 'geophysics and geophysical exploration', and 'clay mineral'. Before 1994, the classic topics ('economic geology', 'volcanology', and 'general and structural geology') dominated research trends. After 1994, emerging topics ('hydrology and hydrogeology', 'soil contaminant and remediation', and 'clay mineral') arose, and their share has gradually increased. The association analysis showed that EEG tends to be more comprehensive, centered on 'economic geology'. Our results show how geological research topics branch out and merge with other fields, and demonstrate a useful literature review tool for geological research in South Korea.

Comparative Study of Information Literacy Education and Librarian Teacher Evaluation Index in Teachers' Competency Development Evaluation (정보활용교육 주요 토픽과 교원능력개발평가 사서교사 평가지표 비교 연구)

  • Lee, Min-Soo;Kim, Hea-Jin
    • Journal of Korean Library and Information Science Society / v.53 no.3 / pp.455-477 / 2022
  • This study aimed to compare the librarian teacher evaluation index in the teachers' competency development evaluation with the topics of information utilization education. To this end, LDA topic modeling was conducted on papers related to information utilization education published in four major library and information science journals from 1995 to May 2022. The topic modeling showed that, of the 20 topics, information utilization education (T10) was the most actively discussed at 12.0%, followed by library utilization classes (T2) at 10.4% and user service (T3) at 8.8%. On the other hand, reading discussion (T7), reading education (T19), manpower management (T13), and librarian teacher job satisfaction (T17) showed the lowest distributions, at 3.3%, 2.9%, 2.1%, and 2.1%, respectively. In addition, although librarian teachers' class model development (T1) and curriculum development (T20) are essential processes for collaborative classes and information utilization education, they were not reflected in the current teacher competency development evaluation index. Therefore, this study proposes that an 'instructional model and curriculum development' indicator be added to the 'training and support classes' factor of the Librarian Teacher Evaluation Index in the Teachers' Competency Development Evaluation.

An Analysis of Civil Complaints about Traffic Policing Using the LDA Model (토픽모델링을 활용한 교통경찰 민원 분석)

  • Lee, Sangyub
    • The Journal of The Korea Institute of Intelligent Transport Systems / v.20 no.4 / pp.57-70 / 2021
  • This study aims to investigate the demand for traffic policing by analyzing civil complaints. Latent Dirichlet Allocation (LDA) was applied to extract key topics from 2,062 civil complaints related to traffic policing collected from e-People. An additional analysis was made of reports of violations, which accounted for a high proportion of the complaints. In this process, the consistency and convergence of keywords and representative documents were considered together. The analysis classified complaints related to traffic police into 41 topics, including traffic safety facilities, passing through intersections (signals), provisional impoundment of vehicle plates, and personal mobility. It is necessary to strengthen crackdowns on violations at intersections and violations by motorcycles, and to take preemptive measures for the installation and operation of unmanned traffic control equipment, crosswalks, and traffic lights. In addition, it is necessary to publicize the recently amended laws and implemented policies, e-fines, and procedures after a crackdown.

Technology Development Strategy of Piggyback Transportation System Using Topic Modeling Based on LDA Algorithm

  • Jun, Sung-Chan;Han, Seong-Ho;Kim, Sang-Baek
    • Journal of the Korea Society of Computer and Information / v.25 no.12 / pp.261-270 / 2020
  • In this study, we identify promising technologies for the Piggyback transportation system by analyzing the relevant patent information. To do this, we first develop a patent database by extracting relevant technology keywords from the pioneering research papers on the Piggyback flatcar system. We then employ text mining to identify frequently occurring words in the patent database and, using these words, apply the LDA (Latent Dirichlet Allocation) algorithm to identify "topics" that correspond to "key" technologies for the Piggyback system. Finally, we employ the ARIMA model to forecast the trends of these "key" technologies and identify the promising technologies for the Piggyback system, using the keyword search method together with the patent analysis. The results show that data-driven integrated management systems, operation planning systems, and special cargo (especially fluid and gas) handling/storage technologies are the "key" promising technologies for the future of the Piggyback system, and that data reception/analysis techniques must be developed to improve system performance. The proposed procedure and analysis method provide useful insights for developing the R&D strategy and technology roadmap for the Piggyback system.

Detection of Complaints of Non-Face-to-Face Work before and during COVID-19 by Using Topic Modeling and Sentiment Analysis (동적 토픽 모델링과 감성 분석을 이용한 COVID-19 구간별 비대면 근무 부정요인 검출에 관한 연구)

  • Lee, Sun Min;Chun, Se Jin;Park, Sang Un;Lee, Tae Wook;Kim, Woo Ju
    • The Journal of Information Systems / v.30 no.4 / pp.277-301 / 2021
  • Purpose: The purpose of this study is to analyze the sentiment responses of the general public to non-face-to-face work using text mining methodology. As the number of complaints about non-face-to-face work increases over time, they are difficult to review and analyze with traditional methods such as surveys, which are also limited in reflecting real-time issues. Approach: This study proposes a research model in three steps. First, data related to non-face-to-face work are collected from tweets posted on Twitter and cleansed. Second, topics and keywords are extracted from the tweets using LDA (Latent Dirichlet Allocation), a topic modeling technique, and changes in each period are analyzed through DTM (Dynamic Topic Modeling). Third, complaints about non-face-to-face work are analyzed by classifying positive and negative polarity in each COVID-19 period. Findings: An analysis of 1.54 million tweets related to non-face-to-face work showed that, after COVID-19, the number of IDs using words related to non-face-to-face work increased 7.2 times and the number of tweets increased 4.8 times. The most frequently used words related to non-face-to-face work were, in order, remote jobs, cybersecurity, technical jobs, productivity, and software. Words that increased after COVID-19 reflected concerns about lockdown and dismissal, as well as business transformation, securing business continuity, and the virtual workplace. "New Normal" was newly mentioned as a new standard. The sentiment analysis found that negative opinions increased from 34% to 43% in the early stages of COVID-19 and then stabilized at 36%. The complaints pointed to policies such as strengthening cybersecurity, activating communication to improve work productivity, and diversifying work spaces.

Accessibility Analysis Method based on Public Facility Attraction Index Using SNS Data (SNS 데이터를 이용한 공공시설 매력도지수에 따른 접근성 분석기법)

  • Lee, Ji Won;Yu, Ki Yun;Kim, Ji Young
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography / v.37 no.1 / pp.29-42 / 2019
  • To expand the qualitative evaluation of public facilities, this study used SNS data to derive user-oriented preference factors for public facilities and then quantified them from the supply side and the demand side. To derive the preference factors, LDA, a topic modeling method, was used, and an attraction index was calculated for each facility. In addition, we analyzed spatial accessibility using the two-step floating catchment area (2SFCA) model to measure the degree of service experienced by users. The study area covered the public libraries of Seoul, Korea. Five topics were extracted as preference factors for public libraries: Circumstance, Scale of facility, Cultural program, Parenting, and Books and materials. In particular, circumstance and parenting were newly derived preference factors not identified in previous studies. The attraction index was high for Songpa Library, Jungdok Library, and Namsan Library. Songpa Library was rated well on the parenting factor, and Jungdok Library and Namsan Library on the circumstance factor. Accessibility appears to be better in the center of Seoul, where public libraries are concentrated, and decreases toward the outskirts. We expect that the proposed method will contribute to user-oriented public facility evaluation and policy decision making.
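
The two-step floating catchment area calculation mentioned in the abstract above can be sketched as follows. This is a minimal illustration of the generic 2SFCA method, not the authors' implementation: the function name `two_step_fca`, the toy facilities and areas, the distances, and the use of a plain capacity value as supply (the study weights facilities by an attraction index) are all assumptions made for the example.

```python
def two_step_fca(supply, demand, dist, threshold):
    """Generic 2SFCA accessibility score for each demand area.

    supply:    {facility: capacity}   (hypothetical values)
    demand:    {area: population}     (hypothetical values)
    dist:      {(area, facility): distance}
    threshold: catchment radius, in the same unit as dist
    """
    # Step 1: supply-to-demand ratio of each facility, counting only
    # the population that falls inside the facility's catchment.
    ratios = {}
    for facility, capacity in supply.items():
        population = sum(
            p for area, p in demand.items()
            if dist[(area, facility)] <= threshold
        )
        ratios[facility] = capacity / population if population > 0 else 0.0

    # Step 2: each area sums the ratios of the facilities it can reach.
    return {
        area: sum(
            r for facility, r in ratios.items()
            if dist[(area, facility)] <= threshold
        )
        for area in demand
    }


# Toy data: two libraries (A, B) and three residential areas (x, y, z).
supply = {"A": 10, "B": 20}
demand = {"x": 100, "y": 100, "z": 200}
dist = {
    ("x", "A"): 1, ("y", "A"): 2, ("z", "A"): 5,
    ("x", "B"): 6, ("y", "B"): 2, ("z", "B"): 1,
}
access = two_step_fca(supply, demand, dist, threshold=3)
```

In this toy case area y lies within the catchment of both libraries, so its score is the sum of both supply-to-demand ratios, while x and z each reach only one library.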

Exploring Issues Related to the Metaverse from the Educational Perspective Using Text Mining Techniques - Focusing on News Big Data (텍스트마이닝 기법을 활용한 교육관점에서의 메타버스 관련 이슈 탐색 - 뉴스 빅데이터를 중심으로)

  • Park, Ju-Yeon;Jeong, Do-Heon
    • Journal of Industrial Convergence / v.20 no.6 / pp.27-35 / 2022
  • The purpose of this study is to analyze metaverse-related issues in news big data from an educational perspective, explore their characteristics, and provide implications for the educational applicability of the metaverse and for future education. To this end, 41,366 metaverse-related items retrieved from portal sites were collected, the weights of all extracted keywords were calculated and ranked using TF-IDF, a representative term-weighting model, and word cloud visualization was then performed. In addition, major topics were analyzed using LDA topic modeling, a probability-based text mining technique. Topics such as platform industry, future talent, and extension in technology were derived as core issues of the metaverse from an educational perspective. In addition, a secondary analysis under three key themes (technology, job, and education) found that the metaverse raises issues related to education platform innovation, future job innovation, and future competency innovation in future education. This study is meaningful in that it analyzes a vast amount of news big data in stages to draw out issues from an educational perspective and provide implications for future education.
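
The TF-IDF keyword ranking step described in the abstract above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' pipeline: the function name `tfidf_ranking`, the three toy documents, and the specific weighting (relative term frequency times the natural log of inverse document frequency, summed over documents) are choices made for the example.

```python
import math
from collections import Counter


def tfidf_ranking(docs):
    """Rank terms by their TF-IDF weight summed over tokenized documents."""
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for doc in docs for term in set(doc))
    scores = Counter()
    for doc in docs:
        if not doc:
            continue  # skip empty documents to avoid dividing by zero
        for term, count in Counter(doc).items():
            tf = count / len(doc)              # relative term frequency
            idf = math.log(n_docs / df[term])  # 0 for terms in every document
            scores[term] += tf * idf
    # Highest weight first; ties are broken alphabetically.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


# Hypothetical tokenized news items.
docs = [
    ["metaverse", "platform", "industry"],
    ["metaverse", "education", "platform"],
    ["metaverse", "future", "talent", "education"],
]
ranking = tfidf_ranking(docs)
```

Because "metaverse" occurs in every toy document, its inverse document frequency is zero and it falls to the bottom of the ranking, which is why a term-weighting model surfaces more distinctive keywords than raw counts would.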

News Article Analysis of the 4th Industrial Revolution and Advertising before and after COVID-19: Focusing on LDA and Word2vec (코로나 이전과 이후의 4차 산업혁명과 광고의 뉴스기사 분석 : LDA와 Word2vec을 중심으로)

  • Cha, Young-Ran
    • The Journal of the Korea Contents Association / v.21 no.9 / pp.149-163 / 2021
  • The 4th industrial revolution refers to the next-generation industrial revolution led by information and communication technologies such as artificial intelligence (AI), the Internet of Things (IoT), robot technology, drones, autonomous driving, and virtual reality (VR), and it has had a significant impact on the development of the advertising industry. However, the world is rapidly changing to a non-contact, non-face-to-face living environment to prevent the spread of COVID-19. Accordingly, the role of the 4th industrial revolution and advertising is changing. Therefore, in this study, text analysis was performed using Big Kinds to examine the 4th industrial revolution and changes in advertising before and after COVID-19. Comparisons were made between 2019, before COVID-19, and 2020, after COVID-19. Main topics and documents were classified through LDA topic model analysis and Word2vec, a deep learning technique. The results showed that before COVID-19, topics such as policies, contents, and AI appeared, but after COVID-19, the field gradually expanded to finance, advertising, and delivery services utilizing data. Further, education appeared as an important issue. In addition, whereas the use of advertising related to 4th industrial revolution technology was mainstream before COVID-19, afterward keywords such as participation, cooperation, and daily necessities were used more actively in connection with education on advanced technology, and talent cultivation appeared prominently. Thus, these results are meaningful in suggesting a multifaceted strategy that can be applied theoretically and practically, and in suggesting the future direction of advertising in the 4th industrial revolution after COVID-19.

Classifying Temporal Topics with Similar Patterns on Twitter

  • Yun, Hong-Won
    • Journal of information and communication convergence engineering / v.9 no.3 / pp.295-300 / 2011
  • Twitter is a popular microblogging service that enables users to send and read short text messages. These messages are becoming a source for analyzing topic trends and identifying relations among temporal topics. In this paper, we propose a method that treats the classification of temporal topics on Twitter as a problem of grouping similar patterns. To provide a starting point for classification under the same topics, we identify a content word weighting scheme based on Latent Dirichlet Allocation (LDA). We also formulate how the temporal topics in a time window can be classified as peaky topics, constant topics, or periodic topics. We provide several real case studies that show the validity of the proposed method. Evaluations show that the proposed method is useful as a classification model in the analysis of temporal topics.