• Title/Summary/Keyword: 워드 클라우드 분석

Search Result 121, Processing Time 0.028 seconds

A Study on Trend Analysis in Convergence Research Applying Word Cloud in Korea (워드 클라우드 기법을 이용한 국내 융복합 학술연구 트렌드 분석)

  • Kim, Joon-Hwan;Mun, Hyung-Jin;Lee, Hang
    • Journal of Digital Convergence
    • /
    • v.19 no.2
    • /
    • pp.33-38
    • /
    • 2021
  • The convergence trend is the core of the 4th industrial revolution, and due to such expectations and possibilities, various countermeasures are being sought in diverse fields. This study conducted a quantitative analysis to identify the trend of convergence research over the past 10 years. Specifically, major research keywords were extracted, word cloud techniques were applied, and visualized to identify trends in academic research on convergence. To this end, research papers from 2012 to 2020 published in journal of digital convergence were investigated. The analysis period was divided into two periods: the former 4 years(2012-2015) and the latter 4 years(2016-2019) to confirm the difference in research trends. In addition, the research papers of 2020 were analyzed in order to more clearly understand the changes in the research trend of the last year due to the COVID-19. The results of this study are significant in that they can be used as useful basic data for future research and to understand research trends as keywords in the field of convergence.

Suggestion of development for domestic game market through big data analysis of global game trend (글로벌 게임 트렌드의 빅데이터 분석을 통한 국내 게임 시장의 발전 방향성 제시)

  • Song, Junhyup;Lim, Minwoo;Kim, Hansoo
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2022.07a
    • /
    • pp.161-164
    • /
    • 2022
  • 게임 산업은 기술의 발전과 비대면 서비스 수요 증가로 해마다 발전하고 있다. 본 연구는 사용자들의 수요를 조사하기 위하여 대중성이 가장 높은 온라인 게임 플랫폼에서 이용 시간이 많은 게임 정보를 확인하였다. HTML 파싱(parsing) 라이브러리를 통해 해당 게임들의 리뷰를 크롤링하여 엑셀 파일로 데이터베이스화하였고, 자연어 처리 라이브러리를 활용하여 데이터를 정제하였다. 총 5개 장르에 대하여 분석한 결과 각 장르에 해당하는 대표적인 키워드를 확인할 수 있었다. 취득한 키워드는 범용 시각화 패키지를 활용하여 워드 클라우드 형태로 한눈에 알아볼 수 있도록 시각화하였다.

  • PDF

Malware Analysis Mechanism using the Word Cloud based on API Statistics (API 통계 기반의 워드 클라우드를 이용한 악성코드 분석 기법)

  • Yu, Sung-Tae;Oh, Soo-Hyun
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.16 no.10
    • /
    • pp.7211-7218
    • /
    • 2015
  • Tens of thousands of malicious codes are generated on average in a day. New types of malicious codes are surging each year. Diverse methods are used to detect such codes including those based on signature, API flow, strings, etc. But most of them are limited in detecting new malicious codes due to bypass techniques. Therefore, a lot of researches have been performed for more efficient detection of malicious codes. Of them, visualization technique is one of the most actively researched areas these days. Since the method enables more intuitive recognition of malicious codes, it is useful in detecting and examining a large number of malicious codes efficiently. In this paper, we analyze the relationships between malicious codes and Native API functions. Also, by applying the word cloud with text mining technique, major Native APIs of malicious codes are visualized to assess their maliciousness. The proposed malicious code analysis method would be helpful in intuitively probing behaviors of malware.

A Study on Unstructured text data Post-processing Methodology using Stopword Thesaurus (불용어 시소러스를 이용한 비정형 텍스트 데이터 후처리 방법론에 관한 연구)

  • Won-Jo Lee
    • The Journal of the Convergence on Culture Technology
    • /
    • v.9 no.6
    • /
    • pp.935-940
    • /
    • 2023
  • Most text data collected through web scraping for artificial intelligence and big data analysis is generally large and unstructured, so a purification process is required for big data analysis. The process becomes structured data that can be analyzed through a heuristic pre-processing refining step and a post-processing machine refining step. Therefore, in this study, in the post-processing machine refining process, the Korean dictionary and the stopword dictionary are used to extract vocabularies for frequency analysis for word cloud analysis. In this process, "user-defined stopwords" are used to efficiently remove stopwords that were not removed. We propose a methodology for applying the "thesaurus" and examine the pros and cons of the proposed refining method through a case analysis using the "user-defined stop word thesaurus" technique proposed to complement the problems of the existing "stop word dictionary" method with R's word cloud technique. We present comparative verification and suggest the effectiveness of practical application of the proposed methodology.

A Study on the Analysis of Accident Types in Public and Private Construction Using Web Scraping and Text Mining (웹 스크래핑과 텍스트마이닝을 이용한 공공 및 민간공사의 사고유형 분석)

  • Yoon, Younggeun;Oh, Taekeun
    • The Journal of the Convergence on Culture Technology
    • /
    • v.8 no.5
    • /
    • pp.729-734
    • /
    • 2022
  • Various studies using accident cases are being conducted to identify the causes of accidents in the construction industry, but studies on the differences between public and private construction are insignificant. In this study, web scraping and text mining technologies were applied to analyze the causes of accidents by order type. Through statistical analysis and word cloud analysis of more than 10,000 structured and unstructured data collected, it was confirmed that there was a difference in the types and causes of accidents in public and private construction. In addition, it can contribute to the establishment of safety management measures in the future by identifying the correlation between major accident causes.

Comparative Analysis of News Big Data related to SARS-CoV, MERS-CoV, and SARS-CoV-2 (COVID-19)

  • Woo, Jae-Hyun
    • Journal of the Korea Society of Computer and Information
    • /
    • v.26 no.8
    • /
    • pp.91-101
    • /
    • 2021
  • This paper intends to draw implications for preparing for Post-Corona in the health field and policy fields as the global pandemic is experienced due to COVID-19. The purpose of this study is to analyze the news and trends of media companies through temporal analysis of the three infectious diseases, SARS-CoV, MERS-CoV, and SARS-CoV-2 (COVID-19), in which the domestic infectious disease preventive system was active throughout the first year of the outbreak. To this end, by using the news analysis program of the Korea Press Foundation 'Big Kinds', the number of news articles per year was digitized based on the period when each infectious disease had an impact on Korea, and major trends were implemented and analyzed in a word cloud. As a result of the analysis, the number of articles related to infectious diseases peaked when the World Health Organization (WHO) declared a warning and (suspicious) confirmed cases occurred. According to keyword and word cloud analysis, 'infectious disease outbreak and major epidemic areas', 'prevention authorities', and 'disease information and confirmed patient information' were found to be the main common features, and differences were derived from the three infectious diseases. In addition, the current status of the infodemic was identified by performing word cloud analysis on information in uncertainty. The results of this study are significant in that they were able to derive the roles of the health authorities and the media that should be preceded in the event of a new disease epidemic through previously experienced infectious diseases, and areas to be rearranged.

Research Trend Analysis on Smart healthcare by using Topic Modeling and Ego Network Analysis (토픽모델링과 에고 네트워크 분석을 활용한 스마트 헬스케어 연구동향 분석)

  • Yoon, Jee-Eun;Suh, Chang-Jin
    • Journal of Digital Contents Society
    • /
    • v.19 no.5
    • /
    • pp.981-993
    • /
    • 2018
  • Smart healthcare is convergence of ICT and healthcare services, and interdisciplinary research has been actively conducted in various fields. The objective of this study is to investigate trends of smart healthcare research using topic modeling and ego network analysis. Text analysis, frequency analysis, topic modeling, word cloud, and ego network analysis were conducted for the abstracts of 2,690 articles in Scopus from 2001 to April 2018. Topic Modeling analysis resulted in eight topics, Topics included "AI in healthcare", "Smart hospital", "Healthcare platform", "Blockchain in healthcare", "Smart health data", "Mobile healthcare", " Wellness care", "Cognitive healthcare". In order to examine the topic modeling results core deeply, we analyzed word cloud and ego network analysis for eight topics. This study aims to identify trends in smart healthcare research and suggest implications for establishing future research direction.

Recent Research Trends Analysis of Building Information Modeling using WordCloud through Comparison of Korean and International Journals (워드클라우드를 이용한 국내·외 BIM 연구 동향 분석)

  • Seo, Min-Goo;Lee, Ung-Kyun
    • Journal of the Korea Institute of Building Construction
    • /
    • v.19 no.1
    • /
    • pp.95-103
    • /
    • 2019
  • Introduction and use of Building Information Modeling(BIM) in construction projects have increased steadily over the past few years. However, the level of domestic BIM utilization is still tenuous compared to the international scene. Therefore, this study aims to present the possible directions for BIM research through an analysis of research literatures in Korea as well as in foreign countries. Papers on BIM were collected for this study from Korea and foreign countries for the field of architecture, and analyses and comparisons were performed by year and field. Further, the research patterns were analyzed using WordCloud, which is one of the popular big data techniques. From the analysis, it is found that the design field still constitutes the largest component of research, but the construction field is actively developing as well. In addition, it is realized that domestic BIM research continues to grow on collaboration and environment-friendly methodologies since 2012; it is also demonstrated that foreign BIM research has undergone changes in research trends every year including recently, and is progressing actively. Therefore, this study concludes that it is necessary to actively conduct research in the field of Industry Foundation Class(IFC) in the future. The results of this study can further be used as reference data for conducting BIM studies in Korea in the future.

A Case Study on Text Analysis Using Meal Kit Product Review Data (밀키트 제품 리뷰 데이터를 이용한 텍스트 분석 사례 연구)

  • Choi, Hyeseon;Yeon, Kyupil
    • The Journal of the Korea Contents Association
    • /
    • v.22 no.5
    • /
    • pp.1-15
    • /
    • 2022
  • In this study, text analysis was performed on the mealkit product review data to identify factors affecting the evaluation of the mealkit product. The data used for the analysis were collected by scraping 334,498 reviews of mealkit products in Naver shopping site. After preprocessing the text data, wordclouds and sentiment analyses based on word frequency and normalized TF-IDF were performed. Logistic regression model was applied to predict the polarity of reviews on mealkit products. From the logistic regression models derived for each product category, the main factors that caused positive and negative emotions were identified. As a result, it was verified that text analysis can be a useful tool that provides a basis for maximizing positive factors for a specific category, menu, and material and removing negative risk factors when developing a mealkit product.

Parents' Perceptions of Cognitive Rehabilitation for Children With Developmental Disabilities: A Mixed-Method Approach of Phenomenological Methodology and Word Cloud Analysis (발달장애 아동 부모의 인지재활 경험에 대한 질적 연구: 워드 클라우드 분석과 현상학적 연구 방법 혼합설계)

  • Ju, Yu-Mi;Kim, Young-Geun;Lee, Hee-Ryoung;Hong, Seung-Pyo;Han, Dae-Sung
    • Therapeutic Science for Rehabilitation
    • /
    • v.13 no.1
    • /
    • pp.49-63
    • /
    • 2024
  • Objective : The purpose of this study was to investigate parental perspectives on cognitive rehabilitation using a combination of phenomenological research methodology and word cloud analysis. Methods : Interviews were conducted with five parents of children with developmental disabilities. Word cloud analysis was conducted using Python, and five researchers analyzed the meaning units and themes using phenomenological methods. Words with high frequency were considered as a heuristic tool. Results : A total of 43 meaning units and nine components related to the phenomenon of cognitive rehabilitation were derived, and three themes were finalized. The main themes encompassed the definition of cognitive rehabilitation, challenges associated with cognitive rehabilitation, and factors influencing the selection of a cognitive rehabilitation institute. Cognitive rehabilitation emerged as a treatment focused on improving learning, daily functioning, and cognitive abilities in children with developmental disabilities. The perceived issues with cognitive rehabilitation pertained to treatment methods, therapist expertise, and associated costs. In addition, parents highlighted the importance of therapist expertise, humane personality, and affordability of cost and schedule when choosing a cognitive rehabilitation institute. Conclusion : Parents expressed expectations for substantial improvements in their children's daily functioning through cognitive rehabilitation. However, challenges were identified in clinical practices. Going forward, we expect that cognitive rehabilitation will evolve into a better therapeutic support service addressing the concerns raised by parents.