• Title/Summary/Keyword: frequency-based text analysis (빈도 기반 텍스트 분석)


An Analysis of the International Trends of Research on Artificial Intelligence in Education Using Topic Modeling (인공지능 활용 교육의 토픽모델링 분석을 통한 수학교육 연구 방향의 함의)

  • Noh, Jihwa;Ko, Ho Kyoung;Kim, Byeongsoo;Huh, Nan
    • Journal of the Korean School Mathematics Society / v.26 no.1 / pp.1-19 / 2023
  • This study analyzed international trends in research on artificial intelligence in education by examining 352 papers recently published in the International Journal of Artificial Intelligence in Education (IJAIED) using topic modeling. The IJAIED is the official, SCOPUS-indexed journal of the International AIED Society. The analysis revealed that international AIED research could be categorized into eight topics. Topics such as analyzing student behavior models in learning systems and designing feedback on student solutions increased over time, whereas research focusing on data handling methods decreased. Based on the findings, implications and suggestions for the research and development of AIED applications were provided.

A Study on the Risk-based Rainfall Standards Representing Regional Flood Damage (위험기반 지역별 홍수피해 강우기준 산정 방안)

  • Yu, Yeong Uk;Seong, Yeon Jeong;Jang, Woong Chul;Jung, Young Hun
    • Proceedings of the Korea Water Resources Association Conference / 2022.05a / pp.59-59 / 2022
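  • Natural disasters are occurring frequently worldwide due to climate change accompanied by global warming. Among disaster types, hydrological disasters caused by heavy rainfall and typhoons account for the majority. The scale and extent of flood damage vary with rainfall characteristics and regional characteristics. To protect property and lives from such heterogeneous flood damage, flood defense plans suited to regional characteristics must therefore be established by considering hazard, exposure, and vulnerability. This study surveyed past flood damage cases in 228 administrative districts nationwide to identify regional flood damage characteristics and to propose flood damage rainfall standards that reflect them. To this end, past flood damage periods and damage amounts recorded in the Annual Disaster Report (재해연보) were collected, along with rainfall during those periods and news articles, from which information on the flood damage phenomena mentioned was gathered. From the collected information, phenomenon-based rainfall grades reflecting regional exposure and vulnerability were derived. These were combined with probability rainfall, which represents regional rainfall characteristics and embodies hazard, to propose regional flood damage rainfall standards that consider hazard, exposure, and vulnerability. Flood damage information is mostly collected from the Annual Disaster Report, but because the report does not include information on flood damage phenomena, it is of limited use for identifying the types of flood damage that occur in each region. This study therefore collected news articles for the periods when past flood damage occurred, applied text mining to the collected phenomenon information, and identified the flood damage types to which each region is vulnerable through keyword frequency analysis.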
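
The keyword frequency analysis mentioned in this abstract can be illustrated with a minimal sketch. The keyword list, the sample articles, and the function name `keyword_frequencies` are hypothetical; the abstract does not give the study's actual keywords or tooling, and Korean news text would normally need morphological analysis before counting.

```python
from collections import Counter

# Hypothetical damage-phenomenon keywords; the study's actual list is not given.
DAMAGE_KEYWORDS = ["inundation", "landslide", "levee breach", "road closure"]

def keyword_frequencies(articles, keywords=DAMAGE_KEYWORDS):
    """Count case-insensitive occurrences of each keyword across all articles."""
    counts = Counter()
    for text in articles:
        lowered = text.lower()
        for kw in keywords:
            counts[kw] += lowered.count(kw)
    return counts

# Made-up article snippets for one region.
articles = [
    "Heavy rain caused inundation of homes and a landslide near the village.",
    "Road closure followed the inundation; another landslide was reported.",
    "Inundation of farmland continued for two days.",
]

freq = keyword_frequencies(articles)
for keyword, count in freq.most_common():
    print(keyword, count)
```

Ranking the counts per region would then suggest which damage type dominates there, which is the kind of reading the abstract describes.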


A Diachronic Lexical Analysis of the North Korean English Textbooks (북한 영어 교과서 어휘의 통시적 분석)

  • Kim, Jiyoung;Lee, Je-Young;Kim, Jeong-ryeol
    • The Journal of the Korea Contents Association / v.17 no.4 / pp.331-341 / 2017
  • This paper diachronically analyzes the English vocabulary of North Korean textbooks using a purpose-built English textbook corpus. The North Korean English textbooks, obtained from the Information Center on North Korea of the Ministry of Unification, were divided into periods before and after the Kim Jong-Il era, using 1996, the year of the curriculum revision, as the dividing point. They were stored as text files and their vocabulary was analyzed using WordSmith Tools 7.0. The vocabulary size of the revised textbooks increased after the curriculum reorganization, but the number of vocabulary types and vocabulary diversity decreased. After the curriculum revision, many words related to the establishment of the Kim Jong-Il system appeared as keywords. Some vocabulary also reflected the economic and social life of North Korea. In addition, a comparison of the 100 highest-frequency words and the keywords suggests that the vocabulary of North Korean English textbooks is gradually shifting from content related to written language toward communicative content.
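
The three measures contrasted in this abstract (vocabulary size, number of types, and diversity) can be sketched as follows. This assumes diversity is measured as a plain type-token ratio; the abstract does not say which statistic was used, and the sample strings and the function name `lexical_stats` are illustrative only.

```python
import re

def lexical_stats(text):
    """Return token count, type count, and type-token ratio (types / tokens)."""
    tokens = re.findall(r"[a-z']+", text.lower())
    types = set(tokens)
    ttr = len(types) / len(tokens) if tokens else 0.0
    return {"tokens": len(tokens), "types": len(types), "ttr": ttr}

# Toy stand-ins for an earlier and a later textbook corpus.
before = "the cat sat on the mat"
after = "the cat sat on the cat the cat sat"

print(lexical_stats(before))
print(lexical_stats(after))
```

In this toy case the later text has more tokens but fewer types and a lower ratio, which is the pattern the abstract reports for the revised textbooks.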

Exploring the Trend of Korean Creative Dance by Analyzing Research Topics : Application of Text Mining (연구주제 분석을 통한 한국창작무용 경향 탐색 : 텍스트 마이닝의 적용)

  • Yoo, Ji-Young;Kim, Woo-Kyung
    • Journal of Korea Entertainment Industry Association / v.14 no.6 / pp.53-60 / 2020
  • This study assumes that trends in a phenomenon and trends in research on it are contextually consistent. Its purpose is therefore to explore trends in dance through a subject analysis of Korean creative dance studies using text mining. To this end, 1,291 words from 616 journal article titles listed on a paper search website were analyzed. Data collection, refinement, and analysis were all carried out in R 3.6.0. First, keywords representing the times were frequently used before the 2000s, but Korean creative dance research related to education and physical training was also found. Second, the frequency of keywords related to dance troupe performances was high after the 2000s, but Choi Seung-hee was confirmed to still hold an important position in the study of Korean creative dance. Third, an analysis of the overall research subjects showed that research on 'Art of Choi Seung-hee in the modern era' accounted for the highest proportion. Fourth, the Hot Topics, which have been rising since 2000, were 'the performance activities of the National Dance Company' and 'the choreography expression and utilization of traditional dance'. Since the National Dance Company's recent performances advocate 'modernization based on tradition', it was confirmed that Korean creative dance since the 2000s has focused on the use of traditional dance motifs. Fifth, the Cold Topic, which has been declining since 2000, was the study of 'dancing expressions by age'. Interest in this research was judged to have decreased because of the tendency to mix various dance styles after Korean creative dance was established as a genre.

A Comparative Analysis of Complex Disaster Research Trends Using Network Analysis (네트워크 분석을 활용한 국내·외 복합재난 연구 동향 분석)

  • Woosik Kim;Yeonwoo Choi;Youjeong Hong;Dong Keun Yoon
    • Journal of the Society of Disaster Information / v.18 no.4 / pp.908-921 / 2022
  • Purpose: As the connections between physical and non-physical structures in cities expand and become more complex, the risk of complex disasters, which cause damage in compound ways, is increasing. To prepare for these complex disasters, it is important to preemptively identify and manage disasters that can develop into complex disasters. Therefore, this study analyzes the disaster types studied as complex disasters by examining trends in domestic and international studies, and presents directions for future complex disaster management. Method: We first established co-occurrence networks between disaster types based on 993 articles related to complex disasters published in disaster-related journals over the last 20 years (2002-2021). Then, through network analysis, domestic and international complex disaster research trends were compared. Result: In domestic studies, the proportion of research on complex disasters related to storm and flood damage, infrastructure failure, and fire was high, and research on complex disasters related to earthquakes and landslides has recently increased. In international studies, however, the proportion of studies on infrastructure failure along with storm and flood damage and earthquakes was high, and various types of disasters such as tsunamis and droughts appeared. Conclusion: The results of this study are expected to increase understanding of trends in complex disaster research and provide suggestions for future domestic complex disaster research.
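
A co-occurrence network between disaster types, as named in the Method above, can be sketched as a weighted edge list. This assumes each article has already been tagged with the disaster types it covers; the tagging step, the sample data, and the function name `cooccurrence_edges` are hypothetical and not taken from the study.

```python
from collections import Counter
from itertools import combinations

def cooccurrence_edges(article_disasters):
    """Count, for each unordered pair of disaster types, the articles mentioning both."""
    edges = Counter()
    for types in article_disasters:
        # Sort so that each pair always appears in the same order.
        for a, b in combinations(sorted(set(types)), 2):
            edges[(a, b)] += 1
    return edges

# Made-up tags for three articles.
articles = [
    {"flood", "infrastructure failure"},
    {"earthquake", "landslide"},
    {"flood", "infrastructure failure", "fire"},
]

edges = cooccurrence_edges(articles)
for pair, weight in edges.most_common():
    print(pair, weight)
```

The edge weights can then be fed to any graph library for centrality or community analysis; which measures the authors used is not stated in the abstract.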

The Characteristics and Improvement Directions of Regional Climate Change Adaptation Policies in accordance with Damage Cases (지자체 기후변화 적응 대책 특성 및 개선 방향)

  • Ahn, Yoonjung;Kang, Youngeun;Park, Chang Sug;Kim, Ho Gul
    • Journal of Environmental Impact Assessment / v.25 no.4 / pp.296-306 / 2016
  • Interest in establishing regional climate change adaptation policies is growing as climate change impacts at regional and local scales increase. This study analyzed the characteristics of local climate change adaptation plans in 32 regions. First, the statistical program R was used to conduct a cluster analysis based on the frequency and budgets of adaptation plans. Next, we analyzed the frequency of damage reported in newspapers in eight categories of climate change impacts caused by extreme weather events, covering 2,565 cases over 24 years. Lastly, the characteristics of the adaptation plans were compared with damage frequency patterns to evaluate the adequacy of the plans in each cluster. The cluster analysis produced four clusters, most of which had distinct characteristics in certain sectors. Damage frequency was high in the 'disaster' and 'health' sectors, and adaptation plans and budgets were also concentrated in those sectors. However, when relative rates were compared among regional governments, the types of damage and the adaptation plans differed. We assumed that this difference arose because each region established its adaptation plan based not only on damage frequency but also on vulnerability assessments and expert opinions. The results of this study could contribute to policy making for climate change adaptation plans.

Multi-Label Classification for Corporate Review Text: A Local Grammar Approach (머신러닝 기반의 기업 리뷰 다중 분류: 부분 문법 적용을 중심으로)

  • HyeYeon Baek;Young Kyun Chang
    • Information Systems Review / v.25 no.3 / pp.27-41 / 2023
  • Unlike the previous works focusing on the state-of-the-art methodologies to improve the performance of machine learning models, this study improves the 'quality' of training data used in machine learning. We propose a method to enhance the quality of training data through the processing of 'local grammar,' frequently used in corpus analysis. We collected a vast amount of unstructured corporate review text data posted by employees working in the top 100 companies in Korea. After improving the data quality using the local grammar process, we confirmed that the classification model with local grammar outperformed the model without it in terms of classification performance. We defined five factors of work engagement as classification categories, and analyzed how the pattern of reviews changed before and after the COVID-19 pandemic. Through this study, we provide evidence that shows the value of the local grammar-based automatic identification and classification of employee experiences, and offer some clues for significant organizational cultural phenomena.

Sensitivity Identification Method for New Words of Social Media based on Naive Bayes Classification (나이브 베이즈 기반 소셜 미디어 상의 신조어 감성 판별 기법)

  • Kim, Jeong In;Park, Sang Jin;Kim, Hyoung Ju;Choi, Jun Ho;Kim, Han Il;Kim, Pan Koo
    • Smart Media Journal / v.9 no.1 / pp.51-59 / 2020
  • From the era of PC communication through the development of the internet, new words have been coined on social media, and with the spread of smartphones a social media culture has formed in which newly coined words are themselves becoming part of the culture. With social networking sites and smartphones serving as a bridge, the volume of data has grown in real time. New words offer several advantages, such as allowing short sentences that work around the character limits of various messengers and reduce data. However, new words have no dictionary meaning, which limits and degrades algorithms such as data mining. In this paper, therefore, data are collected through web crawling, new words contained in the text data are extracted, and a sentiment classification is established to determine the opinion of a document. The experiment proceeds in three stages. First, new words collected from social media are trained as positive or negative. Next, to derive and verify sentiment values using standard-language documents, TF-IDF is used to score the sentiment of nouns and assign sentiment values to the data. As with the new words, the classified sentiment values are applied to verify that sentiment is classified in standard-language documents. Finally, the new-word and standard-language sentiment values are combined to perform a comparative analysis of the technique.
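
The TF-IDF scoring step named in this abstract can be sketched as below. This uses the plain, unsmoothed weighting (term frequency times log(N / document frequency)); the abstract does not state which variant the authors used, and the sample documents and the function name `tf_idf` are illustrative assumptions.

```python
import math
from collections import Counter

def tf_idf(docs):
    """docs: list of token lists. Returns one {term: score} dict per document."""
    n = len(docs)
    df = Counter()
    for doc in docs:
        df.update(set(doc))  # count each term once per document
    scores = []
    for doc in docs:
        tf = Counter(doc)
        scores.append(
            {t: (c / len(doc)) * math.log(n / df[t]) for t, c in tf.items()}
        )
    return scores

# Made-up tokenized documents (nouns and adjectives only).
docs = [
    ["service", "great", "great"],
    ["service", "slow"],
    ["service", "great", "price"],
]

scores = tf_idf(docs)
print(scores[0])
```

A term that appears in every document, such as "service" here, scores zero, so the weighting highlights terms that distinguish one document from the rest.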

Social Perceptions and Attitudes toward the Elderly Shared Online: Focusing on Social Big Data Analysis (온라인상에서 공유되는 노인에 대한 사회적 인식과 태도: 소셜 빅데이터 분석을 중심으로)

  • An, Soontae;Lee, Hannah;Chung, Soondool
    • 한국노년학 / v.41 no.4 / pp.505-525 / 2021
  • Purpose. The purpose of this study is to examine how the phrase "old person" is expressed and used in the online sphere. Based on the theoretical concept of stigma, this study investigates the images of and attitudes toward the elderly in society, and the characteristics of hate speech aimed at the elderly. Method. This study conducted text mining based on social big data using anonymous conversations. Results. The images of the elderly shared online were generally negative. The attitudes expressed toward them also tended to be negative because of the negative images propagated about the elderly. Hate speech relating to the elderly, in usages such as 'Teul-ttag' and 'Kon-dae', was mainly identified in comments that negatively evaluate the elderly, and these expressions demonstrate the depth of hate and discrimination toward the elderly, who are considered burdensome by young people. Interestingly, hateful expressions toward the elderly were found more often in connection with political and economic issues than in content about the elderly in general. Conclusions. This study discussed ways and means to enhance inter-generational understanding and solidarity.

Web Site Keyword Selection Method by Considering Semantic Similarity Based on Word2Vec (Word2Vec 기반의 의미적 유사도를 고려한 웹사이트 키워드 선택 기법)

  • Lee, Donghun;Kim, Kwanho
    • The Journal of Society for e-Business Studies / v.23 no.2 / pp.83-96 / 2018
  • Extracting keywords that represent documents is very important because they can be used for automated services such as document search, classification, and recommendation systems, as well as for quickly conveying document information. However, when keywords are extracted based on the frequency of words in website documents or with graph algorithms based on word co-occurrence, the results can contain various words unrelated to the topic that come from the web page structure, and semantic keywords are difficult to extract because of the limited performance of Korean tokenizers. In this paper, we propose a method that selects candidate keywords based on semantic similarity to address both the failure to extract semantic keywords and the poor accuracy of Korean tokenizer analysis. Final semantic keywords are then extracted through a filtering process that removes inconsistent keywords. Experiments on real web pages of small businesses show that the proposed method improves performance by 34.52% over a keyword selection technique based on statistical similarity. This confirms that keyword extraction from documents is improved by considering semantic similarity between words and removing inconsistent keywords.
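
The idea of filtering candidate keywords by semantic similarity can be sketched as follows. This is not the authors' algorithm, which the abstract does not specify. It assumes each candidate already has an embedding (for example from a trained Word2Vec model) and keeps candidates whose mean cosine similarity to the other candidates meets a threshold. The two-dimensional vectors, the threshold of 0.5, and the function names are hypothetical.

```python
import math

def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def select_keywords(candidates, vectors, threshold=0.5):
    """Keep candidates whose mean similarity to the other candidates is >= threshold."""
    kept = []
    for word in candidates:
        others = [c for c in candidates if c != word]
        if not others:
            kept.append(word)  # a lone candidate has nothing to conflict with
            continue
        mean_sim = sum(cosine(vectors[word], vectors[o]) for o in others) / len(others)
        if mean_sim >= threshold:
            kept.append(word)
    return kept

# Toy embeddings: three bakery-related words and one page-structure word.
vectors = {
    "bakery": [1.0, 0.1],
    "bread": [0.9, 0.2],
    "cake": [0.8, 0.3],
    "login": [0.0, 1.0],
}

selected = select_keywords(list(vectors), vectors)
print(selected)
```

Here "login", a word that would come from the page structure rather than the site's topic, sits far from the other candidates and is dropped.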