Browse > Article
http://dx.doi.org/10.23097/JPAF.2021.23(2):71

Korea National College of Agriculture and Fisheries in Naver News by Web Crolling : Based on Keyword Analysis and Semantic Network Analysis  

Joo, J.S. (Korea National College of Agriculture and Fisheries)
Lee, S.Y. (Korea National College of Agriculture and Fisheries)
Kim, S.H. (Korea National College of Agriculture and Fisheries)
Park, N.B. (Korea National College of Agriculture and Fisheries)
Publication Information
Journal of Practical Agriculture & Fisheries Research / v.23, no.2, 2021 , pp. 71-86 More about this Journal
Abstract
This study was conducted to find information on the university's image from words related to 'Korea National College of Agriculture and Fisheries (KNCAF)' in Naver News. For this purpose, word frequency analysis, TF-IDF evaluation and semantic network analysis were performed using web crawling technology. In word frequency analysis, 'agriculture', 'education', 'support', 'farmer', 'youth', 'university', 'business', 'rural', 'CEO' were important words. In the TF-IDF evaluation, the key words were 'farmer', 'dron', 'agricultural and livestock food department', 'Jeonbuk', 'young farmer', 'agriculture', 'Chonju', 'university', 'device', 'spreading'. In the semantic network analysis, the Bigrams showed high correlations in the order of 'youth' - 'farmer', 'digital' - 'agriculture', 'farming' - 'settlement', 'agriculture' - 'rural', 'digital' - 'turnover'. As a result of evaluating the importance of keywords as five central index, 'agriculture' ranked first. And the keywords in the second place of the centrality index were 'farmers' (Cc, Cb), 'education' (Cd, Cp) and 'future' (Ce). The sperman's rank correlation coefficient by centrality index showed the most similar rank between Degree centrality and Pagerank centrality. The KNCAF articles of Naver News were used as important words such as 'agriculture', 'education', 'support', 'farmer', 'youth' in terms of word frequency. However, in the evaluation including document frequency, the words such as 'farmer', 'dron', 'Ministry of Agriculture, Food and Rural Affairs', 'Jeonbuk', and 'young farmers' were found to be key words. The centrality analysis considering the network connectivity between words was suitable for evaluation by Cd and Cp. And the words with strong centrality were 'agriculture', 'education', 'future', 'farmer', 'digital', 'support', 'utilization'.
Keywords
Web crawling; Semantic network analysis; Pagerank centrality(Cp); Degree centrality(Cd); Eigenvector centrality(Ce); Sperman's rank correlation coefficient;
Citations & Related Records
연도 인용수 순위
  • Reference
1 조민호. (2019). 데이터 분석 전문가를 위한 R 데이터 분석. 정보문화사
2 박경진, 정덕호, 하민수, 이준기. (2014). 언어 네트워크분석에 기초한 과학학습의 목적에 대한 고등학교 교사와 학생들의 인식. Journal of the Korean Association for Science Education, 34(6), 571~581   DOI
3 주진수 외 5인. (2020). 한국농수산대학 신입생 자기소개서의 텍스트 마이닝과 연관규칙 분석 (2). 현장농수산연구지Vol. 22(2), No.2: 99-114.
4 주진수 외 5인. (2021). 언어네트워크분석을 활용한 한국농수산대학 신입생 자기소개서 분석. 현장농수산연구지 Vol. 23(1), No.1: 89-104.
5 https://briatte.github.io/ggnet/. ggnet2: network visualization with ggplot2
6 https://da-it-so.tistory.com/43. TF-IDF 기법 이해하기
7 https://iamdaisy.tistory.com/31?category=620658. 소셜네트워크 분석의 이해
8 https://data-traveler.tistory.com/33. R을 이용한 텍스트마이닝_TF-IDF(코드 및 설명)
9 주진수 외 5인. (2020). 한국농수산대학 신입생 자기소개서의 텍스트 마이닝과 연관규칙분석 (1). 현장농수산연구지 Vol. 22(1), No.1: 113-130.
10 https://bookdown.org/yuaye_kt/RTIPS/Texnetword.html. Chapter 11 텍스트 데이터-단어 네트워크맵(1)
11 김영우. (2017). 쉽게 배우는 R 데이터 분석, 이지스퍼블리싱.