A Study on the Analysis of Agricultural R&D Keywords Using Textmining Method (텍스트마이닝을 활용한 농업 R&D 키워드 분석)

  • Kim, Ji-Hoon;Kim, Seong-Sup
    • Journal of the Korea Academia-Industrial cooperation Society
    • v.22 no.2
    • pp.721-732
    • 2021
  • This study analyzed agricultural R&D keywords using text mining to examine trends in agricultural R&D. The data were R&D project records provided by NTIS, classified by R&D stage and by year from 2003 to 2018. TF-IDF was used as the analysis method, and keyword rankings were derived from the scores. Similar keywords were also grouped for analysis. The main results are as follows. First, agricultural R&D trends change with the introduction of new technologies and changes in the external environment. Second, keyword changes appeared with a time lag across R&D stages: the main keywords shifted in the order basic research, applied research, development research. Third, the main keyword of agricultural R&D was 'rice'; however, the direction and purpose of the research changed with the domestic and international agricultural environments.
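
The TF-IDF ranking step described in this abstract can be illustrated with a minimal sketch. It assumes the textbook weighting (term frequency times log inverse document frequency) and treats each year as one "document"; the abstract does not state the exact variant the authors used. The function name `tf_idf_ranking` and the toy data are hypothetical and are not taken from NTIS.

```python
import math
from collections import Counter

def tf_idf_ranking(docs):
    """Rank terms per document by a textbook TF-IDF score.

    docs: dict mapping a label (e.g. a year) to a list of keyword tokens.
    Returns a dict mapping each label to [(term, score), ...], best first.
    """
    n_docs = len(docs)
    # Document frequency: the number of documents each term appears in.
    df = Counter()
    for tokens in docs.values():
        df.update(set(tokens))

    rankings = {}
    for label, tokens in docs.items():
        counts = Counter(tokens)
        total = len(tokens)
        scores = {
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        }
        # Sort by descending score, then alphabetically for stable ties.
        rankings[label] = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return rankings

# Hypothetical toy data (assumption, for illustration only).
projects_by_year = {
    2003: ["rice", "yield", "rice", "breeding"],
    2010: ["rice", "climate", "climate", "adaptation"],
    2018: ["rice", "smartfarm", "smartfarm", "sensor"],
}
rankings = tf_idf_ranking(projects_by_year)
```

With this plain weighting, a term that occurs in every document (here "rice") scores zero because its inverse document frequency is log(1). A study that reports such a term as a main keyword presumably uses a smoothed IDF or a different document split, which this sketch does not attempt to reproduce.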

A Study on the Reclassification of Author Keywords for Automatic Assignment of Descriptors (디스크립터 자동 할당을 위한 저자키워드의 재분류에 관한 실험적 연구)

  • Kim, Pan-Jun;Lee, Jae-Yun
    • Journal of the Korean Society for information Management
    • v.29 no.2
    • pp.225-246
    • 2012
  • This study investigated the feasibility of automatic descriptor assignment through the reclassification of author keywords in domestic scholarly databases. In the first stage, we selected the optimal classifiers and parameters for reclassification by comparing the characteristics of machine learning classifiers. In the next stage, after learning the author keywords assigned to the selected articles on readings, the author keywords were automatically added to another set of relevant articles. We examined whether author keyword reclassification provides vocabulary control in the same way that descriptors collocate documents on the same topic. The results showed that author keyword reclassification is capable of automatic descriptor assignment.

Design and Implementation of Potential Advertisement Keyword Extraction System Using SNS (SNS를 이용한 잠재적 광고 키워드 추출 시스템 설계 및 구현)

  • Seo, Hyun-Gon;Park, Hee-Wan
    • Journal of the Korea Convergence Society
    • v.9 no.7
    • pp.17-24
    • 2018
  • One of the major issues in big data processing is extracting keywords from the internet and using them to process the necessary information. Most proposed keyword extraction algorithms extract keywords using the search function of a large portal site. In addition, these methods extract keywords from already posted or created documents or from fixed content. In this paper, we propose KAES (Keyword Advertisement Extraction System), which supports potential shopping keyword marketing by extracting issue keywords and related keywords from dynamic instant messages such as the various issues, interests, and comments posted on SNS. KAES builds a list of specific accounts and extracts the keywords and related keywords that occur most frequently on the SNS.
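
The frequency-based extraction of issue keywords and related keywords can be sketched as follows. This is not the KAES implementation, whose internals the abstract does not describe. It assumes posts are already tokenized, counts each keyword once per post, and treats "related keywords" as the words that most often co-occur with an issue keyword. The function name and the sample posts are hypothetical.

```python
from collections import Counter

def issue_and_related_keywords(posts, top_n=2, related_n=2):
    """Return the most frequent keywords and their top co-occurring words.

    posts: list of token lists, one per post (assumed pre-tokenized).
    """
    def top(counter, n):
        # Descending count, alphabetical order for ties.
        ordered = sorted(counter.items(), key=lambda kv: (-kv[1], kv[0]))
        return [word for word, _ in ordered[:n]]

    # Post-level frequency: a keyword counts once per post.
    freq = Counter()
    for tokens in posts:
        freq.update(set(tokens))
    issue_keywords = top(freq, top_n)

    related = {}
    for keyword in issue_keywords:
        cooccur = Counter()
        for tokens in posts:
            unique = set(tokens)
            if keyword in unique:
                cooccur.update(unique - {keyword})
        related[keyword] = top(cooccur, related_n)
    return issue_keywords, related

# Hypothetical posts from a list of tracked accounts (assumption).
posts = [
    ["sneakers", "sale", "weekend"],
    ["sneakers", "sale"],
    ["camping", "tent"],
    ["sneakers", "review"],
]
issue_keywords, related = issue_and_related_keywords(posts)
```

Counting once per post keeps a single repetitive post from dominating the ranking; a real system would also need tokenization and stop-word handling, which are omitted here.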

Chunking Annotation Corpus Construction for Keyword Extraction in News Domain (뉴스 기사 키워드 추출을 위한 구묶음 주석 말뭉치 구축)

  • Kim, Tae-Young;Kim, Jeong Ah;Kim, Bo Hui;Oh, Hyo Jung
    • Annual Conference on Human and Language Technology
    • 2020.10a
    • pp.595-597
    • 2020
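  • In the big data era, automatically grasping the meaning of large volumes of documents requires that the core words covering a document's topic and content be extracted as keyword units. A keyword unit in a document may be a single word, including a compound noun, or a larger chunk. Owing to its linguistic characteristics, Korean lends itself to the concept of chunking, which makes it possible to extract chunks that can serve as major keywords. Accordingly, this study refined a corpus construction methodology that uses a tagging tool, including the definition of guidelines for tagging keyword chunks of various units as well as single words, and verified the methodology by applying it to the news domain to build an annotated corpus. The results of this study can serve as groundwork for any text-mining technology that needs to understand and analyze the content of text documents.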

Keyword and Network Analysis of University Core Competency Studies (대학 핵심역량 관련 연구들의 주요 키워드와 네트워크 분석)

  • Kwon, Choong-Hoon
    • Proceedings of the Korean Society of Computer Information Conference
    • 2021.01a
    • pp.133-134
    • 2021
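  • This study analyzes the major keywords of recent studies on 'core competency', which has recently become the central term in the evaluation of higher education institutions (universities), and the networks among those keywords. Using language network analysis, the study examines a total of 176 studies on 'university core competency' published in accredited journals (including accreditation candidates) from 2011 to 2020 (the most recent 10 years), extracting major keywords, presenting word clouds, and analyzing the relationships among major keywords (semantic network). The results are expected to serve as important basic data for scholars conducting related research and for university staff planning and evaluating school-level educational activities.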

Keyword Analysis of COVID-19 in News Big Data : Focused on 4 Major Daily Newspapers

  • Kwon, Seong-Wook
    • Journal of the Korea Society of Computer and Information
    • v.25 no.12
    • pp.101-107
    • 2020
  • This paper compares and analyzes the major keywords of progressive and conservative newspapers according to their political orientation, using big data from the four major domestic daily newspapers on COVID-19, which has become a prolonged battle. To this end, 93,917 news reports from Jan. 20 to Sept. 15, 2020 were divided into four stages, and the major keywords of the four newspapers were visualized and analyzed as word clouds. According to the analysis, the conservative newspapers focused on the government's response, criticism, and China's responsibility, mentioning the keywords "government," "president," "state of affairs" and "mask" more often than the progressive newspapers, while the progressive newspapers used keywords that emphasize the seriousness of the disease and the occurrence of dangerous situations. The Chosun Ilbo was found to use a wide variety of keywords during the mass outbreak of cluster infections (Feb. 18 to May 15), and the JoongAng Ilbo used keywords criticizing government policies in its reports on infectious diseases such as COVID-19, but also used the keywords favored by progressive newspapers that emphasize the seriousness of the disease and the occurrence of dangerous situations.

A Study on the Research Trend in the Dyslexia and Learning Disability Through a Keyword Network Analysis (키워드 네트워크 분석을 통한 난독증과 학습장애 관련 연구 동향 분석)

  • Lee, Woo-Jin;Kim, Tae-Gang
    • Journal of Digital Convergence
    • v.17 no.1
    • pp.91-98
    • 2019
  • The present study investigated general research trends in dyslexia and learning disability and explored the centrality of related variables through keyword network analysis. Data were collected from ten years of articles in the Research Information Sharing Service (RISS), which is provided by the Korea Education and Research Information Service (KERIS). The analysis consisted of keyword cleansing, extraction of major keywords using the KrKwic program, and visualization of the centrality of connections between keywords using the NodeXL program. The results were as follows. First, a total of 72 keywords were extracted through the keyword cleansing process; the major keywords among them included learning disability, dyslexia, and RTI. Second, analysis of betweenness centrality shows that learning disability is a key word that has been addressed across studies of dyslexia and learning disabilities in Korea. These results suggest a method for analyzing research trends on dyslexia and learning disorders that combines quantitative and qualitative analysis.
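
Betweenness centrality on a keyword network can be computed as in the sketch below. It is not the NodeXL procedure used in the study. It assumes an unweighted, undirected co-occurrence graph (two keywords are linked if they appear in the same article) and uses Brandes' algorithm without normalization. The function names and the sample articles are hypothetical.

```python
from collections import defaultdict, deque
from itertools import combinations

def build_cooccurrence_graph(keyword_lists):
    """Link every pair of keywords that share at least one article."""
    graph = defaultdict(set)
    for keywords in keyword_lists:
        for a, b in combinations(sorted(set(keywords)), 2):
            graph[a].add(b)
            graph[b].add(a)
    return graph

def betweenness_centrality(graph):
    """Unnormalized betweenness via Brandes' algorithm (unweighted graph)."""
    centrality = {node: 0.0 for node in graph}
    for source in graph:
        # Breadth-first search that counts shortest paths from the source.
        order = []
        predecessors = {node: [] for node in graph}
        n_paths = {node: 0 for node in graph}
        n_paths[source] = 1
        distance = {source: 0}
        queue = deque([source])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in graph[v]:
                if w not in distance:
                    distance[w] = distance[v] + 1
                    queue.append(w)
                if distance[w] == distance[v] + 1:
                    n_paths[w] += n_paths[v]
                    predecessors[w].append(v)
        # Accumulate dependencies from the farthest nodes back to the source.
        dependency = {node: 0.0 for node in graph}
        while order:
            w = order.pop()
            for v in predecessors[w]:
                dependency[v] += n_paths[v] / n_paths[w] * (1 + dependency[w])
            if w != source:
                centrality[w] += dependency[w]
    # Each unordered pair was counted from both endpoints, so halve the totals.
    return {node: score / 2 for node, score in centrality.items()}

# Hypothetical article keyword lists (assumption, not the RISS data).
articles = [
    ["dyslexia", "learning disability"],
    ["learning disability", "RTI"],
    ["learning disability", "reading"],
    ["dyslexia", "reading"],
]
scores = betweenness_centrality(build_cooccurrence_graph(articles))
```

In this toy network "learning disability" is the only keyword lying on shortest paths between other keywords, so it is the only node with a nonzero score.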

Trend Analysis of News Articles Regarding Sungnyemun Gate using Text Mining (텍스트마이닝을 활용한 숭례문 관련 기사의 트렌드 분석)

  • Kim, Min-Jeong;Kim, Chul Joo
    • The Journal of the Korea Contents Association
    • v.17 no.3
    • pp.474-485
    • 2017
  • Sungnyemun Gate, Korea's National Treasure No. 1, was destroyed by fire on February 10, 2008 and was reopened to the public on May 4, 2013 after reconstruction. Sungnyemun Gate became a national issue and drew public attention as a major topic of news and research. In this research, text mining and association rule mining were applied to keywords of newspaper articles about Sungnyemun Gate as a cultural heritage from 2002 to 2016 to find major keywords and keyword association rules. Next, we analyzed typical keywords that appear frequently and specific keywords that appear only partially, depending on the period (before or after the fire) and the newspaper company. Through this research, the trends and keywords of newspaper articles related to Sungnyemun Gate could be understood, and this research can serve as fundamental data about Sungnyemun Gate for information producers and consumers.
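
Keyword association rules of the kind mentioned in this abstract are usually scored by support, confidence, and lift. The sketch below is a simplified illustration restricted to rules between single keywords (one antecedent, one consequent); it is not a full Apriori implementation and not the authors' pipeline. The thresholds, the function name, and the sample articles are hypothetical.

```python
from collections import Counter
from itertools import combinations

def pairwise_rules(transactions, min_support=0.4, min_confidence=0.6):
    """Find single-keyword rules A -> B with their support, confidence, lift.

    transactions: list of keyword lists, one per article.
    """
    n = len(transactions)
    item_counts = Counter()
    pair_counts = Counter()
    for keywords in transactions:
        items = sorted(set(keywords))
        item_counts.update(items)
        pair_counts.update(combinations(items, 2))

    rules = []
    for (a, b), count in pair_counts.items():
        support = count / n  # share of articles containing both keywords
        if support < min_support:
            continue
        for antecedent, consequent in ((a, b), (b, a)):
            # Confidence: P(consequent | antecedent).
            confidence = count / item_counts[antecedent]
            if confidence >= min_confidence:
                # Lift: confidence relative to the consequent's base rate.
                lift = confidence / (item_counts[consequent] / n)
                rules.append((antecedent, consequent, support, confidence, lift))
    return sorted(rules, key=lambda r: (-r[3], r[0], r[1]))

# Hypothetical article keyword sets (assumption, not the study's data).
articles = [
    ["fire", "restoration", "gate"],
    ["fire", "arson"],
    ["restoration", "gate"],
    ["fire", "restoration"],
    ["gate", "tourism"],
]
rules = pairwise_rules(articles)
```

A lift above 1 indicates that two keywords appear together more often than their individual frequencies would suggest, which is what makes a rule interesting beyond raw co-occurrence counts.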

Analysis of Keyword Association and Keyword Network of #MeToo Movement on Twitter (트위터에 나타난 미투운동의 키워드 연관성 및 키워드 네트워크 분석)

  • Kwak, Soo-Jeong;Kim, Hyon Hee
    • Annual Conference of KIPS
    • 2018.05a
    • pp.311-314
    • 2018
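  • With the recent surge of the 'MeToo movement', a new wave of feminism has arrived. Unlike earlier feminist movements, it is carried out anonymously through SNS and spreads very quickly. Taking these characteristics into account, this study identifies major keywords in real Twitter data and examines the social context through association and network analysis of those keywords.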

A Keyword Analysis of Collection Development Policies of University and Public Libraries Using Text Mining (텍스트 마이닝을 활용한 대학도서관과 공공도서관의 장서개발 정책 키워드 분석)

  • Da-Hyeon Lee;Dong-Hee Shin
    • Journal of the Korean Society for Library and Information Science
    • v.58 no.1
    • pp.285-302
    • 2024
  • For this article, we conducted frequency analysis, topic modeling, and network analysis on eleven texts related to collection development policy found in the National Library of Korea. We derived the main keywords related to collection development policies and analyzed the relationships between them. We then conducted a phi coefficient analysis to identify the characteristics of the collection development policies of university libraries and public libraries by category. The results showed that "material," "library," "collection development," "user," and "collection" were the main keywords in both frequency analysis and network centrality. Meanwhile, the phi coefficient analysis revealed that keywords such as "university," "construction," "student," "target," and "cost" were prevalent in university libraries, indicating that the academic needs of users and the discussion of digital resources were primary issues, while keywords related to the information needs of various user groups, including "adults," "survey," "feature," and "religion," appeared in public libraries.
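
The coefficient analysis in this abstract is read here as the phi coefficient for a 2x2 table, which is an assumption. Under that reading, the association between a keyword and a library type can be sketched as below. The table layout (keyword present or absent against university or public library) and the counts are hypothetical, chosen only to illustrate the formula.

```python
import math

def phi_coefficient(n11, n10, n01, n00):
    """Phi coefficient of a 2x2 contingency table.

    n11: keyword present, university library text
    n10: keyword present, public library text
    n01: keyword absent,  university library text
    n00: keyword absent,  public library text
    """
    row_present, row_absent = n11 + n10, n01 + n00
    col_university, col_public = n11 + n01, n10 + n00
    denominator = math.sqrt(row_present * row_absent * col_university * col_public)
    if denominator == 0:
        # A row or column is empty, so the association is undefined; report 0.
        return 0.0
    return (n11 * n00 - n10 * n01) / denominator

# Hypothetical counts for one keyword (assumption, not the study's data).
phi = phi_coefficient(n11=4, n10=1, n01=1, n00=5)
```

Values near +1 mark keywords characteristic of university library texts, values near -1 mark keywords characteristic of public library texts, and values near 0 indicate no preference.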