• Title/Summary/Keyword: 검색어 추출

Search Result 328, Processing Time 0.022 seconds

NFT(Non-Fungible Token) Patent Trend Analysis using Topic Modeling

  • Sin-Nyum Choi;Woong Kim
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.12
    • /
    • pp.41-48
    • /
    • 2023
  • In this paper, we propose an analysis of recent trends in the NFT (Non-Fungible Token) industry using topic modeling techniques, focusing on their universal application across various industrial fields. For this study, patent data was utilized to understand industry trends. We collected data on 371 domestic and 454 international NFT-related patents registered in the patent information search service KIPRIS from 2017, when the first NFT standard was introduced, to October 2023. In the preprocessing stage, stopwords and lemmas were removed, and only noun words were extracted. For the analysis, the top 50 words by frequency were listed, and their corresponding TF-IDF values were examined to derive key keywords of the industry trends. Next, Using the LDA algorithm, we identified four major latent topics within the patent data, both domestically and internationally. We analyzed these topics and presented our findings on NFT industry trends, underpinned by real-world industry cases. While previous review presented trends from an academic perspective using paper data, this study is significant as it provides practical trend information based on data rooted in field practice. It is expected to be a useful reference for professionals in the NFT industry for understanding market conditions and generating new items.

The Tresnds of Artiodactyla Researches in Korea, China and Japan using Text-mining and Co-occurrence Analysis of Words (텍스트마이닝과 동시출현단어분석을 이용한 한국, 중국, 일본의 우제목 연구 동향 분석)

  • Lee, Byeong-Ju;Kim, Baek-Jun;Lee, Jae Min;Eo, Soo Hyung
    • Korean Journal of Environment and Ecology
    • /
    • v.33 no.1
    • /
    • pp.9-15
    • /
    • 2019
  • Artiodactyla, which is an even-toed mammal, widely inhabits worldwide. In recent years, wild Artiodactyla species have attracted public attention due to the rapid increase of crop damage and road-kill caused by wild Artiodactyla such as water deer and wild boar and the decrease of some species such as long-tailed goral and musk deer. In spite of such public attention, however, there have been few studies on Artiodactyla in Korea, and no studies have focused on the trend analysis of Artiodactyla, making it difficult to understand actual problems. Many recent studies on trend used text-mining and co-occurrence analysis to increase objectivity in the classification of research subjects by extracting keywords appearing in literature and quantifying relevance between words. In this study, we analyzed texts from research articles of three countries (Korea, China, and Japan) through text-mining and co-occurrence analysis and compared the research subjects in each country. We extracted 199 words from 665 articles related to Artiodactyla of three countries through text-mining. Three word-clusters were formed as a result of co-occurrence analysis on extracted words. We determined that cluster1 was related to "habitat condition and ecology", cluster2 was related to "disease" and cluster3 was related to "conservation genetics and molecular ecology". The results of comparing the rates of occurrence of each word clusters in each country showed that they were relatively even in China and Japan whereas Korea had a prevailing rate (69%) of cluster2 related to "disease". In the regression analysis on the number of words per year in each cluster, the number of words in both China and Japan increased evenly by year in each cluster while the rate of increase of cluster2 was five times more than the other clusters in Korea. The results indicate that Korean researches on Artiodactyla tended to focus on diseases more than those in China and Japan, and few researchers considered other subjects including habitat characteristics, behavior and molecular ecology. In order to control the damage caused by Artiodactyla and to establish a reasonable policy for the protection of endangered species, it is necessary to accumulate basic ecological data by conducting researches on wild Artiodactyla more.

A study on the classification of research topics based on COVID-19 academic research using Topic modeling (토픽모델링을 활용한 COVID-19 학술 연구 기반 연구 주제 분류에 관한 연구)

  • Yoo, So-yeon;Lim, Gyoo-gun
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.1
    • /
    • pp.155-174
    • /
    • 2022
  • From January 2020 to October 2021, more than 500,000 academic studies related to COVID-19 (Coronavirus-2, a fatal respiratory syndrome) have been published. The rapid increase in the number of papers related to COVID-19 is putting time and technical constraints on healthcare professionals and policy makers to quickly find important research. Therefore, in this study, we propose a method of extracting useful information from text data of extensive literature using LDA and Word2vec algorithm. Papers related to keywords to be searched were extracted from papers related to COVID-19, and detailed topics were identified. The data used the CORD-19 data set on Kaggle, a free academic resource prepared by major research groups and the White House to respond to the COVID-19 pandemic, updated weekly. The research methods are divided into two main categories. First, 41,062 articles were collected through data filtering and pre-processing of the abstracts of 47,110 academic papers including full text. For this purpose, the number of publications related to COVID-19 by year was analyzed through exploratory data analysis using a Python program, and the top 10 journals under active research were identified. LDA and Word2vec algorithm were used to derive research topics related to COVID-19, and after analyzing related words, similarity was measured. Second, papers containing 'vaccine' and 'treatment' were extracted from among the topics derived from all papers, and a total of 4,555 papers related to 'vaccine' and 5,971 papers related to 'treatment' were extracted. did For each collected paper, detailed topics were analyzed using LDA and Word2vec algorithms, and a clustering method through PCA dimension reduction was applied to visualize groups of papers with similar themes using the t-SNE algorithm. A noteworthy point from the results of this study is that the topics that were not derived from the topics derived for all papers being researched in relation to COVID-19 (

    ) were the topic modeling results for each research topic (
    ) was found to be derived from For example, as a result of topic modeling for papers related to 'vaccine', a new topic titled Topic 05 'neutralizing antibodies' was extracted. A neutralizing antibody is an antibody that protects cells from infection when a virus enters the body, and is said to play an important role in the production of therapeutic agents and vaccine development. In addition, as a result of extracting topics from papers related to 'treatment', a new topic called Topic 05 'cytokine' was discovered. A cytokine storm is when the immune cells of our body do not defend against attacks, but attack normal cells. Hidden topics that could not be found for the entire thesis were classified according to keywords, and topic modeling was performed to find detailed topics. In this study, we proposed a method of extracting topics from a large amount of literature using the LDA algorithm and extracting similar words using the Skip-gram method that predicts the similar words as the central word among the Word2vec models. The combination of the LDA model and the Word2vec model tried to show better performance by identifying the relationship between the document and the LDA subject and the relationship between the Word2vec document. In addition, as a clustering method through PCA dimension reduction, a method for intuitively classifying documents by using the t-SNE technique to classify documents with similar themes and forming groups into a structured organization of documents was presented. In a situation where the efforts of many researchers to overcome COVID-19 cannot keep up with the rapid publication of academic papers related to COVID-19, it will reduce the precious time and effort of healthcare professionals and policy makers, and rapidly gain new insights. We hope to help you get It is also expected to be used as basic data for researchers to explore new research directions.

  • Improving Bidirectional LSTM-CRF model Of Sequence Tagging by using Ontology knowledge based feature (온톨로지 지식 기반 특성치를 활용한 Bidirectional LSTM-CRF 모델의 시퀀스 태깅 성능 향상에 관한 연구)

    • Jin, Seunghee;Jang, Heewon;Kim, Wooju
      • Journal of Intelligence and Information Systems
      • /
      • v.24 no.1
      • /
      • pp.253-266
      • /
      • 2018
    • This paper proposes a methodology applying sequence tagging methodology to improve the performance of NER(Named Entity Recognition) used in QA system. In order to retrieve the correct answers stored in the database, it is necessary to switch the user's query into a language of the database such as SQL(Structured Query Language). Then, the computer can recognize the language of the user. This is the process of identifying the class or data name contained in the database. The method of retrieving the words contained in the query in the existing database and recognizing the object does not identify the homophone and the word phrases because it does not consider the context of the user's query. If there are multiple search results, all of them are returned as a result, so there can be many interpretations on the query and the time complexity for the calculation becomes large. To overcome these, this study aims to solve this problem by reflecting the contextual meaning of the query using Bidirectional LSTM-CRF. Also we tried to solve the disadvantages of the neural network model which can't identify the untrained words by using ontology knowledge based feature. Experiments were conducted on the ontology knowledge base of music domain and the performance was evaluated. In order to accurately evaluate the performance of the L-Bidirectional LSTM-CRF proposed in this study, we experimented with converting the words included in the learned query into untrained words in order to test whether the words were included in the database but correctly identified the untrained words. As a result, it was possible to recognize objects considering the context and can recognize the untrained words without re-training the L-Bidirectional LSTM-CRF mode, and it is confirmed that the performance of the object recognition as a whole is improved.

    Popularization of Marathon through Social Network Big Data Analysis : Focusing on JTBC Marathon (소셜 네트워크 빅데이터 분석을 통한 마라톤 대중화 : JTBC 마라톤대회를 중심으로)

    • Lee, Ji-Su;Kim, Chi-Young
      • Journal of Korea Entertainment Industry Association
      • /
      • v.14 no.3
      • /
      • pp.27-40
      • /
      • 2020
    • The marathon has long been established as a representative lifestyle for all ages. With the recent expansion of the Work and Life Balance trend across the society, marathon with a relatively low barrier to entry is gaining popularity among young people in their 20s and 30s. By analyzing the issues and related words of the marathon event, we will analyze the spottainment elements of the marathon event that is popular among young people through keywords, and suggest a development plan for the differentiated event. In order to analyze keywords and related words, blogs, cafes and news provided by Naver and Daum were selected as analysis channels, and 'JTBC Marathon' and 'Culture' were extracted as key words for data search. The data analysis period was limited to a three-month period from August 13, 2019 to November 13, 2019, when the application for participation in the 2019 JTBC Marathon was started. For data collection and analysis, frequency and matrix data were extracted through social matrix program Textom. In addition, the degree of the relationship was quantified by analyzing the connection structure and the centrality of the degree of connection between the words. Although the marathon is a personal movement, young people share a common denominator of "running" and form a new cultural group called "running crew" with other young people. Through this, it was found that a marathon competition culture was formed as a festival venue where people could train together, participate together, and escape from the image of a marathon run alone and fight with themselves.

    Analysis of Research Trends Related to Children's Department of Church School : Focusing on Domestic Dissertations (교회학교 유치부 관련 연구 동향 분석 : 국내 학위 논문 중심으로)

    • Kim, Minjung
      • Journal of Christian Education in Korea
      • /
      • v.71
      • /
      • pp.181-210
      • /
      • 2022
    • The purpose of this study was to investigate the research trends related to the children's department of church schools. The purpose of this study is to present basic data for the study of the children's department of church schools by analyzing the research period, research contents, research methods, and subjects of research related to the children's department of church schools. For this study, 50 domestic master's and doctoral dissertations searched through the National Assembly Library and the Research Information Sharing Service(RISS) were extracted with the keywords of 'church school' and 'children's department'. The frequency and percentage were calculated by analyzing the research related to the children's department of the church school according to four criteria: research period, research content, research method, and research subject. As a result of the study, first, the research trend of research papers in the children's department of church schools was found to be 49 articles (98%) for master's degrees and 1 article (2%) for doctoral degrees from 1980 to 2022. Trends by research period are focused on master's degrees. Second, the trend by research content was 27 practical studies (54%) and 23 theory studies (46%). In the research related to the children's department of church schools, the practical research accounted for a relatively high percentage compared to the theory research. Third, the trends by research method were in the order of 30 literature studies (60%), 19 quantitative studies (38%), and 1 qualitative study (2%). Research related to children's departments in church schools is being actively conducted with a focus on literature research. Fourth, as for the trends by study subject, the study was conducted focusing on physical subjects, with 35 subjects (70%) and 15 subjects (30%) of personal subjects. As research is conducted from physical objects to church schools and media, it is necessary to study the connection between church schools and families. As the research on church school kindergarten is focused on adults (teachers, parents, and educational preachers), in-depth research on children in church schools and qualitative research with voices from the field of children's department in church schools are required.

    Analysis of Research Trends Related to Christian Picture Books : Focusing on Domestic Dissertations (기독교 그림책 관련 연구 동향 분석 : 국내 학위 논문 중심으로)

    • Kim, Minjung
      • Journal of Christian Education in Korea
      • /
      • v.68
      • /
      • pp.245-277
      • /
      • 2021
    • The purpose of this study was to investigate the trend of Christian picture book-related research. The purpose of this study is to present basic data for various and balanced research and development in the Christian picture book field by analyzing the research period, research content, and research method related to Christian picture books. For this study, 45 domestic master's and doctoral dissertations were extracted through the National Assembly Library and the Academic Research Information Service (RISS) with the keywords of 'Christian picture book', 'Bible picture book', 'Christian story', and 'Bible story'. The frequency and percentage were calculated by analyzing Christian picture book-related studies according to four criteria: research period, research content, research method, and research subject. As a result of the study, first, the trend of Christian picture book research papers by research period from 1999 to 2021 was 43 master's articles (95.6%) and 2 doctoral articles (4.4%), focusing on Christian picture book-related studies. Second, the trend by research content was found to be 12 basic studies (26.6%) and 33 practical studies (73.4%). Research related to Christian picture books is being actively conducted focusing on practical research rather than basic research. Third, the trend by research method was in the order of 33 quantitative studies (73.4%), 11 literature studies (24.4%), and 1 qualitative study (2.2%). Research related to Christian picture books is centered on quantitative research, and literature research and qualitative research account for a relatively low proportion. Fourth, as for the trends by study subject, there were 35 human subjects (77.8%) and 10 physical subjects (22.2%). Among human subjects, 33 single subjects (73.4%) and 2 mixed subjects (4.4%) were found, and among single subjects, 30 studies (66.7%) targeting children were high. In other words, research on Christian picture books had a higher proportion of studies with children as a single subject than mixed subjects between children and children, children and teachers, and between children and parents.

    Analysis of Research Trends Related to Forest Play: Focusing on Domestic Dissertations (숲놀이 관련 연구 동향 분석: 국내 학위 논문 중심으로)

    • Kim, Minjung
      • Journal of Christian Education in Korea
      • /
      • v.69
      • /
      • pp.77-104
      • /
      • 2022
    • The purpose of this study was to investigate the research trend of forest play. The purpose of this study is to provide basic data for the vitalization of forest play research by analyzing the research period, research content, and research methods. For this study, 57 domestic master's and doctoral dissertations were extracted through the National Assembly Library and the Research Information Sharing Service(RISS) with the keywords of 'forest', 'play', and 'forest play'. The frequency and percentage were calculated by analyzing forest play research based on four criteria: research period, research content, research method, and research subject. As a result of the research, first, the trend of forest play research by period is from 2011 to 2021, with 49 articles (85.9%) for master's degrees and 8 articles (14.1%) for doctor's degrees. Second, the trend by research content was found to be 16 basic studies (28.1%) and 41 practical studies (71.9%). Forest play research is being actively conducted centered on practical research. Third, the trends by research method were in the order of 39 quantitative studies (68.4%), 17 qualitative studies (29.8%), and 1 literature study (1.8%). Forest play research is focused on quantitative research, and comparatively qualitative research and literature research account for a low proportion. Fourth, the trend by study subject was 56 single subject studies (98.2%). The single subjects were 52 children (91.2%), 3 teachers (5.2%), and 1 parent (1.8%). As for the mixed subjects, there is one study (1.8%) targeting children and parents, and it is necessary to conduct a study with mixed subjects. As for the study of material subjects, 42 articles (73.7%) in the natural environment, 13 articles (22.8%) in educational institutions, and 2 articles (3.5%) in the media were found in the order. Research on the home environment related to forest play is insufficient, so research on parents, children-parents, and home environment related to forest play should be conducted in the future.


    (34141) Korea Institute of Science and Technology Information, 245, Daehak-ro, Yuseong-gu, Daejeon
    Copyright (C) KISTI. All Rights Reserved.