• Title/Summary/Keyword: citation prediction

Search Result 9, Processing Time 0.026 seconds

A Study on Developing a Prediction Model of Patent Citation Counts (특허인용 예측모형 구축에 관한 연구)

  • Yoo, Jae-Bok;Chung, Young-Mee
    • Journal of the Korean Society for information Management
    • /
    • v.27 no.4
    • /
    • pp.239-258
    • /
    • 2010
  • The purpose of this study is to develop a prediction model of patent citation counts based on major factors which affect patent citation. To this end, we performed multiple regression analysis between the patent citation counts and five explanatory variables such as the number of pages, the number of claims, the reference-average-citation rate, the strength of bibliographic coupling, and the document similarity proved as having 5% or more standardized variances($r^2$) with patent citation counts, with a test dataset of U.S. patents in five subject fields. As a result, our prediction models showed 58.3% to 89.6% predictability depending on subject fields and revealed the document similarity has the highest impact on citation counts among the five predictive variables in all the subject fields. The result of comparison between the predicted citation counts and the actual ones confirmed the usefulness of the citation prediction models built for each subject field.

Documents recommendation using large citation data (거대 인용 자료를 이용한 문서 추천 방법)

  • Chae, Minwoo;Kang, Minsoo;Kim, Yongdai
    • Journal of the Korean Data and Information Science Society
    • /
    • v.24 no.5
    • /
    • pp.999-1011
    • /
    • 2013
  • In this research, we propose a document recommendation method which can find documents that are relatively important to a specific document based on citation information. The key idea is parameter tuning in the Neumann kernal which is an intermediate between a measure of importance (HITS) and of relatedness (co-citation). Our method properly selects the tuning parameter ${\gamma}$ in the Neumann kernal minimizing the prediction error in future citation. We also discuss some comutational issues needed for analysing large citation data. Finally, results of analyzing patents data from the US Patent Office are given.

The Determining Effects of the Backward Citations on the Attributes of Patent Quality : Using the Korean Patent Citations (특허의 질적 특성에 특허인용이 미치는 효과 분석 : 한국 특허의 전후방 특허인용관계를 중심으로)

  • Choo, Kineung
    • Journal of Korea Technology Innovation Society
    • /
    • v.21 no.3
    • /
    • pp.1127-1154
    • /
    • 2018
  • This paper aims to contribute to estimating the value of a patent by explaining the unobservable attributes of patent quality using observable patent citation indices. The paper first constructs patent citation data and identifies firm, university, and research institute among assignees, and then tries to explain attributes of patent quality using backward citation indices. Backward citation indices carrying information about technological sources which a given patent is based on turn out to be good predictors of forward citation indices carrying information about attributes of patent quality. Finding the functional relationships between attributes of patent quality and backward citations will lead to the improved estimation and prediction of patent value. It is found out that backward citation indices are strongly correlated the technological diversity of a patent. The paper also suggests that with whom an organization chooses to collaborate affects the attributes of patent quality.

Review of Trends in Recent Climate Research by Korean Climatologists (최근 한국의 기후학 연구 동향)

  • Lee, Eun-Gul;Lee, Kyoung-Mi;Lee, Seung-Ho
    • Journal of the Korean Geographical Society
    • /
    • v.47 no.4
    • /
    • pp.490-513
    • /
    • 2012
  • This study reviewed recent trends in climate research by Korean climatologists. We analyzed six domestic journals listed in the Korean Citation Index and four international journals listed in the Science Citation Index during 2001-2011. Research on climate change has rapidly increased during the study period and studies on precipitation variability have been given continual attentions among Korean climatologists. In climate change research, meteorologists focused on characteristics, prediction, and causes while geographers were more interested in characteristics and impacts of climate change. In applied climatology and bioclimatology, research on the impacts of climate change on agriculture, livestock, vegetation, and human health has increased under recent climate change. While there has been steady interest in climatography by Korean climatologists, the number of papers has generally decreased over the recent period.

  • PDF

An Analysis on the Pattern of Citation in Articles for the Prediction of Article Impact (논문 영향력 예측을 위한 저자별 논문의 피인용 패턴 분석)

  • Lee, Hyejin;Lee, Choon-shil
    • Proceedings of the Korean Society for Information Management Conference
    • /
    • 2013.08a
    • /
    • pp.15-18
    • /
    • 2013
  • 본 연구는 최근에 발표된 단위 논문별 영향력을 예측하기 위해 해당 저자가 이전에 발표했던 논문들의 년도별 피인용 추이를 분석하였다. 최근 IF가 가장 높은 "Cell & Tissue Engineering" 분야와 그 분야에서 IF가 가장 높은 저널 "Stem Cell"을 선정하여 최근에 실린 논문의 저자정보를 파악하고, 해당 저자의 이전에 발표한 논문들을 추출하였다. 분석 결과, 저자가 발표한 논문들이 최근에 인용이 많이 일어나면 이후 발표한 논문의 인용빈도가 높아지는 것으로 나타났으며, 특히 동일한 h-index를 가진 저자들 간의 비교에서 더욱 두드러지게 나타났다.

  • PDF

Review of Research Trends on Landslide Hazards (산사태 재해 관련 학술동향 분석)

  • Kim, J.H.;Kim, W.Y.
    • The Journal of Engineering Geology
    • /
    • v.23 no.3
    • /
    • pp.305-314
    • /
    • 2013
  • Recent international and national research trends in landslide hazards were analyzed by performing a literature search of relevant scientific journals. For obtaining data from Korea, we used 'Information for Environmental Geology' (IEG), which covers 17 journals in the field of environmental geology. A total of 54 articles related to landslide hazards were found in 5 journals published in the period 2000-2012. The most common topic was landslide prediction or susceptibility (29 articles), followed by landslide mechanisms. For international information, we analyzed 1,851 articles from the 'Web Of Science' published from 2003 to the present. Researchers in Italy have published the greatest number of papers in this field, while papers from Korea rank first in terms of the citation index.

Topic Modeling of Profit Adjustment Research Trend in Korean Accounting (텍스트 마이닝을 이용한 이익조정 연구동향 토픽모델링)

  • Kim, JiYeon;Na, HongSeok;Park, Kyung Hwan
    • Journal of Digital Convergence
    • /
    • v.19 no.1
    • /
    • pp.125-139
    • /
    • 2021
  • This study identifies the trend of Korean accounting researches on profit adjustment. We analyzed the abstract of accounting research articles published in Korean Citation Index (KCI) by using text mining technique. Among papers whose themes were profit adjustment, topics were divided into 4 parts: (i) Auditing and audit reports, (ii) corporate taxes and debt ratios, (iii) general management strategy of companies, and (iv) financial statements and accounting principles. Unlike the prediction that financial statements and accounting principles would be the main topic, auditing was analyzed as the most studied area. We analyzed topic trends based on the number of papers by topic, and could figure out the impact of K-IFRS introduction on profit adjustment research. By using Big Data method, this study enabled the division of research themes that have not been available in the past studies. This study enables the policy makers and business managers to learn about additional considerations in addition to accounting principles related to profit adjustment.

Machine- and Deep Learning Modelling Trends for Predicting Harmful Cyanobacterial Cells and Associated Metabolites Concentration in Inland Freshwaters: Comparison of Algorithms, Input Variables, and Learning Data Number (담수 유해남조 세포수·대사물질 농도 예측을 위한 머신러닝과 딥러닝 모델링 연구동향: 알고리즘, 입력변수 및 학습 데이터 수 비교)

  • Yongeun Park;Jin Hwi Kim;Hankyu Lee;Seohyun Byeon;Soon-Jin Hwang;Jae-Ki Shin
    • Korean Journal of Ecology and Environment
    • /
    • v.56 no.3
    • /
    • pp.268-279
    • /
    • 2023
  • Nowadays, artificial intelligence model approaches such as machine and deep learning have been widely used to predict variations of water quality in various freshwater bodies. In particular, many researchers have tried to predict the occurrence of cyanobacterial blooms in inland water, which pose a threat to human health and aquatic ecosystems. Therefore, the objective of this study were to: 1) review studies on the application of machine learning models for predicting the occurrence of cyanobacterial blooms and its metabolites and 2) prospect for future study on the prediction of cyanobacteria by machine learning models including deep learning. In this study, a systematic literature search and review were conducted using SCOPUS, which is Elsevier's abstract and citation database. The key results showed that deep learning models were usually used to predict cyanobacterial cells, while machine learning models focused on predicting cyanobacterial metabolites such as concentrations of microcystin, geosmin, and 2-methylisoborneol (2-MIB) in reservoirs. There was a distinct difference in the use of input variables to predict cyanobacterial cells and metabolites. The application of deep learning models through the construction of big data may be encouraged to build accurate models to predict cyanobacterial metabolites.

A Study on Recent Research Trend in Management of Technology Using Keywords Network Analysis (키워드 네트워크 분석을 통해 살펴본 기술경영의 최근 연구동향)

  • Kho, Jaechang;Cho, Kuentae;Cho, Yoonho
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.2
    • /
    • pp.101-123
    • /
    • 2013
  • Recently due to the advancements of science and information technology, the socio-economic business areas are changing from the industrial economy to a knowledge economy. Furthermore, companies need to do creation of new value through continuous innovation, development of core competencies and technologies, and technological convergence. Therefore, the identification of major trends in technology research and the interdisciplinary knowledge-based prediction of integrated technologies and promising techniques are required for firms to gain and sustain competitive advantage and future growth engines. The aim of this paper is to understand the recent research trend in management of technology (MOT) and to foresee promising technologies with deep knowledge for both technology and business. Furthermore, this study intends to give a clear way to find new technical value for constant innovation and to capture core technology and technology convergence. Bibliometrics is a metrical analysis to understand literature's characteristics. Traditional bibliometrics has its limitation not to understand relationship between trend in technology management and technology itself, since it focuses on quantitative indices such as quotation frequency. To overcome this issue, the network focused bibliometrics has been used instead of traditional one. The network focused bibliometrics mainly uses "Co-citation" and "Co-word" analysis. In this study, a keywords network analysis, one of social network analysis, is performed to analyze recent research trend in MOT. For the analysis, we collected keywords from research papers published in international journals related MOT between 2002 and 2011, constructed a keyword network, and then conducted the keywords network analysis. Over the past 40 years, the studies in social network have attempted to understand the social interactions through the network structure represented by connection patterns. In other words, social network analysis has been used to explain the structures and behaviors of various social formations such as teams, organizations, and industries. In general, the social network analysis uses data as a form of matrix. In our context, the matrix depicts the relations between rows as papers and columns as keywords, where the relations are represented as binary. Even though there are no direct relations between papers who have been published, the relations between papers can be derived artificially as in the paper-keyword matrix, in which each cell has 1 for including or 0 for not including. For example, a keywords network can be configured in a way to connect the papers which have included one or more same keywords. After constructing a keywords network, we analyzed frequency of keywords, structural characteristics of keywords network, preferential attachment and growth of new keywords, component, and centrality. The results of this study are as follows. First, a paper has 4.574 keywords on the average. 90% of keywords were used three or less times for past 10 years and about 75% of keywords appeared only one time. Second, the keyword network in MOT is a small world network and a scale free network in which a small number of keywords have a tendency to become a monopoly. Third, the gap between the rich (with more edges) and the poor (with fewer edges) in the network is getting bigger as time goes on. Fourth, most of newly entering keywords become poor nodes within about 2~3 years. Finally, keywords with high degree centrality, betweenness centrality, and closeness centrality are "Innovation," "R&D," "Patent," "Forecast," "Technology transfer," "Technology," and "SME". The results of analysis will help researchers identify major trends in MOT research and then seek a new research topic. We hope that the result of the analysis will help researchers of MOT identify major trends in technology research, and utilize as useful reference information when they seek consilience with other fields of study and select a new research topic.