Browse > Article
http://dx.doi.org/10.9708/jksci.2022.27.08.143

Patent Keyword Analysis using Gamma Regression Model and Visualization  

Jun, Sunghae (Dept. of Big Data and Statistics, Cheongju University)
Abstract
Since patent documents contain detailed results of research and development technologies, many studies on various patent analysis methods for effective technology analysis have been conducted. In particular, research on quantitative patent analysis by statistics and machine learning algorithms has been actively conducted recently. The most used patent data in quantitative patent analysis is technology keywords. Most of the existing methods for analyzing the keyword data were models based on the Gaussian probability distribution with random variable on real space from negative infinity to positive infinity. In this paper, we propose a model using gamma probability distribution to analyze the frequency data of patent keywords that can theoretically have values from zero to positive infinity. In addition, in order to determine the regression equation of the gamma-based regression model, two-mode network is constructed to visualize the technological association between keywords. Practical patent data is collected and analyzed for performance evaluation between the proposed method and the existing Gaussian-based analysis models.
Keywords
Patent Keyword; Generalized Linear Model; Gaussian Distribution; Gamma Distribution; Patent Analysis;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 A. T. Roper, S. W. Cunningham, A. L. Porter, T. W. Mason, F. A. Rossini, and J. Banks, "Forecasting and Management of Technology" Hoboken, NJ, John Wiley & Sons, 2011.
2 J. Kim, N. Kim, Y. Jung, and S. Jun, "Patent data analysis using functional count data model," Soft Computing, Vol. 23, Iss. 18, pp. 8815-8826, September 2019.   DOI
3 K. S. Alkaabi, and J. Yu, "A Study on the Integration Between Smart Mobility Technology and Information Communication Technology (ICT) Using Patent Analysis," Journal of The Korea Society of Computer and Information, Vol. 24, No. 6, pp. 89-97, 2019.   DOI
4 S. Park, and S. Jun, "Technological Cognitive Diagnosis Model for Patent Keyword Analysis," ICT Express, Vol. 6, pp. 57-61, 2020. https://doi.org/10.1016/j.icte.2019.09.004   DOI
5 J. Choi, D. Jang, S. Jun, and S. Park, "A Predictive Model of Technology Transfer using Patent Analysis," Sustainability, Vol. 7, No. 12, pp. 16175-16195, 2015.   DOI
6 J. Choi, and S. Jun, "Big Data Smoothing and Outlier Removal for Patent Big Data Analysis," Journal of The Korea Society of Computer and Information, Vol. 21, No. 8, pp. 77-84, 2016.   DOI
7 D. Hunt, L. Nguyen, and M. Rodgers, "Patent Searching Tools & Techniques" Hoboken, NJ, Wiley, 2007.
8 I. Feinerer, K. Hornik, and D. Meyer, "Text mining infrastructure in R," Journal of Statistical Software, Vol. 25, No. 5, pp. 1-54, March 2008.
9 K. P. Murphy, "Machine Learning: a probabilistic perspective" Cambridge MA, MIT Press, 2012.
10 S. Jun "Bayesian Count Data Modeling for Finding Technological Sustainability," Sustainability, Vol. 10, No. 9, pp. 3220, 2018.
11 D. C. Montgomery, E. A. Peck, and G. G. Vining, "Introduction to Linear Regression Analysis" Hoboken, New Jersey, John Wiley & Sons, 2012.
12 C. T. Butts, "network: A Package for Managing Relational Data in R," Journal of Statistical Software, Vol. 24, No. 2, pp. 1-36, 2008.
13 R. V. Hogg, J. W. Mckean, and A. T. Craig, "Introduction to Mathematical Statistics, eighth edition" Pearson, Essex, UK, 2020.
14 A. Gelman, J. B. Carlin, H. S. Stern, D. B. Dunson, A. Vehtari, and D. B. Rubin, "Bayesian Data Analysis, Third Edition" Boca Raton, FL, Chapman & Hall/CRC Press, 2013.
15 USPTO, The United States Patent and Trademark Office, http://www.uspto.gov, 2022.
16 T. Hastie, R. Tibshirani, and J, Friedman, "The Elements of Statistical Learning, Data Mining, Inference, and Prediction" New York, Springer, 2017.
17 S. Jun, S. J. Lee, J. B. Ryu, and S. Park, "A novel method of IP R&D using patent analysis and expert survey," Queen Mary Journal of Intellectual Property, Vol. 5, No. 4, pp. 474-494, October 2015.   DOI
18 J. Park, and D. Kang, "Trend Analysis of Unmanned Technology Using Patent Information," Journal of The Korea Society of Computer and Information, Vol. 22, No. 3, pp. 89-96, 2017.   DOI
19 D. Uhm, J. Ryu, and S. Jun, "Patent Data Analysis of Artificial Intelligence Using Bayesian Interval Estimation," Applied Sciences, Vol. 10, pp. 570, 2020. https://doi.org/10.3390/app10020570   DOI
20 I. Feinerer, and K. Hornik, Package 'tm' Ver. 0.7-8 Text Mining Package, CRAN of R project, 2020
21 C. T. Butts, D. Hunter, M. Handcock, S. Bender-deMoll, J. Horner, L. Wang, P. N. Krivitsky, B. Knapp, M. Bojanowski, and C. Klumb, Package 'network' Ver. 1.17.2, Classes for Relational Data, CRAN of R project, 2022.
22 R. V. Hogg, E. A. Tanis, and D. L. Zimmerman, "Probability and Statistical Inference ninth edition" Essex, England, Pearson, 2015.
23 J. Park, "Trend Analysis of Artificial Intelligence Technology Using Patent Information," Journal of The Korea Society of Computer and Information, Vol. 23, No. 4, pp. 9-16, 2018.   DOI
24 J. Kim, J. Ryu, S. Lee, and S. Jun, "Penalized Regression Models for Patent Keyword Analysis," Model Assisted Statistics and Applications-International Journal, Vol. 12. pp. 239-244, 2017.   DOI
25 R Development Core Team, R: A language and environment for statistical computing, R Foundation for Statistical Computing, Vienna, Austria, http://www.R-project.org, 2022.