Analysis method of patent document to Forecast Patent Registration

Koo, Jung-Min;Park, Sang-Sung;Shin, Young-Geun;Jung, Won-Kyo;Jang, Dong-Sik;

doi:10.5762/KAIS.2010.11.4.1458

Journal of the Korea Academia-Industrial cooperation Society (한국산학기술학회논문지)

Volume 11 Issue 4
/
Pages.1458-1467
/
2010
/
1975-4701(pISSN)
/
2288-4688(eISSN)

The Korea Academia-Industrial cooperation Society (한국산학기술학회)

DOI QR Code

Analysis method of patent document to Forecast Patent Registration

특허 등록 예측을 위한 특허 문서 분석 방법

Koo, Jung-Min (Division of Information Management Engineering, Korea University) ;
Park, Sang-Sung (Division of Information Management Engineering, Korea University) ;
Shin, Young-Geun (Division of Information Management Engineering, Korea University) ;
Jung, Won-Kyo (Division of Information Management Engineering, Korea University) ;
Jang, Dong-Sik (Division of Information Management Engineering, Korea University)

구정민 (고려대학교 정보경영공학부) ;
박상성 (고려대학교 정보경영공학부) ;
신영근 (고려대학교 정보경영공학부) ;
정원교 (고려대학교 정보경영공학부) ;
장동식 (고려대학교 정보경영공학부)

Received : 2010.02.05
Accepted : 2010.04.09
Published : 2010.04.30

https://doi.org/10.5762/KAIS.2010.11.4.1458 Citation PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

Recently, imitation and infringement rights of an intellectual property are being recognized as impediments to nation's industrial growth. To prevent the huge loss which comes from theses impediments, many researchers are studying protection and efficient management of an intellectual property in various ways. Especially, the prediction of patent registration is very important part to protect and assert intellectual property rights. In this study, we propose the patent document analysis method by using text mining to predict whether the patent is registered or rejected. In the first instance, the proposed method builds the database by using the word frequencies of the rejected patent documents. And comparing the builded database with another patent documents draws the similarity value between each patent document and the database. In this study, we used k-means which is partitioning clustering algorithm to select criteria value of patent rejection. In result, we found conclusion that some patent which similar to rejected patent have strong possibility of rejection. We used U.S.A patent documents about bluetooth technology, solar battery technology and display technology for experiment data.

최근 지식재산권의 모방과 권리 침해는 국가 산업발전의 저해요소로 인식되고 있다. 많은 연구자들은 이러한 저해요소로 인하여 발생하는 막대한 손실을 막기 위해 지식재산권의 보호와 효율적 관리에 관한 연구를 다양하게 진행 중이다. 특히, 특허 등록 예측은 지식재산권 보호와 권리 주장을 위해 매우 중요한 연구이다. 본 연구는 텍스트 마이닝 기법을 이용한 특허문서 분석을 통하여 특허 등록 및 거절 여부를 예측하는 방법을 제안한다. 먼저 거절된 특허문서들의 단어 빈도수를 이용하여 데이터베이스를 생성한다. 그리고 생성한 데이터베이스와 다른 특허문서들을 비교하여 각 문서와 데이터베이스와의 유사한 정도를 판단하는 유사치를 도출한다. 본 논문에서는 특허 거절 기준 값을 선정하기 위하여 분할 군집화 알고리즘인 k-means 사용하였다. 그 결과로 거절된 특허 문서와 유사한 특허 문서는 거절될 가능성이 높다는 결론을 얻을 수 있었다. 실험을 위한 데이터는 현재 미국에 출원되어 있는 블루투스 기술, 태양전지 기술 그리고 디스플레이에 관한 특허 문서를 이용하였다.

Keywords

Patent Forecast Text Mining

References

http://en.wikipedia.org/wiki/Software_patents
Archibugi. D. and Pianta. M, " Measuring technological change through patents and innovation survey", Technovation, Vol.16, No.9, pp.451 - 468, 1996. https://doi.org/10.1016/0166-4972(96)00031-4
Be'de'carrax. C. and Huot. C, "A new methodology for systematic exploitation of technology databases", Information Processing & Management, Vol.30, No.3, pp.407 - 418, 1994. https://doi.org/10.1016/0306-4573(94)90053-1
Ernst. H., "Use of patent data for technological forecasting: the diffusion of CNC-technology in the machine tool industry", Small Business Economics, Vol9, No.4, pp.361 - 381, 1997. https://doi.org/10.1023/A:1007921808138
Lai. K.-K. and Wu. S.-J, "Using the patent co-citation approach to establish a new patent classification system", Information Processing & Management, Vol.41, No.2, pp.313 - 330, 2005. https://doi.org/10.1016/j.ipm.2003.11.004
Fattori. M, Pedrazzi. G. and Turra. R. "Text mining applied to patent mapping: a practical business case", World Patent Information, Vol.25, No.4, pp.335 - 342, 2003. https://doi.org/10.1016/S0172-2190(03)00113-3
Lent. B, Agrawal. R. and Srikant. R, "Discovering trends in text databases", In Proceedings of international conference on knowledge discovery and data mining, 1997.
B. G. Yoon and Y. T. Park, "A text-mining-based patent network: Analytical tool for high-technology trend", Journal of High Technology Management Research Vol.15, No.1, pp.37 - 50, 2004. https://doi.org/10.1016/j.hitech.2003.09.003
Y. S. Tian, Y. H. Kim, Y. J. Jeong, J. H. Ryu, and S. H. Myaeng, "A Language Model and Clue based Machine Learning Method for Discovering Technology Trends from Patent Text", Journal of Korean Institute of Information Scientists and Engineers, Vol 36, No 5, pp.420-429, 2009.
Clifton. C. and Cooley. R, "TopCat: Data Mining for Topic Identification in a Text Corpus", Proceedings of the Third European Conference of Principles and Practice of Knowledge Discovery in Databases, 1999.
Yang. Y, "An Evaluation of Statistical Approaches to Text Categorization", Journal of Information Retrieval, Vol.1, No.1-2, pp.69-90, 1999. https://doi.org/10.1023/A:1009982220290
Korea Intellectual Property Office, Patent and Information Analysis, Korea Intellectual Property Office, 2007.
오일석, Pattern Recognition, 교보문고, 2008.