Browse > Article
http://dx.doi.org/10.5762/KAIS.2011.12.8.3677

A Study on Development of Patent Information Retrieval Using Textmining  

Go, Gwang-Su (School of Industrial Management Engineering, Korea University)
Jung, Won-Kyo (School of Industrial Management Engineering, Korea University)
Shin, Young-Geun (School of Industrial Management Engineering, Korea University)
Park, Sang-Sung (School of Industrial Management Engineering, Korea University)
Jang, Dong-Sik (School of Industrial Management Engineering, Korea University)
Publication Information
Journal of the Korea Academia-Industrial cooperation Society / v.12, no.8, 2011 , pp. 3677-3688 More about this Journal
Abstract
The patent information retrieval system can serve a variety of purposes. In general, the patent information is retrieved using limited key words. To identify earlier technology and priority rights repeated effort is needed. This study proposes a method of content-based retrieval using text mining. Using the proposed algorithm, each of the documents is invested with characteristic value. The characteristic values are used to compare similarities between query documents and database documents. Text analysis is composed of 3 steps: stop-word, keyword analysis and weighted value calculation. In the test results, the general retrieval and the proposed algorithm were compared by using accuracy measurements. As the study arranges the result documents as similarities of the query documents, the surfer can improve the efficiency by reviewing the similar documents first. Also because of being able to input the full-text of patent documents, the users unacquainted with surfing can use it easily and quickly. It can reduce the amount of displayed missing data through the use of content based retrieval instead of keyword based retrieval for extending the scope of the search.
Keywords
Text Mining; TF-IDF; Precision; Stop-word; Patent Information Retrieval;
Citations & Related Records
Times Cited By KSCI : 4  (Citation Analysis)
연도 인용수 순위
1 Y. L. Chen and Y. T. Chiu, "An IPC-based vector space model for patent retrieval", Information Processing & Management, Vol. 47, Issue. 3, pp. 309-322, May, 2011.   DOI
2 K. K. Lai and S. J. Wu, "Using the patent co-citation approach to establish a new patent classification system", Information Processing & Management, Vol. 41, Issue. 2, pp. 313-330, March, 2005.   DOI
3 WIPS, http://search.wips.co.kr/
4 I. Feinerer, K. Hornik and D. Meyer. "Text mining infrastructure in R", Journal of Statistical Software, Vol. 25, Issue. 5, pp. 1-54, March 2008.
5 WIKIPEDIA, http://en.wikipedia.org/wiki/Vector_space_model
6 G. Salton and M. J. McGill, "Introduction to modern information retrieval", McGraw-Hill, 1983.
7 J. B. Baik, S. M. Kim and S. W. Lee, "Extracting Alternative Word Candidates for Patent Information Search", Korean Institute of Information Scientists and Engineers, Vol. 15, Issue. 2, pp. 299-303, April, 2009.   과학기술학회마을
8 Korea Intellectual Property Office, Patent and Information Analysis, Korea Intellectual Property Office, December, 2007.
9 http://www.kipris.or.kr/kor/use/use_1.jsp
10 KIPO, "The understanding of Intellectual Property", pp. 128-138, December, 2009.
11 K. J. Son and S. J. Lee, "Weighting Methods for compound Nouns in Patent Retrieval System", Korean Institute of Information Scientists and Engineers, Vol. 31, Issue. 1, pp. 895-897, April, 2004.   과학기술학회마을
12 H. G. Kim, S. H. Lee and Y. H. Mook, "Patent Search System Using IPC Clustering", Korea Contents Association, vol. 5, Issue. 2, pp. 103-106, November, 2007.
13 L. H. Tong, H. Cong and S. Lixiang, "Automatic classification of patent documents for TRIZ users", World Patent Information, Vol. 28, Issue. 1, pp. 6-13, March, 2006.   DOI