Search | Korea Science

A Study of Patent Document Processing by SGML (SGML을 이용한 특허정보처리 연구)

Kwon, Young-Sook
- Journal of Information Management
- /
- v.30 no.3
- /
- pp.44-54
- /
- 1999
A description of SGML(Standard Generalized Markup Language) is given together with a detailed description of WIPO Standard ST.32. The benefits of the use of SGML are highlighted-its system Independence and flexibility in building publication systems and full-text databases. A structure of WIPO Standard ST,32 based patent content is defined by DTD(document type definition) written in ST.32, and full-text itself is described with generalized markup depending on DTD. This article explains how to represent a document structure : a hierarchy structure like a entire document, a specific, sub-document, a paragraph, or non-hirarchy structure like a table drawings, or chemical structures. Merits of SGML In patent document processing are also discussed.
PDF

A Study on Implementation of Patent Fulltext Linking System based on Patent DOI (특허 DOI에 기반한 특허원문연계체계 구축에 관한 연구)

권오진;노경란
- Proceedings of the Korea Contents Association Conference
- /
- 2003.05a
- /
- pp.319-322
- /
- 2003
This Paper reviews various formats of patent numbers that unique identifier of patent document according to country. And it describes standard DOI model that can unify each country＇s serveral patent numbers. Based on this patent DOI, it describes implementation method to link digitalized patent document to patent bibliographic record at a time.
PDF

Searching Patents Effectively in terms of Keyword Distributions (키워드 분포를 고려한 효과적 특허검색기법)

Lee, Wookey;Song, Justin Jongsu;Kang, Michael Mingu
- Journal of Information Technology and Architecture
- /
- v.9 no.3
- /
- pp.323-331
- /
- 2012
With the advancement of the area of knowledge and information, Intellectual Property, especially, patents have captured attention more and more emergent. The increasing need for efficient way of patent information search has been essential, but the prevailing patent search engines have included too many noises for the results due to the Boolean models. This has occasioned too much time for the professional experts to investigate the results manually. In this paper, we reveal the differences between the conventional document search and patent search and analyze the limitations of existing patent search. Furthermore, we propose a specialized in patent search, so that the relationship between the keywords within each document and their significance within each patent document search keyword can be identified. Which in turn, the keywords and the relationships have been appointed a ranking for this patent in the upper ranks and the noise in the data sub-ranked. Therefore this approach is proposed to significantly reduce noise ratio of the data from the search results. Finally, in, we demonstrate the superiority of the proposed methodology by comparing the Kipris dataset.
KSCI

LDA Topic Modeling and Recommendation of Similar Patent Document Using Word2vec (LDA 토픽 모델링과 Word2vec을 활용한 유사 특허문서 추천연구)

Apgil Lee;Keunho Choi;Gunwoo Kim
- Information Systems Review
- /
- v.22 no.1
- /
- pp.17-31
- /
- 2020
With the start of the fourth industrial revolution era, technologies of various fields are merged and new types of technologies and products are being developed. In addition, the importance of the registration of intellectual property rights and patent registration to gain market dominance of them is increasing in oversea as well as in domestic. Accordingly, the number of patents to be processed per examiner is increasing every year, so time and cost for prior art research are increasing. Therefore, a number of researches have been carried out to reduce examination time and cost for patent-pending technology. This paper proposes a method to calculate the degree of similarity among patent documents of the same priority claim when a plurality of patent rights priority claims are filed and to provide them to the examiner and the patent applicant. To this end, we preprocessed the data of the existing irregular patent documents, used Word2vec to obtain similarity between patent documents, and then proposed recommendation model that recommends a similar patent document in descending order of score. This makes it possible to promptly refer to the examination history of patent documents judged to be similar at the time of examination by the examiner, thereby reducing the burden of work and enabling efficient search in the applicant's prior art research. We expect it will contribute greatly.
https://doi.org/10.14329/isr.2020.22.1.017 인용 PDF

A Study on the Development of LDA Algorithm-Based Financial Technology Roadmap Using Patent Data

Koopo KWON;Kyounghak LEE
- Korean Journal of Artificial Intelligence
- /
- v.12 no.3
- /
- pp.17-24
- /
- 2024
This study aims to derive a technology development roadmap in related fields by utilizing patent documents of financial technology. To this end, patent documents are extracted by dragging technical keywords from prior research and related reports on financial technology. By applying the TF-IDF (Term Frequency-Inverse Document Frequency) technique in the extracted patent document, which is a text mining technique, to the extracted patent documents, the Latent Dirichlet Allocation (LDA) algorithm was applied to identify the keywords and identify the topics of the core technologies of financial technology. Based on the proportion of topics by year, which is the result of LDA, promising technology fields and convergence fields were identified through trend analysis and similarity analysis between topics. A first-stage technology development roadmap for technology field development and a second-stage technology development roadmap for convergence were derived through network analysis about the technology data-based integrated management system of the high-dimensional payment system using RF and intelligent cards, as well as the security processing methodology for data information and network payment, which are identified financial technology fields. The proposed method can serve as a sufficient reason basis for developing financial technology R&D strategies and technology roadmaps.
https://doi.org/10.24225/kjai.2024.12.3.17 인용 PDF

Analysis method of patent document to Forecast Patent Registration (특허 등록 예측을 위한 특허 문서 분석 방법)

Koo, Jung-Min;Park, Sang-Sung;Shin, Young-Geun;Jung, Won-Kyo;Jang, Dong-Sik
- Journal of the Korea Academia-Industrial cooperation Society
- /
- v.11 no.4
- /
- pp.1458-1467
- /
- 2010
Recently, imitation and infringement rights of an intellectual property are being recognized as impediments to nation's industrial growth. To prevent the huge loss which comes from theses impediments, many researchers are studying protection and efficient management of an intellectual property in various ways. Especially, the prediction of patent registration is very important part to protect and assert intellectual property rights. In this study, we propose the patent document analysis method by using text mining to predict whether the patent is registered or rejected. In the first instance, the proposed method builds the database by using the word frequencies of the rejected patent documents. And comparing the builded database with another patent documents draws the similarity value between each patent document and the database. In this study, we used k-means which is partitioning clustering algorithm to select criteria value of patent rejection. In result, we found conclusion that some patent which similar to rejected patent have strong possibility of rejection. We used U.S.A patent documents about bluetooth technology, solar battery technology and display technology for experiment data.
https://doi.org/10.5762/KAIS.2010.11.4.1458 인용 PDF KSCI

Patent Document Similarity Based on Image Analysis Using the SIFT-Algorithm and OCR-Text

Park, Jeong Beom;Mandl, Thomas;Kim, Do Wan
- International Journal of Contents
- /
- v.13 no.4
- /
- pp.70-79
- /
- 2017
Images are an important element in patents and many experts use images to analyze a patent or to check differences between patents. However, there is little research on image analysis for patents partly because image processing is an advanced technology and typically patent images consist of visual parts as well as of text and numbers. This study suggests two methods for using image processing; the Scale Invariant Feature Transform(SIFT) algorithm and Optical Character Recognition(OCR). The first method which works with SIFT uses image feature points. Through feature matching, it can be applied to calculate the similarity between documents containing these images. And in the second method, OCR is used to extract text from the images. By using numbers which are extracted from an image, it is possible to extract the corresponding related text within the text passages. Subsequently, document similarity can be calculated based on the extracted text. Through comparing the suggested methods and an existing method based only on text for calculating the similarity, the feasibility is achieved. Additionally, the correlation between both the similarity measures is low which shows that they capture different aspects of the patent content.
https://doi.org/10.5392/IJoC.2017.13.4.070 인용 PDF KSCI

Recognition of Named Entity of Patent Document Applying NLP

Lee, Tae-Seok
- Proceedings of the Korea Contents Association Conference
- /
- 2014.06a
- /
- pp.301-302
- /
- 2014
PDF

A Study on Developing a Prediction Model of Patent Citation Counts (특허인용 예측모형 구축에 관한 연구)

Yoo, Jae-Bok;Chung, Young-Mee
- Journal of the Korean Society for information Management
- /
- v.27 no.4
- /
- pp.239-258
- /
- 2010
The purpose of this study is to develop a prediction model of patent citation counts based on major factors which affect patent citation. To this end, we performed multiple regression analysis between the patent citation counts and five explanatory variables such as the number of pages, the number of claims, the reference-average-citation rate, the strength of bibliographic coupling, and the document similarity proved as having 5% or more standardized variances($r^2$) with patent citation counts, with a test dataset of U.S. patents in five subject fields. As a result, our prediction models showed 58.3% to 89.6% predictability depending on subject fields and revealed the document similarity has the highest impact on citation counts among the five predictive variables in all the subject fields. The result of comparison between the predicted citation counts and the actual ones confirmed the usefulness of the citation prediction models built for each subject field.
https://doi.org/10.3743/KOSIM.2010.27.4.239 인용 PDF

Analysis of Factors Influencing Patent Citations (특허 인용에 영향을 미치는 요인 분석)

Yoo, Jae-Bok;Chung, Young-Mee
- Journal of the Korean Society for information Management
- /
- v.27 no.1
- /
- pp.103-118
- /
- 2010
Recently, the valuation of patented technology has been greatly emphasized, and patent citation has been accepted as a very useful index of this technology. In this study, we performed correlation analyses between the patent citation counts and 17 explanatory variables of morphological, technological, and conceptual factors with a test dataset of U.S. patents in five subject fields. Seven variables having 5% or more standardized variances($r^2$) with patent citation counts were identified; number of pages, number of claims, reference-average-citation rate, patent increase/decrease rate, strength of bibliographic coupling, co-citation counts and document similarity. The result of the ANOVA test shows that the mean values of these variables vary among most subject fields.
https://doi.org/10.3743/KOSIM.2010.27.1.103 인용 PDF

Search Result 46, Processing Time 0.022 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)