Browse > Article

Extracting Alternative Word Candidates for Patent Information Search  

Baik, Jong-Bum (숭실대학교 컴퓨터학과)
Kim, Seong-Min (숭실대학교 컴퓨터학과)
Lee, Soo-Won (숭실대학교 컴퓨터학과)
Abstract
Patent information search is used for checking existence of earlier works. In patent information search, there are many reasons that fails to get appropriate information. This research proposes a method extracting alternative word candidates in order to minimize search failure due to keyword mismatch. Assuming that two words have similar meaning if they have similar co-occurrence words, the proposed method uses the concept of concentration, association word set, cosine similarity between association word sets and a ranking modification technique. Performance of the proposed method is evaluated using a manually extracted alternative word candidate list. Evaluation results show that the proposed method outperforms the document vector space model in recall.
Keywords
AssociationWord; AlternativeWord; Similar Word; Patent Information Search;
Citations & Related Records
연도 인용수 순위
  • Reference
1 박용준, "특허정보 검색방법", (주)아이피플, 2005
2 Pierre P. Senellart and Vincent D. Blondel, “Auto-matic discovery of similar words,” in Survey of Text Mining, Springer, 2003
3 Vincent D. Blondel and Pierre P. Senellart, 'Auto-matic extraction of synonyms in a dictionary,' Presented at the TextMining Workshop, Arlington, Virginia, 2002
4 Jiawel Han and Micheline Kamber, Data Mining Concepts and Techniques, 2nd ed., Morgan Kauf-mann, 2006
5 장백국제특허법률사무소, "선행기술 검색안내," http://www.k8.co.kr/htm/8-2_1.htm/
6 Magnus Sahlgren, "The Word-Space Model," Ph.D. Dissertation, Stockholm University, Stockholm, Sweden 2006
7 Hsinchun Chen and Kevin J. Lynch, “Automatic construction of networks of concepts characterizing document databases,” IEEE Transactions on Sys-tems, Man and Cybernetics, Vol.22(5), 885-902, 1992   DOI   ScienceOn
8 이성진, "키워드 샾에서의 상품 추천을 위한 연관 키워드 그룹 추출 기법", M.S. Thesis, Soongsil Uni-versity, Seoul, Korea 2003
9 Jon M. Kleinberg, 'Automatic construction of net-works of concepts characterizing document data-bases,' Journal of the ACM, Vol.46(5), 604-632, 1999   DOI   ScienceOn