
Automatic Construction of Alternative Word Candidates to Improve Patent Information Search Quality  

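Baik, Jong-Bum (Department of Computer Science, Soongsil University)
Kim, Seong-Min (Department of Computer Science, Soongsil University)
Lee, Soo-Won (School of Computing, Soongsil University)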
Abstract
Information retrieval can fail to return appropriate information for many reasons. Allomorphs are one cause of search failure due to keyword mismatch. This research proposes a method that automatically constructs alternative word candidates in order to minimize such failures. Assuming that two words have similar meanings if they co-occur with similar words, the proposed method combines the concept of concentration, association word sets, cosine similarity between association word sets, and a confidence-based filtering technique. The performance of the proposed method is evaluated against a manually extracted list of alternative words. The evaluation results show that the proposed method outperforms the context-window overlapping method in both precision and recall.
Keywords
Allomorph; Synonym; Associated Word; Information Retrieval
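
The abstract names the building blocks of the method but not their exact definitions, so the following Python is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes that co-occurrence is counted per document, that confidence is the association-rule confidence P(b | a) used to keep only strongly associated words in each association word set, and that candidates are ranked by cosine similarity between those sets. The function names, the thresholds `min_conf` and `min_sim`, and the toy data are hypothetical, and the "concept of concentration" is left out because the abstract does not define it.

```python
from collections import Counter, defaultdict
from itertools import combinations
from math import sqrt


def association_sets(docs, min_conf=0.5):
    """Map each word to {co-occurring word: confidence}.

    Assumption: confidence(a -> b) = (#docs containing a and b) / (#docs containing a),
    and pairs below min_conf are filtered out.
    """
    doc_freq = Counter()
    pair_freq = Counter()
    for doc in docs:
        words = set(doc)
        doc_freq.update(words)
        for a, b in combinations(sorted(words), 2):
            pair_freq[(a, b)] += 1
            pair_freq[(b, a)] += 1

    assoc = defaultdict(dict)
    for (a, b), n in pair_freq.items():
        conf = n / doc_freq[a]
        if conf >= min_conf:
            assoc[a][b] = conf
    return assoc


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v[k] for k, w in u.items() if k in v)
    norm = sqrt(sum(w * w for w in u.values())) * sqrt(sum(w * w for w in v.values()))
    return dot / norm if norm else 0.0


def alternative_candidates(docs, word, min_conf=0.5, min_sim=0.5):
    """Return (candidate, similarity) pairs for `word`, most similar first."""
    assoc = association_sets(docs, min_conf)
    target = assoc.get(word, {})
    scored = [(other, cosine(target, vec)) for other, vec in assoc.items() if other != word]
    kept = [(w, s) for w, s in scored if s >= min_sim]
    return sorted(kept, key=lambda pair: -pair[1])


# Hypothetical toy corpus: each document is a list of extracted keywords.
docs = [
    ["handset", "antenna", "battery"],
    ["cellphone", "antenna", "battery"],
    ["engine", "piston", "valve"],
]
candidates = alternative_candidates(docs, "handset", min_conf=0.5, min_sim=0.9)
```

In this toy corpus "handset" and "cellphone" never appear in the same document, yet they share the same association words, which is the situation the similarity assumption in the abstract is meant to capture.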
Citations & Related Records
Times Cited By KSCI: 1
1 Hsinchun Chen and Kevin J. Lynch, 'Automatic construction of networks of concepts characterizing document databases,' IEEE Transactions on Systems, Man and Cybernetics, vol.22(5), pp.885-902, 1992
2 Magnus Sahlgren, 'The Word-Space Model,' Ph.D. Dissertation, Stockholm University, Stockholm, Sweden, 2006
3 P. D. Turney, 'Mining the Web for synonyms: PMI-IR versus LSA on TOEFL,' In Proceedings of the Twelfth European Conference on Machine Learning, 2001
4 Islam, A. and Inkpen, D., 'Second Order Cooccurrence PMI for Determining the Semantic Similarity of Words,' In Proceedings of the International Conference on Language Resources and Evaluation, Genoa, Italy, 2006
5 Patrick Pantel and Dekang Lin, 'Discovering word senses from text,' In Proceedings of ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pp.613-619, Edmonton, Canada, 2002
6 Pierre P. Senellart and Vincent D. Blondel, 'Automatic discovery of similar words,' in Survey of Text Mining, Springer, 2003
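7 이성진, 'A Technique for Extracting Associated Keyword Groups for Product Recommendation in a Keyword Shop,' M.S. Thesis, Soongsil University, Seoul, Korea, 2003 (in Korean)
8 장백국제특허법률사무소, 'Guide to Prior Art Search,' http://www.k8.co.kr/htm/8-2_1.htm/ (in Korean)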
9 J. Baik, S. Kim, and S. Lee, 'Extracting Alternative Word Candidates for Patent Information Search,' Journal of KIISE : Computing Practices and Letters, vol.15, no.4, pp.299-303, Apr. 2009 (in Korean)
10 Ruiz-Casado, M., Alfonseca, E., and Castells, P., 'Using context-window overlapping in synonym discovery and ontology extension,' In Proceedings of the International Conference on Recent Advances in Natural Language Processing, RANLP-2005, 2005
11 박용준, 'Patent Information Search Methods,' (주)아이피풀, 2005 (in Korean)