Adaptive Thesaurus using a Neural Network

신경망을 이용한 적응형 시소러스

  • Published : 2000.12.01

Abstract

정보검색 분야에서 시소러스 용어와 용어 사이의 관계를 나타내어, 질의어와 검색될 정보 사이에 존재하는 용어적 차이를 줄이는데 사용될 수 있다. 시소러스를 사용하는 방법 중 진보된 것은 용어 사이의 관계에 가중치를 주어, 소위 스프레딩 엑티베이션 방법을 이용하여 주어진 용어에서 다른 용어들 사이의 유사성을 측정하여 이를 검색에 이용한다. 그러나, 이러한 방법은 가중치를 어떻게 할당하느냐에 따라 그 결과가 달라지는 문제점이 발생한다. 본 논문에서는 시소러스의 가중치를 사용자의 검색된 정보에 대한 적합성 반응에 근거하여 조절할 수 있는 신경망 기반 시소러스를 제안한다. 제안된 시소러스의 타당성을 위하여 프로토타입의 시소러스를 WordNet으로부터 추출하여 실험하였으며, 그 결과로 recall-precision 값이 향상됨을 보였다.

Keywords

References

  1. Chen, H., Schatz, B., Yim, T., and Fye, D. 'Automatic Thesaurus Generation for an Electronic Community System,' Journal of the American Society for Information Science, 1994
  2. Collins, A. M. and Loftus, E. F. 'A Spreading-Activation Theory of Semantic Processing,' Psychological Review, Vol. 82, pp. 407-425, 1975 https://doi.org/10.1037/0033-295X.82.6.407
  3. Miller, G. A., Beckwith, R., Fellbaum, C., Gross, D., and Miller, K. 'Introduction to WordNet: An On-line Lexical Database,' Princeton University, 1993
  4. Editors of The American Heritage Dictionary, 'Roget's II : the new thesaurus,' Boston : Houghton Mifflin Co., 1995
  5. ISO 2788-1986(E) : Documentation Guidelines for the Establishment and Development of Monolingual Thesauri. 2nd
  6. Salton, G. and McGill, M. J. Introduction to Modem Information Retrieval. McGraw Hill, New York, 1983
  7. Wong, S.K.M., Ziarko, W., Raghavan,V., and Wong, P. C. N. On Modeling of Information Retrieval Concepts in Vector Spaces, ACM Transaction on Database System, Vol. 12, No.2, June 1987. pp. 299-32l https://doi.org/10.1145/22952.22957
  8. Wong, S.K.M., Ziarko, W., and Wong, P. C. N. Generalized Vector Space Model in Information Retrieval. In Proceedings of the 8th Annual International ACM-SIGIR Conference, 1985, New York, pp. 18-25 https://doi.org/10.1145/253495.253506
  9. Gwang S. Jung, 'Connectionist Domain Knowledge: Acquisition And Its Evaluation In Information Retrieval,' PhD thesis, The Center for Advanced Computer studies, University of Louisiana at Lafayette, 1991
  10. Wong, S.K.M. and Cai, Y.J. Computation of term associations by a neural network. In Proceedings of ACM-SIGIR Conference, 1993, pp. 107-115 https://doi.org/10.1145/160688.160703
  11. Wilkinson, R. and Hingston, P. Using the cosine measure in a neural network for document retrieval. In Proceedings of ACM-SIGIR Conference, 1991, pp. 202-210 https://doi.org/10.1145/122860.122880
  12. Lippmann, R. P. An introduction to computing with neural nets. IEEE ASSP Magazine, Vol. 3, No.4, pp. 4-22
  13. Salton, G. and Buckley, C. On the use of spreading activation methods in automatic information retrieval. In 11th Annual International ACM SIGIR Conference on Research and Development in Retrieval (988), pp, 147-160.10 https://doi.org/10.1145/62437.62447
  14. Lancaster, F. W. Information retrieval systems: characteristics, testing and evaluation, 2nd Ed., John Wiely and Sons, New York, 1979
  15. Salton, G., Fox, E. A., and Wu, H. Extnded Boolean Information Retrieval, Vol. 36, No. 11, December 1983, Communication of the ACM, pp. 1022-1036 https://doi.org/10.1145/182.358466
  16. Bookstein, A. Fussy requests: An approach to weighted Boolean searches, J. ASIS, Vol 31, No. 4, July, 1980, pp. 275-279
  17. Waller, W. G. and Kraft, D. H. A mathematical model for a weighted Boolean retrieval system. Information Processing and Management, Vol 15, No.5, 1979, pp. 235-245 https://doi.org/10.1016/0306-4573(79)90030-X
  18. Wong, S.K.M., Ziarko, W., Raghavan,V., and Wong, P. C. N. Extended Boolean Query Processing in the Generalized Vector Space Model, Information Systems Vol. 14, No.1, pp. 47-63, 1989 https://doi.org/10.1016/0306-4379(89)90024-0
  19. Kim, M., and Raghavan, V. V. Adaptive concept-based retrieval using a neural network. In Proceedings of the ACM SIGIR 2000 Workshop on Mathematical/Formal Methods in Information Retrieval, Athens, Greece, July 28, 2000
  20. Google Search Engine, http://www.gnoglc.com