
A New Similarity Measure for Improving Ranking in QA Systems  

Kim Myung-Gwan (Seoul Health College, Department of Computer Information Processing)
Park Young-Tack (Soongsil University, School of Computing)
Abstract
The main idea of this paper is to combine term position information within sentences with query type classification to improve the ranking of documents against a query. First, we discuss the use of conceptual graphs for representing document contents in information retrieval. The method builds on well-known text comparison strategies, such as the Dice coefficient, extended with position-based term weighting. Second, we introduce a method for learning query type classification that improves a question answering system's ability to retrieve answers to questions. The proposed method employs naive Bayes classification, a machine learning technique. For training, we used a collection of approximately 30,000 question-answer pairs obtained from Frequently Asked Questions (FAQ) files on various subjects. Evaluation on a set of queries from the international TREC-9 question answering track shows that the method with machine learning outperforms other systems in TREC-9 (mean reciprocal rank of 0.29 and precision of 55.1%).
Keywords
Question Answering system; Machine learning
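
The abstract names two quantities that are easy to illustrate: a Dice coefficient with position-based term weights, and mean reciprocal rank (MRR) as the evaluation score. The Python sketch below is only an illustration, not the authors' implementation. The plain Dice coefficient and MRR follow their standard definitions, but `position_weight` (a linear decay from 1.0 to 0.5 across the sentence) and the way it is folded into `weighted_dice` are assumptions made for this example, since the abstract does not give the actual weighting scheme.

```python
def dice_coefficient(query_terms, sentence_terms):
    """Standard Dice coefficient on term sets: 2|A n B| / (|A| + |B|)."""
    a, b = set(query_terms), set(sentence_terms)
    if not a and not b:
        return 0.0
    return 2 * len(a & b) / (len(a) + len(b))


def position_weight(index, length):
    """Hypothetical weight: linear decay from 1.0 (first term) to 0.5 (last)."""
    if length <= 1:
        return 1.0
    return 1.0 - 0.5 * index / (length - 1)


def weighted_dice(query_terms, sentence_terms):
    """Dice-style score in which each sentence term carries a position weight.

    Assumed form for illustration: query terms count 1.0 each, sentence
    terms count their position weight, shared terms count the sentence weight.
    """
    n = len(sentence_terms)
    weights = {}
    for i, term in enumerate(sentence_terms):
        # If a term repeats, keep its largest weight.
        weights[term] = max(weights.get(term, 0.0), position_weight(i, n))
    query = set(query_terms)
    shared = sum(w for t, w in weights.items() if t in query)
    total = len(query) + sum(weights.values())
    return 2 * shared / total if total else 0.0


def mean_reciprocal_rank(ranks):
    """ranks: 1-based rank of the first correct answer per question,
    or None when no correct answer was returned (contributes 0)."""
    if not ranks:
        return 0.0
    return sum(1.0 / r for r in ranks if r) / len(ranks)
```

For example, `mean_reciprocal_rank([1, 2, None, 4])` averages 1, 1/2, 0, and 1/4 over four questions. In this sketch a candidate sentence whose matching terms appear early scores higher than one whose matches appear late, which is one possible reading of "position-based weighted term".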