DOI QR코드

DOI QR Code

질의응답문서 검색에서 문서구조를 이용한 질의재생성에 관한 연구

Query Reconstruction for Searching QA Documents by Utilizing Structural Components

  • 발행 : 2006.06.01

초록

질의응답문서는 이용자가 입력한 질의, 질의설명, 답을 아는 다른 이용자가 제시한 응답으로 구성된 구조화된 문서로서, 최근 웹 문서처럼 검색이 일반적으로 일어나고 있는 정보원이다. 이 연구에서는 질의응답문서의 구조적 특성을 기반으로 질의를 재생성하여 질의응답문서의 검색효율을 향상시키고자 하였다. 질의재생성 실험에서 성능이 비교된 문서구조는 질의와 응답내용이다. 질의를 기반으로 질의를 재생성하는 방식에서는 질의응답검색 시스템에 입력되어 있는 유사질의를 활용하여 클러스터링하는 기법이 적용되었다. 응답정보를 기반으로 질의를 재생성하는 방식에서는 가장 유사한 기존 질의에 대해 응답된 내용에서 단락검색으로 적합한 문장들을 선정하여 활용하는 기법이 적용되었다. 실험 결과 응답정보를 활용하여 질의를 재생성하는 방식이 정확률은 유지하면서 더 다양한 검색결과를 제공하는 것으로 나타났다.

This study aims to suggest an effective way to enhance question-answer(QA) document retrieval performance by reconstructing queries based on the structural features in the QA documents. QA documents are a structured document which consists of three components : question from a questioner, short description on the question, answers chosen by the questioner. The study proposes the methods to reconstruct a new query using by two major structural parts, question and answer, and examines which component of a QA document could contribute to improve query performance. The major finding in this study is that to use answer document set is the most effective for reconstructing a new query. That is, queries reconstructed based on terms appeared on the answer document set provide the most relevant search results with reducing redundancy of retrieved documents.

키워드

참고문헌

  1. Berztiss, A. T. 1993. 'The Query Language Vizla.' IEEE TKDE 5(5) : 813-825
  2. Bloesch, A. C. and Halpin, T. A. 1997. 'Conceptual Queries Using Conquer II.' Proceedings of the ER' 97 : 16th International Conference on Conceptual Modeling(Los Angeles). 112-126
  3. Choi, Sang Hee. 2005. 'A Study On Clustering Query-answer Documents with Structural Features.' Journal of the Korean Society for Library and Information Science. 39(4) : 105-118
  4. Chung, Young Mee and Lee, Jae Yun. 2001. 'Development of a Clustering Model for Automatic Knowledge Classification.' Journal of the Korean Society for information Management. 18(2) : 203-230
  5. Gopal, Ram D. and Ramesh, R. 1995. 'The Query Clustering Problem : A Set Partitioning Approach.' IEEE Transactions on Knowledge and Data Engineering, 7(6) : 885-899 https://doi.org/10.1109/69.476495
  6. Han, Eui-hong. 1998. 'WebACE : a web agent for document categorization and exploration.' In Proceeding of the 2nd International Conference on Autonomous Agents
  7. Kang, In-Ho and Kim, GilChang. 2003. 'Query Type Classfication or Web Document Retrieval' In Proceeding of the 26th Annual International ACM SIGIR Conference. July 28 -August 1, 2003,Toronto, Canada. pp. 64-71
  8. Kim, Jung Ha, and Lee, Jae Yun. 2000. 'A Comparative Study on Performance Evaluation of Document Clustering Results.' 7th Proceedings of the Korean Society for Information Management Conference. pp. 45-50
  9. Maybury, Mark. T. 2004. New Directions in Question Answering. Menlo Park : AAAI Press
  10. Owei, Vesper. 2002. 'An Intelligent Approach to Handling Imperfect Information in Concep-Based Natural Language Queries.' ACM Transaction on Information Systems, 20(3) : 291-328 https://doi.org/10.1145/568727.568729
  11. Tombros, Anastasios and Sanderson, Mark. 1998. 'Advantages of Query Biased Summaries in Information Retrieval.' In Proceedings of the 21st Annual International ACMSIGIR Conference on Research and Development in Information Retrieval, 2-10
  12. Tombros, Anastasios, Villa, Robert and Rijsbergen, C. J. Van. 2002. 'The Effectiveness of Query-Specific Hierarchic Clustering in Information Retrieval.' Information Processing & Management,. 38(4) : 559-582 https://doi.org/10.1016/S0306-4573(01)00048-6
  13. Wen, Ji-Ron, Nie, Jian-Yun, and Zhang, Hong-Jiang. 2001. 'Clustering User Queris of a Serach Engine.' In Proceedings of WWW10, 162-168
  14. Zhang, Ya-Jun and Liu, Zhi-Qiang. 2004. 'Refining Web Search Engine Results Using Incremental Clustering.' International Journal of Intelligent Systems, 19(2) : 191-199 https://doi.org/10.1002/int.10161