Adaptive User Profile for Information Retrieval from the Web

  • Srinil, Phaitoon (Faculty of Science and Art, Burapa University Chantaburi Campus) ;
  • Pinngern, Ouen (Department of Computer Engineering Faculty of Engineering, Research Center for Communication and Information Technology, King Mongkut’s Institute of Technology Ladkrabang)
  • Published : 2003.10.22

Abstract

This paper proposes the information retrieval improvement for the Web using the structure and hyperlinks of HTML documents along with user profile. The method bases on the rationale that terms appearing in different structure of documents may have different significance in identifying the documents. The method partitions the occurrence of terms in a document collection into six classes according to the tags in which particular terms occurred (such as Title, H1-H6 and Anchor). We use genetic algorithm to determine class importance values and expand user query. We also use this value in similarity computation and update user profile. Then a genetic algorithm is used again to select some terms from user profile to expand the original query. Lastly, the search engine uses the expanded query for searching and the results of the search engine are scored by similarity values between each result and the user profile. Vector space model is used and the weighting schemes of traditional information retrieval were extended to include class importance values. The tested results show that precision is up to 81.5%.

Keywords