http://dx.doi.org/10.3745/KIPSTB.2009.16-B.1.71

An XML Tag Indexing Method Using Lexical Similarity

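Jeong, Hye-Jin (Department of Computer Information, Chonbuk National University)
Kim, Yong-Sung (Division of Electronics and Information Engineering, Chonbuk National University)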
Abstract
To extract indices and determine index weights more effectively, existing studies draw on document structure as well as document content. However, most of these studies concentrate on calculating the importance of context rather than the importance of XML tags, and they determine tag importance from common sense rather than verifying it through objective experiments. This paper addresses automatic indexing that uses the tag information of XML documents, XML having become the standard for web document management. It classifies the major tags that make up a paper according to their importance and calculates the weights of terms extracted from low-weight tags. Using the weights obtained, it then proposes a method that calculates the final weight by updating the weights of terms extracted from high-weight tags. To determine the weights more objectively, the paper tests which tags users consider important and reflects the outcome in the weight calculation by classifying tag importance according to the test results. Finally, it verifies the effectiveness of the index weights produced by the proposed method by comparing their search performance with that of index weights calculated using an existing method of determining tag importance.
Keywords
XML Tag Weight; Automatic Indexing; Information Retrieval