Browse > Article

A Storage and Retrieval System for Structured SGML Documents using Grove  

Kim, Hak-Gyoon (KT Service Development Research Center)
Cho, Sung-Bae (Dept.of Computer Science, Yonsei University)
Abstract
SGML(ISO 8879) has been proliferated to support various document styles and to transfer documents into different platforms. SGML documents have logical structure information in addition to contents. As SGML documents are widely used, there is an increasing need for database storage and retrieval system using the logical structure of documents. However. traditional search engines using document indexes cannot exploit the logical structure. In this Paper, we have developed an SGML document storage system, which is DTD-independent and store the document type and the document instance separately by using Grove which is the document model for DSSSL and HyTime. We have used the Object Store, an object-oriented DBMS, to store the structure information appropriately without any loss of structural information. Also, we have supported a index structure for search efficiency like the relational DBMS, and constructed an effective user interface which combines content-based search with structure-based search.
Keywords
SGML; Grove; SGML; Grove; Structure-based search; object-oriented DB;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 International Organization for Standardization, 'Information processing-text and office systems-Standard Generalized Markup Language(SGML),' ISO/IEC 8879, 1986
2 International Organization for Standardization, 'Hypermdeia/Time-based Structuring Language (Hy-Time),' ISO/IEC 10714, 1996
3 TEI(Text Encoding Initiative), URL: http://www.tei-c.org/
4 R. Sacks-Davis, T. Arnold-Moore and J. Zobel, 'Database systems for structured documents,' IEICE Trans. on Information and Systems, pp.1335-1342, 1995
5 J. Macleod, 'Storage and retrieval of structured documents,' Information Processing and Management, vol. 26. No.2. pp. 197-208, 1990   DOI   ScienceOn
6 G. Salton and M. McGill, Introduction to Modern Information Retrieval, McGraw-Hill, Tokyo, 1983
7 V. Christophides, S. Abiteboul, S. Cluet and M. Scholl, 'From structured documents to novel query facilities,' Special Interest Group on Management of Data(SIGMOD), 1994   DOI   ScienceOn
8 김규태, 현득창, 이수연, 정광철, '관계형 데이터베이스를 이용한 SGML문서 처리', 정보과학회논문지(C), 제3권 제3호, pp. 238-247, 1997
9 International Organization for Standardization, 'Document Style Semantics and Specification Languages(DSSSL),' ISO/IEC 10179, 1996
10 A. Seungupta and A. Dillon, 'Extending SGML to accommodate database functions: A methodological overview,' Journal of the American Society of Information Systems, pp. 629-637, 1997   DOI   ScienceOn
11 G.E. Blake, M.P. Consens, P. Kilpelainen, P.A. Larson, T. Snider and F.W. Tompa, 'Text/relational database management systems: Harmonizing SQL and SGML,' Proc. Applications of Databases, pp. 267-280, 1994
12 D. Megginson, The Simple API for XML, URL: http://www.megginson.com/SAX/
13 김용훈, 이원석, 류은숙, 이규철, 이상기, 김현기, 이혜란, 주종철, 'SGML 문서 관리 시스템의 설계 및 구현', 한국문헌정보학회지, 제32권 제2호, pp. 157-177, 1998   과학기술학회마을
14 K. Aberer, K. Bohm and C. Huser, 'The prospects of publishing using advanced database concepts,' Conf. on Electronic Publishing, 1994
15 J. Clark, A Free, Object-oriented Toolkit for SGML Parsing and Entity Management, URL: http://www.jclark.com/sp