Search | Korea Science

An Implementation of XML document searching system based on Structure and Semantics Similarity (구조와 내용 유사도에 기반한 XML 웹 문서 검색시스템 구축)

Park Uchang;Seo Yeojin
- Journal of Internet Computing and Services
- /
- v.6 no.2
- /
- pp.99-115
- /
- 2005
Extensible Markup Language (XML) is an Internet standard that is used to express and convert data, In order to find the necessary information out of XML documents, you need a search system for XML documents, In this research, we have developed a search system that can find documents that matches the structure and content of a given XML document, making the best use of XML structure, Search metrics take account of the similarity in tag names, tag values, and the structure of tags, After a search, the system displays the ranked results in the order of aggregate similarity, Three methods of query are provided: keyword search which is conventional; search with tag names and their values; and search with XML documents, These three methods enable users to choose the method that best suits their preference, resulting in the increase of the usefulness of the system.
PDF

Extended Entity-Relationship Model for Conceptual Modeling of XML Schema (XML 스키마의 개념적 모델링을 위한 확장된 개체관계 모델)

Jung, In-Hwan;Kim, Young-Ung
- The Journal of the Institute of Internet, Broadcasting and Communication
- /
- v.15 no.1
- /
- pp.157-163
- /
- 2015
XML has become one of the most influential standard language for representing and exchanging data on internet. However, XML itself has a ability to represent a logical structure for storing and managing data, it is inadequate to use as a conceptual modeling tool because of its complexity for representing the document structures. In this paper, we propose the graphical form of conceptual modeling techniques for representing the structure of the XML schema documents using an extended entity relationship diagram. For this, extended entity relationship model is presented for representing the XML schema structure, transformation rules are presented for transforming extended entity relationship model into XML schema document to show the completeness of the proposed model.
https://doi.org/10.7236/JIIBC.2015.15.1.157 인용 PDF KSCI

A Transformation Technique of XML DTD to Relational Database Schema Based On Extracting Common Structure in XML Documents (공통 문서 구조 추출을 통한 XML DTD의 관계형 데이터 베이스 스키마 변환 기법)

Ahn, Sung-Eun;Choi, Hwang-Kyu
- The KIPS Transactions:PartD
- /
- v.9D no.6
- /
- pp.999-1008
- /
- 2002
XML is emerging as a standard data format to exchange and to present data on the Web. There are increasing needs to efficiently store and to query XML data. In this paper. we propose a new schema transformation algorithm based on a common structure extracting technique from XML documents. The common structure is shared by all XML documents referenced by DTD and the uncommon structure is ununiformly appeared on all XML documents referenced by DTD. Based on the extracted common and uncommon structures, we transform XML DTD into relational database schema. We conduct a performance evaluation based on the number of the generated tables, the size of the record, query processing time and the number of joins on the query. The performance of our algorithm is compared with the existing algorithms, then in most cates, our algorithm is better than the existing ones with respect to the number of the generated tables and appearance of NULL values in the tables.
https://doi.org/10.3745/KIPSTD.2002.9D.6.999 인용 PDF KSCI

UML Class Diagram for XDR Schema (XDR 스키마를 위한 UML 클래스 다이어그램)

Yoo Moon-sung
- Journal of Internet Computing and Services
- /
- v.4 no.4
- /
- pp.35-44
- /
- 2003
XML becomes the standard for exchanging documents and data on the Web. Schema is used to define XML documents. Among schema, XDR schema, developed chiefly by Microsoft, is a leading schema for practical use. UML is a notation in object-oriented software development and a useful tool to represent the structure of a system. In this paper, we study the transformation method and algorithm from XDR schema to UML diagram. By this transformation, the structure of XDR schema is represented graphically. Thus we can easily understand the structure of XDR schema and we can create XML documents effectively by enhancing reusability and flexibility.
PDF

An Abstract Grammar for XML Document Editing (XML 문서 편집을 위한 추상문법)

신경희;최종명;유재우
- Journal of KIISE:Software and Applications
- /
- v.30 no.3_4
- /
- pp.268-277
- /
- 2003
A document type definition(DTD) which defines tags for a document is a XML document grammar that defines syntactic structure of a document. An XML document keeps the rules and must be parsed to check validation. To parse XML document, the deterministic parsing method of programming language is irrelevant because it does not satisfy the definition of deterministic content model in element declaration. In this paper, we consider editing of a valid XML document in syntax-directed editing environment, and we suggest the internal storage representations of syntax in DTD and theirs algorithms. The consequence is that a syntactic structure of textual DTD is transformed into graph and table structures. The table structure of DTD is interpreted the context free grammar which has attribute values and is used in syntax-directed editor for XML. We called this the XML abstract grammar and showed generated results and examples.
PDF KSCI

Incremental Clustering of XML Documents based on Similar Structures (유사 구조 기반 XML 문서의 점진적 클러스터링)

Hwang Jeong Hee;Ryu Keun Ho
- Journal of KIISE:Databases
- /
- v.31 no.6
- /
- pp.699-709
- /
- 2004
XML is increasingly important in data exchange and information management. Starting point for retrieving the structure and integrating the documents efficiently is clustering the documents that have similar structure. The reason is that we can retrieve the documents more flexible and faster than the method treating the whole documents that have different structure. Therefore, in this paper, we propose the similar structure-based incremental clustering method useful for retrieving the structure of XML documents and integrating them. As a novel method, we use a clustering algorithm for transactional data that facilitates the large number of data, which is quite different from the existing methods that measure the similarity between documents, using vector. We first extract the representative structures of XML documents using sequential pattern algorithm, and then we perform the similar structure based document clustering, assuming that the document as a transaction, the representative structure of the document as the items of the transaction. In addition, we define the cluster cohesion and inter-cluster similarity, and analyze the efficiency of the Proposed method through comparing with the existing method by experiments.
PDF KSCI

Reverse Engineering of XML Schema to Conceptual Model (개념적 모델로의 XML 스키마의 역변환 기법)

Jung, In-Hwan;Kim, Young-Ung
- The Journal of the Institute of Internet, Broadcasting and Communication
- /
- v.17 no.6
- /
- pp.211-216
- /
- 2017
XML is becoming a de facto standard for the representation and exchange of data, and XML itself is well structured to store and manage data, but it is difficult to understand the document structure at a glance due to its diverse and complex expressions. Thus, it is not appropriate to use it as a tool of conceptual model. In this paper, we propose a method to transform XML schema document structure into conceptual model. To do this, we describe CMXML as a conceptual modeling tool and present transformation rules and data structures for transforming an XML schema structure into CMXML diagram. In the transformation procedure, the XML schema document is input and the CMXML diagram is generated by applying the transformation rules.
https://doi.org/10.7236/JIIBC.2017.17.6.211 인용 PDF KSCI

XML Schema and Stylesheet Builder using in Tree on Web Server (웹 서버 상에서 XML트리를 이용한 XML 스키마와 스타일시트 생성)

Park Young-Soo;Jang Duk-Chul
- Journal of KIISE:Computing Practices and Letters
- /
- v.11 no.4
- /
- pp.349-356
- /
- 2005
At the present day, increasing the web service requirements, along with making XML program easy to create and manage, XML program which progress the research of related technologies enables developer the number of benefits. In order to create XML Documents which include structure that compose documents as well as data, it must have defined about DTD or XML schema that display structure of data. In this study, herewith suggest the design and implementation of method that can be automatically created XML Schema and XML Stylesheet using the web-based XML Tree.
PDF KSCI

Common XML Structure Extracting Algorithm for Applying Data Mining Techniques (데이터마이닝 기법 적용을 위한 공용 XML 구조 추출 알고리즘)

Jang, Min-Seok;Bang, Hyun-Jin
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- v.9 no.1
- /
- pp.1072-1076
- /
- 2005
Importance of XML as a target of Data Mining is growing because XML is used generally as a standard markup language for describing structured data. Especially researches have been done about extracting wanted informations by applying association rules to XML documents. But there are few development about solving the problems of method for efficiently obtaining informations from similar kinds of XML documents. To solve the problem this paper tries to suggest the method by which common XML structure is extracted form the same kinds of XML documents having a various XML schemas. The resulted schema structure is supposed to be important one as a preliminary job because it helps us to acquire the useful informations from various kinds of documents by unifying their structures.
PDF

An Approach to Structuralizing Business Information for Internet Shopping Malls (인터넷쇼핑몰의 사업자신원정보 구조화 방안)

장용식
- Journal of Intelligence and Information Systems
- /
- v.10 no.1
- /
- pp.27-45
- /
- 2004
While on-line shopping is increasing, the "Consumer Protection Law in Electronic Commerce" obliges each internet shopping mall to provide its business information. Although most internet shopping malls provide their business information in the semi-structured format on the bottom of their homepages, the attributes and expression forms of business information are different each other. It makes consumers difficult to identify their business information and lowers public confidence. Hence this study proposes three approaches - HTML-based structure, XML-based structure, and XML data island-based structure - to structuralizing business information for correct expression. The experiment results showed that the business information extraction time by XML data island-based structure is independent of the size of the web document, while the time by HTML-based structure is dependent on the size. By comparing the business information extraction times, we show that XML data island-based structure is more efficient and effective than HTML-based structure.structure.
PDF

Search Result 499, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)