• Title/Summary/Keyword: XML document

Search Result 840, Processing Time 0.042 seconds

Similarity Computation for XML Document with Semantically Extended Tags (의미적으로 확장된 태그들을 이용한 XML 문서들의 유사성 계산.)

  • Song, In-Sang;Paik, Ju-Ryun;Kim, Ung-Mo
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2006.11a
    • /
    • pp.369-372
    • /
    • 2006
  • XML(eXtensible Markup language) 사용의 급속한 증가는 웹에 존재하는 많은 양의 정보들을 XML기반 데이터로 생성하게 했으며 저장과 교환에 있어서 표준이 되도록 했다. 이는 사용자에 의한 임의의 태그정의를 가능하게 하는 XML 사용의 용이성에 기반한다. 그러나 이러한 장점은 비슷한 내용을 갖는 XML 문서에 대해서 사람들마다 개개의 태그이름과 구조를 사용한다는 문제점을 만든다. 따라서 유사한 의미를 가지고 있지만 서로 다른 문서로 분류된다. 이러한 점을 개선하기 위해 XML 문서 태그들 간의 벡터 스페이스 모델과 XML 데이터를 이용하여 시소러스를 구축하는 방법 등이 연구되고 제안되어 왔지만 아직 초보적인 단계이다. 본 논문에서는 XML 문서를 구성하는 태그들을 동의어로 확장하여 벡터를 생성하고 생성된 벡터를 가지고 태그들 간의 유사성을 체크하여 서로 다른 XML 문서들의 유사성을 수치적으로 계산한다.

  • PDF

A study on Conversion of UML Diagram to XML Documents (UML 다이어그램의 XML 문서 변환에 관한 연구)

  • Lee, Jeong-Seok;Park, Hea-Woo;Kang, Byung-Wook
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2003.11c
    • /
    • pp.1601-1604
    • /
    • 2003
  • XML(eXtensible Markup Language) 프로그램이나 XML을 이용한 B2B 시스템 구축과 같은 XML 프로젝트에서는 객체 지향적 설계언어인 UML을 이용해 개발하면 효율을 높일 수 있다. UHL(Unified Modeling Language)로 XML문서 구조를 표현하는 이유는 XML문서를 생성, 접근, 수정하는 XML프로그램을 체계적이고 효율적으로 설계할 수 있기 때문이다. DTD(Document Type Declaration)와 스키마(Schema)를 UML로 표현함으로써 프로그래밍을 통합적으로 추진 할 수 있다. 이러한 과정에서 XML의 문서 구조정보의 활용 증대와 UML의 확장이라는 이점을 취할 수 있다. 본 논문에서는 UIML 기반의 다이어그램에서 XML 문서서로의 변환기에 대한 모델을 제안한다.

  • PDF

Design and Implementation of a Document-Oriented and Web-Based Nuclear Design Automation System (문서중심 및 웹기반 핵설계 자동화 시스템의 설계 및 구현)

  • Park, Yong-Soo;Kim, Jong-Kyung
    • The KIPS Transactions:PartD
    • /
    • v.11D no.6
    • /
    • pp.1319-1326
    • /
    • 2004
  • To automate nuclear design works which are time-consuming and man-power intensive, Innovative Design Processor ($IDP^{TM}$) is being developed. Two basic principles of IDP are the document-oriented design and the web-based design. The document-oriented design is that, if the designer writes a design document called active document and feeds it to a special program which has a robust parser, the finai document with complete analysis, table and plots is made automatically. The active documents can be written with ordinary HTML/XML editors or created automatically on the web, which is another framework of IDP. Using the proper mix-up of server side and client side programming under the LAMP (Linux/Apache/MySQL/PHP) environment, the design process on the web is modeled as a design wizard style so that even a novice designer makes the design document easily.

A Complexity Metric for Web Documentation Based on Entropy (엔트로피를 기반으로한 Web 문서들의 복잡도 척도)

  • Kim, Kap-Su
    • Journal of The Korean Association of Information Education
    • /
    • v.2 no.2
    • /
    • pp.260-268
    • /
    • 1998
  • In this paper, I propose a metric model for measuring complexity of Web documentations which are wrote by HTML and XML. The complexity of Web documentation has effect on documentation understandability which is an important metric in maintenance and reusing of Web documentation. The understandable documents have more effect on WEI. The proposed metric uses the entropy to represent the degree of information flows between Web documentations. The proposed documentation complexity measures the information flows in a Web document based on the information passing relationship between Web document files. I evaluate the proposed metric by using the complexity properties proposed by Weyuker, and measure the document complexity. I show effectiveness of analyzing the correlation between the number of document file and document complexity.

  • PDF

Design and Implementation of the HTML-WML Converter (무선 인터넷을 위한 HTML-WML 변환기 설계 및 구현)

  • 민영수;강형일;유재수
    • Journal of Internet Computing and Services
    • /
    • v.2 no.2
    • /
    • pp.37-50
    • /
    • 2001
  • To access massive and various HTML documents that are in the web using wireless Internet equipments, another WML document that is equal to the HTML document must be written, In the case Web documents written by HTML are massive, the construction of a WML site with the same information needs much cost of space and time, This paper designs and implements the HTML-XML converter that alleviates such a problem. The HTML-WML converter translates the Web document written by HTML to the WML document for portable wireless equipments, The HTML-XML converter has advantages that it reconstructs WML document dynamically according to portable wireless equipments and processes various image formats such as GIF, JPG, BMP, and so on, The HTML-WML converter can be used as not only a utility of the WML editor but also a real-time converter on wireless Internet.

  • PDF

An Indexing Scheme for Efficient Retrieval and Update of Structured Documents Based on GDIT (GDIT를 기반으로 한 구조적 문서의 효율적 검색과 갱신을 위한 인덱스 설계)

  • Kim, Young-Ja;Bae, Jong-Min
    • The Transactions of the Korea Information Processing Society
    • /
    • v.7 no.2
    • /
    • pp.411-425
    • /
    • 2000
  • Information retrieval systems for structured documents which are written in SGML or XML support partial retrieval of document. In order to efficiently process queries based on document structures, low memory overhead for indexing, quick response time for queries, supports to powerful types of user queries, and minimal updates of index structure for document updates are required. This paper suggests the Global Document Instance Tree(GDIT) and proposes an effective indexing scheme and query processing algorithms based on the GDIT. The indexing scheme keeps up indexing and retrieval effciency and also guarantees minimal updates of the index structure when document structures are updated.

  • PDF

A Design and Implements of CPP/CPA Editing System based on ebXML (ebXML의 CPP/CPA 편집 시스템 설계 및 구현)

  • 최종근;김창수;정회경
    • Journal of Korea Multimedia Society
    • /
    • v.6 no.5
    • /
    • pp.928-936
    • /
    • 2003
  • In terms of B2B, business partners require works that define 1h13ir ability to operate business collaboration. The document outlining collaboration is the basis of improving the system of business partner and its interoperability. In addition, the definition of business interaction that is based on the documents demonstrating inter-cooperation of business companies is needed to function interoperability properly, and business trading is performed depending on the documents that define reciprocal action of collaboration. In ebXML, CPP(Collaboration-Protocol Profile) defines one business partner's technical capabilities to engage in electronic business collaborations with other partners by exchanging electronic messages. A CPA(Collaboration-Protocol Agreement) documents the technical agreement between two partners to engage in electronic business collaboration. In this paper, I draw up a plan for a composer system that deals with business collaboration document to ameliorate the interoperability of B2B companies.

  • PDF

XML Document Filtering based on Segments (세그먼트 기반의 XML 문서 필터링)

  • Kwon, Joon-Ho;Rao, Praveen;Moon, Bong-Ki;Lee, Suk-Ho
    • Journal of KIISE:Databases
    • /
    • v.35 no.4
    • /
    • pp.368-378
    • /
    • 2008
  • In recent years, publish-subscribe (pub-sub) systems based on XML document filtering have received much attention. In a typical pub-sub system, subscribed users specify their interest in profiles expressed in the XPath language, and each new content is matched against the user profiles so that the content is delivered to only the interested subscribers. As the number of subscribed users and their profiles can grow very large, the scalability of the system is critical to the success of pub-sub services. In this paper, we propose a fast and scalable XML filtering system called SFiST which is an extension of the FiST system. Sharable segments are extracted from twig patterns and stored into the hash-based Segment Table in SFiST system. Segments are used to represent user profiles as Terse Sequences and stored in the Compact Segment Index during filtering. Our experimental study shows that SFiST system has better performance than FiST system in terms of filtering time and memory usage.

A Study on Policy Design of Secure XML Access Control (안전한 XML 접근 제어의 정책 설계에 관한 연구)

  • Jo, Sun-Moon;Joo, Hyung-Seok;Yoo, Weon-Hee
    • The Journal of the Korea Contents Association
    • /
    • v.7 no.11
    • /
    • pp.43-51
    • /
    • 2007
  • Access control techniques should be flexible enough to support all protection granularity levels. Since access control policies are very likely to be specified in relation to document types, it is necessary to properly manage a situation in which documents fail to be dealt with by the existing access control policies. The existing access control has not taken information structures and semantics into full account due to the fundamental limitations of HTML. In addition, access control for XML documents allows only read operations, and there exists the problem of slowing down system performance due to the complex authorization evaluation process. In order to resolve this problem, this paper designs a XML Access Control Management System which is capable of making fined-grained access control. And then, in developing an access control system, it describes the subject and object policies of authorization for XML document on which authorization levels should be specified and which access control should be performed.

A Study on Processing XML Documents (XML 문서 처리에 관한 연구)

  • Kim, Tae Gwon
    • Journal of KIISE
    • /
    • v.43 no.4
    • /
    • pp.489-496
    • /
    • 2016
  • XML can effectively express structured or semi-structured data as well as relational databases. XQuery is a query language for retrieving information for such an XML document. In this paper, an XQuery composer is designed and implemented, with an API provided for XQuery processors, and a proper processor is registered. This composer shows query results immediately processed by the processor. As this composer contains a parser for XQuery, it can compose XQuery effectively using a diverse dialog box designed for XQuery grammar. A dialog box is affiliated with a clause region, which is a region that algebra operates from the parsing tree. It can compose path expressions for an XML document easily as it shows an element tree from DTD graphically. Path expressions are composed automatically by marking elements in the structural hierarchy and by specifying the predicate of an element partially.