• Title/Summary/Keyword: document structure

Search Result 594, Processing Time 0.026 seconds

A Method of XML Mapping Canonicalization for E-Business Integration (전자상거래 통합을 위한 XML 매핑 정형화 기법)

  • 안우영;홍창범
    • Journal of the Korea Society of Computer and Information
    • /
    • v.9 no.1
    • /
    • pp.1-8
    • /
    • 2004
  • XML is becoming the standard of the new document exchanging. Due to the ablility expressing various types of document structure through XML, RosettaNet and BizTalk are using XML as a core technology in the part of e-Business. Framework is running Business process each other different standard. Internal documents in each company should be transformed differently without any loss to work with other companies. In this paper, transforming Processor based on XML mapping information from XML document information.

  • PDF

An Experimental Study on the Performance of Element-based XML Document Retrieval (엘리먼트 기반 XML 문서검색의 성능에 관한 실험적 연구)

  • Yoon, So-Young;Moon, Sung-Been
    • Journal of the Korean Society for information Management
    • /
    • v.23 no.1 s.59
    • /
    • pp.201-219
    • /
    • 2006
  • This experimental study suggests an element-based XML document retrieval method that reveals highly relevant elements. The models investigated here for comparison are divergence and smoothing method, and hierarchical language model. In conclusion, the hierarchical language model proved to be most effective in element-based XML document retrieval with regard to the improved exhaustivity and harmed specificity.

a Prototype System for collaborative Authoring Over a Network (네트워크 상에서의 공동저작 프로토타입 시스템)

  • Kim, Cha-Jong
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.4
    • /
    • pp.1009-1021
    • /
    • 1999
  • This paper describes the design principles and structure of a prototype system for collaborative authoring over a wide area network. The system includes extensive support for commenting and comment review, facilities for document space navigation, and tools for controlling and monitoring work group activity, including document locking and activity recording. The operational prototype provides a testbed for the examination of human computer interaction, group interaction, group support, document structures, and the problem and the history of efforts to address it.

  • PDF

An Authorization Technique for an XML Document (XML 문서를 위한 권한 부여 기법)

  • Kang, Jung-Mo;Lee, Heon-Gil
    • Journal of Industrial Technology
    • /
    • v.21 no.A
    • /
    • pp.181-188
    • /
    • 2001
  • An XML is an markup language which has been focused on the next generation Web programming language. It easily represents the complex structure of a document, and it is possible to provide the access control over each component of an XML document. An implicit authorization technique means that granting an authorization to a node has effect on granting the same implicit authorization to its all descendants. Therefore, it enhances the time for the authorization grant and reduces the memory required for the authorization information. An authorization technique using an intention type and a authorization replacement solves a redundancy problem and decides whether the access is possible or the authorization conflict occurs at the first attempt.

  • PDF

A Study on the Contingent Worker's Handwritings and Documentation of Labor (비정규노동 수기와 노동의 기록화)

  • Kwak, Kun Hong
    • The Korean Journal of Archival Studies
    • /
    • no.64
    • /
    • pp.5-25
    • /
    • 2020
  • The archives should not document the absence of labor records, but document the traces of workers' lives. In other words, it is the responsibility of the archive to reproduce the acts and sufferings of contingent workers with records, and to reveal the oppressive structure of capitalism based on them. Records representing the lives of contingent workers, such as labor manuals, should be at the core of records that symbolize the present, and archives should be documented. The archives should discard the illusion of neutrality.

A Method of Object Identification from Procedural Programs (절차적 프로그램으로부터의 객체 추출 방법론)

  • Jin, Yun-Suk;Ma, Pyeong-Su;Sin, Gyu-Sang
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.10
    • /
    • pp.2693-2706
    • /
    • 1999
  • Reengineering to object-oriented system is needed to maintain the system and satisfy requirements of structure change. Target systems which should be reengineered to object-oriented system are difficult to change because these systems have no design document or their design document is inconsistent of source code. Using design document to identifying objects for these systems is improper. There are several researches which identify objects through procedural source code analysis. In this paper, we propose automatic object identification method based on clustering of VTFG(Variable-Type-Function Graph) which represents relations among variables, types, and functions. VTFG includes relations among variables, types, and functions that may be basis of objects, and weights of these relations. By clustering related variables, types, and functions using their weights, our method overcomes limit of existing researches which identify too big objects or objects excluding many functions. The method proposed in this paper minimizes user's interaction through automatic object identification and make it easy to reenginner procedural system to object-oriented system.

  • PDF

Term Frequency-Inverse Document Frequency (TF-IDF) Technique Using Principal Component Analysis (PCA) with Naive Bayes Classification

  • J.Uma;K.Prabha
    • International Journal of Computer Science & Network Security
    • /
    • v.24 no.4
    • /
    • pp.113-118
    • /
    • 2024
  • Pursuance Sentiment Analysis on Twitter is difficult then performance it's used for great review. The present be for the reason to the tweet is extremely small with mostly contain slang, emoticon, and hash tag with other tweet words. A feature extraction stands every technique concerning structure and aspect point beginning particular tweets. The subdivision in a aspect vector is an integer that has a commitment on ascribing a supposition class to a tweet. The cycle of feature extraction is to eradicate the exact quality to get better the accurateness of the classifications models. In this manuscript we proposed Term Frequency-Inverse Document Frequency (TF-IDF) method is to secure Principal Component Analysis (PCA) with Naïve Bayes Classifiers. As the classifications process, the work proposed can produce different aspects from wildly valued feature commencing a Twitter dataset.

PDFindexer: Distributed PDF Indexing system using MapReduce

  • Murtazaev, JAziz;Kihm, Jang-Su;Oh, Sangyoon
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.4 no.1
    • /
    • pp.13-17
    • /
    • 2012
  • Indexing allows converting raw document collection into easily searchable representation. Web searching by Google or Yahoo provides subsecond response time which is made possible by efficient indexing of web-pages over the entire Web. Indexing process gets challenging when the scale gets bigger. Parallel techniques, such as MapReduce framework can assist in efficient large-scale indexing process. In this paper we propose PDFindexer, system for indexing scientific papers in PDF using MapReduce programming model. Unlike Web search engines, our target domain is scientific papers, which has pre-defined structure, such as title, abstract, sections, references. Our proposed system enables parsing scientific papers in PDF recreating their structure and performing efficient distributed indexing with MapReduce framework in a cluster of nodes. We provide the overview of the system, their components and interactions among them. We discuss some issues related with the design of the system and usage of MapReduce in parsing and indexing of large document collection.

A Study on Project Management Scheduling Module Development using the Rule and CBR (규칙(Rule) 과 CBR 기법을 활용한 프로젝트 일정관리 모듈 구현에 관한 연구)

  • Sin, Ho-Gyun;Kim, Yeong-Jun;Jeon, Seung-Ho
    • 한국디지털정책학회:학술대회논문집
    • /
    • 2004.05a
    • /
    • pp.343-354
    • /
    • 2004
  • A Project planning is one of the most important processes that determines success and failure of the project. Scope management for project planning is also essential job in system integration project. However project planning is very difficult because lots of factor and their relationships should be considered. Therefore project planning of SI project has been done by project manager s own knowledge and experiences. It is necessary to develop an algorithm of WBS(Work Breakdown Structure) identification & document selection along to project's specificity in project management system using AI technique. This study also present method (ODW model) to cope with the limitations of the existing study that has uniformly customizing the methodology by only project complexity. We propose PPSM(Project planning support module) that apply Rule for determination of route map and document level, and CBR for WBS identification.

  • PDF

Multi-task learning with contextual hierarchical attention for Korean coreference resolution

  • Cheoneum Park
    • ETRI Journal
    • /
    • v.45 no.1
    • /
    • pp.93-104
    • /
    • 2023
  • Coreference resolution is a task in discourse analysis that links several headwords used in any document object. We suggest pointer networks-based coreference resolution for Korean using multi-task learning (MTL) with an attention mechanism for a hierarchical structure. As Korean is a head-final language, the head can easily be found. Our model learns the distribution by referring to the same entity position and utilizes a pointer network to conduct coreference resolution depending on the input headword. As the input is a document, the input sequence is very long. Thus, the core idea is to learn the word- and sentence-level distributions in parallel with MTL, while using a shared representation to address the long sequence problem. The suggested technique is used to generate word representations for Korean based on contextual information using pre-trained language models for Korean. In the same experimental conditions, our model performed roughly 1.8% better on CoNLL F1 than previous research without hierarchical structure.