• Title/Summary/Keyword: document analysis


Performance Improvement by a Virtual Documents Technique in Text Categorization (문서분류에서 가상문서기법을 이용한 성능 향상)

  • Lee, Kyung-Soon;An, Dong-Un
    • The KIPS Transactions:PartB
    • /
    • v.11B no.4
    • /
    • pp.501-508
    • /
    • 2004
  • This paper proposes a virtual relevant document technique for the learning phase of text categorization. The method applies a simple transformation to relevant documents: it creates virtual documents by combining pairs of documents in the training set. A virtual document produced this way has an enriched term vector space, with greater weights for terms that co-occur in the two relevant documents. The experimental results showed a significant improvement over the baseline, demonstrating the usefulness of the proposed method: 71% improvement on the TREC-11 filtering test collection and 11% improvement on the Reuters-21578 test set for the topics with fewer than 100 relevant documents, in micro-averaged F1. The result analysis indicates that the addition of virtual relevant documents contributes to a steady improvement in performance.
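The pairing step described in this abstract can be illustrated with a minimal sketch. It assumes plain term-frequency weights and whitespace tokenization, and simply adds the two term vectors; the function names and the sample texts are hypothetical, and the weighting scheme in the paper itself may differ.

```python
from collections import Counter
from itertools import combinations


def term_vector(text):
    """Bag-of-words term frequencies (lowercased, whitespace tokenization)."""
    return Counter(text.lower().split())


def virtual_documents(relevant_docs):
    """Combine every pair of relevant documents into one virtual document.

    Adding the two term vectors gives a term that appears in both documents
    the sum of its two weights, so co-occurring terms are boosted.
    """
    vectors = [term_vector(doc) for doc in relevant_docs]
    return [a + b for a, b in combinations(vectors, 2)]


# Hypothetical relevant documents for one topic.
relevant = [
    "stock market falls",
    "market prices rise",
    "bond market steady",
]
virtual = virtual_documents(relevant)
```

With three relevant documents this yields three virtual documents, and the shared term "market" carries a higher weight in each of them than any term that occurs in only one document of the pair.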

Health Level 7 Version 3 based Generating Clinical Document Architecture for Medication Administration System (HL7 버전 3 기반의 투약관리시스템을 위한 임상문서구조의 생성)

  • Kim, Genun-Hee;Cho, Su-Mi;Lee, Eun-Joo;Kim, Hwa-Sun;Cho, Hune
    • Journal of Korea Multimedia Society
    • /
    • v.11 no.3
    • /
    • pp.386-397
    • /
    • 2008
  • This study proposes the realization of a standard data model for clinical activities through the development of a clinical document architecture for medication administration, using the Health Level 7 Development Framework (HDF) process, which is based on the object-oriented analysis and development method of Health Level 7 Version 3. Medication administration is the most common activity performed by clinical professionals in healthcare settings. A standardized information model and a structured hospital information system are necessary to achieve evidence-based clinical activities. We used the HDF and various tools (Rose tree, RMIM designer, V3 generator) to create the clinical document architecture (CDA). This allowed us to illustrate each step of the HDF for the administration of medication. This study generated an information model of the medication administration process, which is one clinical activity. It should serve as a fundamental conceptual model that helps information technology (IT) developers who model healthcare information systems understand the international standard methodology.


BubbleDoc: Document Forgery and Tamper Detection through the Agent-Free File System-Awareness in Cloud Environment (BubbleDoc: 클라우드 환경에서의 agent-free 파일시스템 분석을 통한 문서 위/변조 탐지)

  • Jeon, Woo-Jin;Hong, Dowon;Park, Ki-Woong
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.28 no.2
    • /
    • pp.429-436
    • /
    • 2018
  • Electronic documents are efficient to create and manage, but they can easily lose their originality because copies are made during distribution and delivery. For this reason, various security technologies have been applied to electronic documents. However, most security technologies currently in use target document management, such as file access privilege control and file version and history management, and therefore cannot be used in environments where authenticity is absolutely required, such as those handling confidential documents. In this paper, we propose a method to detect document forgery and tampering through analysis of the file system, without installing an agent inside the instance operating system, in a cloud computing environment. BubbleDoc monitors the minimum amount of virtual volume storage in an instance, so it can efficiently detect forgery and tampering of documents. Experimental results show that the proposed technique incurs 0.16% disk read operation overhead when the monitoring cycle for forgery and tampering detection is set to 1,000 ms.

Bibliographic metadata development for the efficient information resource sharing (효율적 정보자원 공유를 위한 서지 메타데이터 XML DTD 개발)

  • Lee, Hye-Jin;Song, In-Seok
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2004.11a
    • /
    • pp.427-433
    • /
    • 2004
  • Most information providers offer integrated retrieval services based on bibliographic metadata and schemas that correspond to each type of document and are developed in a distributed and independent way. However, it is difficult to maintain relational consistency among those heterogeneous databases even when they follow a metadata standard such as MARC or MODS. The main reason is that those standards are limited to describing the general properties of a document regardless of its type and are not applied to defining the relationships among document types. Therefore, it is necessary to define a comprehensive meta model that associates the related databases in a systematic way, so that their semantically common parts can be easily shared and reused without additional effort such as conversion or mapping. In this paper, we first outline the document types for designing the meta model through an empirical analysis of the data schemas of major information providers. We then propose data element definitions, a metadata model, and a modularized XML DTD that support the efficient and consistent management of multiple document types.


Orthogonal Nonnegative Matrix Factorization: Multiplicative Updates on Stiefel Manifolds (Stiefel 다양체에서 곱셈의 업데이트를 이용한 비음수 행렬의 직교 분해)

  • Yoo, Ji-Ho;Choi, Seung-Jin
    • Journal of KIISE:Software and Applications
    • /
    • v.36 no.5
    • /
    • pp.347-352
    • /
    • 2009
  • Nonnegative matrix factorization (NMF) is a popular method for multivariate analysis of nonnegative data, the goal of which is to decompose a data matrix into a product of two factor matrices, with all entries of the factor matrices restricted to be nonnegative. NMF has been shown to be useful for clustering (especially document clustering). In this paper we present an algorithm for orthogonal nonnegative matrix factorization, where an orthogonality constraint is imposed on the nonnegative decomposition of a term-document matrix. We develop multiplicative updates directly from the true gradient on the Stiefel manifold, whereas existing algorithms consider additive orthogonality constraints. Experiments on several document data sets show that our orthogonal NMF algorithms perform better in clustering than the standard NMF and an existing orthogonal NMF.
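As a rough illustration of multiplicative updates under an orthogonality constraint, the sketch below factors X ≈ A S with the constraint placed on the columns of A. The update for A uses the form obtained when the Euclidean gradient is replaced by its Stiefel-manifold counterpart, A ← A ⊙ (X Sᵀ) / (A S Xᵀ A). Which factor carries the constraint, the matrix orientation (documents as rows), the function names, and the toy data are all assumptions made here, not details confirmed by the abstract.

```python
import random


def matmul(a, b):
    """Product of two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(a):
    return [list(col) for col in zip(*a)]


def onmf(x, rank, iters=200, eps=1e-9, seed=0):
    """Sketch of orthogonal NMF, X ~ A S, aiming at A^T A = I."""
    rng = random.Random(seed)
    m, n = len(x), len(x[0])
    a = [[rng.random() + 0.1 for _ in range(rank)] for _ in range(m)]
    s = [[rng.random() + 0.1 for _ in range(n)] for _ in range(rank)]
    xt = transpose(x)
    for _ in range(iters):
        # S <- S * (A^T X) / (A^T A S): the standard NMF update.
        at = transpose(a)
        num = matmul(at, x)
        den = matmul(matmul(at, a), s)
        s = [[s[i][j] * num[i][j] / (den[i][j] + eps) for j in range(n)]
             for i in range(rank)]
        # A <- A * (X S^T) / (A S X^T A): Stiefel-manifold gradient form.
        num = matmul(x, transpose(s))
        den = matmul(matmul(matmul(a, s), xt), a)
        a = [[a[i][k] * num[i][k] / (den[i][k] + eps) for k in range(rank)]
             for i in range(m)]
    return a, s


# Toy document-term matrix with two obvious groups of documents (made up).
x = [
    [5.0, 4.0, 0.0, 0.0],
    [4.0, 5.0, 0.0, 0.0],
    [0.0, 0.0, 5.0, 4.0],
    [0.0, 0.0, 4.0, 5.0],
]
a, s = onmf(x, rank=2)
```

Because every update multiplies a nonnegative entry by a nonnegative ratio, both factors stay nonnegative throughout. For clustering, each row of `a` can be assigned to the column holding its largest entry.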

Segmentation and Contents Classification of Document Images Using Local Entropy and Texture-based PCA Algorithm (지역적 엔트로피와 텍스처의 주성분 분석을 이용한 문서영상의 분할 및 구성요소 분류)

  • Kim, Bo-Ram;Oh, Jun-Taek;Kim, Wook-Hyun
    • The KIPS Transactions:PartB
    • /
    • v.16B no.5
    • /
    • pp.377-384
    • /
    • 2009
  • This paper proposes a new algorithm that classifies the various contents of document images, such as text, figures, graphs, and tables, by segmenting the document images with a local entropy-based histogram and classifying the contents with texture-based PCA. Local entropy and the histogram make the binarization of document images not only robust to various transformations and noise, but also easy and less time-consuming. The texture-based PCA algorithm, applied to each segmented region, exploits the fact that each type of content in a document image has different texture information. As a result, no pre-defined structural information is needed, and classification is fast and efficient. The results demonstrate that the proposed method achieves better segmentation and classification performance on various images and is more efficient than previous methods.
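To make the notion of local entropy concrete, here is a minimal sketch that computes the Shannon entropy of gray levels in non-overlapping tiles of a small image. The tile size, the function names, and the toy image are assumptions for illustration; the abstract does not specify how the paper defines its windows or builds the histogram from the entropy values.

```python
import math
from collections import Counter


def shannon_entropy(values):
    """Shannon entropy (in bits) of the gray-level distribution in `values`."""
    counts = Counter(values)
    total = len(values)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def local_entropy_map(image, block=2):
    """Entropy of each non-overlapping block x block tile of a grayscale image."""
    rows, cols = len(image), len(image[0])
    result = []
    for r in range(0, rows - block + 1, block):
        row_out = []
        for c in range(0, cols - block + 1, block):
            tile = [image[r + i][c + j] for i in range(block) for j in range(block)]
            row_out.append(shannon_entropy(tile))
        result.append(row_out)
    return result


# Toy 4x4 image: mostly white background with a few dark pixels (made up).
image = [
    [255, 255, 0, 255],
    [255, 255, 255, 0],
    [255, 255, 255, 255],
    [255, 255, 255, 255],
]
entropy = local_entropy_map(image)
```

Uniform background tiles have zero entropy, while the tile that mixes dark and light pixels has an entropy of one bit, which is the kind of contrast a segmentation step could threshold on.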

Methods for Investigating of Edit History about MS PowerPoint Files That Using the OOXML Formats (OOXML형식을 사용하는 MS 파워포인트 파일에 대한 편집 이력 조사 방법)

  • Youn, Ji-Hye;Park, Jung-Heum;Lee, Sang-Jin
    • The KIPS Transactions:PartC
    • /
    • v.19C no.4
    • /
    • pp.215-224
    • /
    • 2012
  • Today, individuals and businesses do much of their paperwork on computers, so many documents are created as digital files. These digital files are copied and moved through various media such as USB drives and e-mail. Careful analysis of these digital materials can reveal the work history that accumulated while a document was being edited. Existing research addresses the compound document file format, but the new OOXML format has not been studied with respect to analyzing linkages between different document files, tracking internal order, and finding unsaved files in order to identify how a file was created. As the use of OOXML documents increases further, the ability to trace document work history will be of great help in digital forensic investigations. Therefore, this paper shows, from a forensic viewpoint, how to track the internal order and analyze linkages between files in the new OOXML format.
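A starting point for this kind of inspection is that an OOXML file is a ZIP package whose `docProps/core.xml` part holds core properties such as the creator, the last modifier, a revision counter, and timestamps. The sketch below reads those fields with the Python standard library. It is not the paper's method, which the abstract does not detail; the function name is hypothetical, and the package built here is a synthetic stand-in with made-up values rather than a real PowerPoint file.

```python
import io
import xml.etree.ElementTree as ET
import zipfile

# Namespaces used by the core properties part of an OOXML package.
NS = {
    "cp": "http://schemas.openxmlformats.org/package/2006/metadata/core-properties",
    "dc": "http://purl.org/dc/elements/1.1/",
    "dcterms": "http://purl.org/dc/terms/",
}


def read_core_properties(package):
    """Return selected core properties from an OOXML package (path or file object)."""
    with zipfile.ZipFile(package) as zf:
        root = ET.fromstring(zf.read("docProps/core.xml"))
    fields = {
        "creator": "dc:creator",
        "last_modified_by": "cp:lastModifiedBy",
        "revision": "cp:revision",
        "created": "dcterms:created",
        "modified": "dcterms:modified",
    }
    return {name: root.findtext(path, namespaces=NS) for name, path in fields.items()}


# Synthetic package containing only a core properties part (values are made up).
core_xml = (
    '<cp:coreProperties xmlns:cp="{cp}" xmlns:dc="{dc}" xmlns:dcterms="{dcterms}">'
    "<dc:creator>alice</dc:creator>"
    "<cp:lastModifiedBy>bob</cp:lastModifiedBy>"
    "<cp:revision>3</cp:revision>"
    "<dcterms:created>2012-01-01T00:00:00Z</dcterms:created>"
    "<dcterms:modified>2012-01-02T00:00:00Z</dcterms:modified>"
    "</cp:coreProperties>"
).format(**NS)

buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w") as zf:
    zf.writestr("docProps/core.xml", core_xml)
buffer.seek(0)

props = read_core_properties(buffer)
```

Comparing such fields across several files, for example a shared creator with differing modification times, is one simple way to begin looking for linkages between documents; the other parts of the package would need separate parsing.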

Retrieval Model using Subject Classification Table, User Profile, and LSI (전공분류표, 사용자 프로파일, LSI를 이용한 검색 모델)

  • Woo Seon-Mi
    • The KIPS Transactions:PartD
    • /
    • v.12D no.5 s.101
    • /
    • pp.789-796
    • /
    • 2005
  • Because existing information retrieval systems, in particular library retrieval systems, use exact keyword matching against the user's query, they present the user with massive result sets that include irrelevant information. The user therefore spends extra effort and time extracting the relevant information from the results. This paper proposes SULRM, a retrieval model that uses a Subject Classification Table, a user profile, and LSI (Latent Semantic Indexing) to provide more relevant results. SULRM applies a document filtering technique to classified data and a document ranking technique to non-classified data in the results of keyword-based retrieval. The filtering technique uses the Subject Classification Table, and the ranking technique uses the user profile and LSI. We performed experiments on the filtering technique, the user profile updating method, and the document ranking technique, using the results of the information retrieval system of our university's digital library. When many documents are retrieved, the proposed techniques can provide the user with data filtered and ranked according to the user's subject and preferences.

The Complementary Study for Operational Concept Document(OCD) and Operational Requirements Document(ORD) using MND-AF (MND-AF를 활용한 운용개념기술서(OCD) 및 운용요구서(ORD)에 대한 보완 연구)

  • Cha, Seung Hun;Jang, Jae Duck;Lee, Hye Jin;Choi, Sang Wook;Yoo, Jae Sang
    • Journal of the Korean Society of Systems Engineering
    • /
    • v.16 no.2
    • /
    • pp.118-130
    • /
    • 2020
  • Modern weapon systems are composed of complex systems (System of Systems) and require a complex and advanced operational concept in which missions are performed through interoperability with various weapon systems. In order to derive the operational concept of the weapon system that the military wants to acquire (i.e., single mission, component operation, joint and alliance operations), it is necessary to identify the systems related to the weapon system, the environmental factors, and the restrictions on the weapon system to be developed. Through the derivation of the operational concept, the weapon system acquisition agency can reasonably and accurately extract various and complex requirements. In this paper, we propose a method of using MND-AF to complement the OCD and ORD required in the weapon system acquisition process. MND-AF can increase the understanding and consensus of business stakeholders (users, acquirers, developers, etc.) by showing the results of weapon system analysis from various perspectives. We compare the items in the standard forms of the OCD and ORD with the MND-AF outputs.

An Analysis of Delivery/Transport Documents Content in Relation to the Contract of Carriage under Incoterms 2020 Rules

  • Jeon, Soon-Hwan
    • Journal of Korea Trade
    • /
    • v.25 no.1
    • /
    • pp.203-219
    • /
    • 2021
  • Purpose - The purpose of this study is to review and analyze the contract of carriage and the delivery/transport document in light of the major changes made in the Incoterms® 2020 rules, which entered into force on January 1, 2020. Design/methodology - This study analyzed responsibility for the loading and unloading of goods under the contract of carriage in the Incoterms® 2020 rules, brought into force by the ICC on January 1, 2020, and which document the seller must present as evidence of delivery. Findings - The review revealed that under the C rules, the costs of unloading at the place of destination are determined by the terms of the contract of carriage, and under the DAP and DDP rules, if the seller bears the unloading costs, such costs cannot be recovered from the buyer. To settle this issue, the seller needs to conclude a contract of carriage by sea with the carrier on FI terms. Furthermore, for containerized goods, where FCA should be used, FOB has been misused because the seller could not present an on-board bill of lading in an L/C transaction. However, it was confirmed that under FCA the parties can use an optional mechanism to have an on-board bill of lading issued. Originality/value - The Incoterms® 2020 rules are widely used in international trade by parties to sales contracts around the world, just as the Incoterms® 2010 rules were. This study attempts to reduce or eliminate disputes that may arise from interpretative misunderstandings between the parties to a contract of sale concluded by the seller and the buyer.