• Title/Summary/Keyword: Automatic Document Generation

Algorithm for Uniform DTD Automatic Generation (통합 DTD 자동생성 알고리즘)

  • 이복례;유춘식;김상은;김용성
    • Proceedings of the Korean Information Science Society Conference / 1998.10b / pp.356-358 / 1998
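  • SGML (Standard Generalized Markup Language) is an international standard established for exchanging multimedia documents between heterogeneous systems [2]. Creating an SGML document requires generating a DTD (Document Type Definition), and because each DTD is generated according to the logical structure of its document, multiple databases end up being built. This raises construction costs and makes retrieval inefficient. This paper therefore proposes a unified DTD generation algorithm that merges the individual DTDs of documents with similar logical structures into a single DTD. Applying the algorithm means that only one database of SGML documents with a single logical structure has to be built, which provides an effective retrieval environment. To this end, the paper applies the algorithm to papers published in the journals of Korean academic societies, generates a unified DTD, and demonstrates its usefulness.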

A House Design Automation System Based on the "Design-by-Novice" Paradigm

  • Kim, Uk;Choi, Jinwon;Kim, SungAh
    • Architectural research / v.1 no.1 / pp.23-30 / 1999
  • This research investigates a system for house design automation. The system is based on an object-oriented building data model, aiming to support the house design process conducted by non-expert users. Its object model, with simple yet powerful user interfaces, enables a CAD system to handle a complicated building system with much ease. Hence, the model dramatically simplifies the design process beyond just the automatic document generation. In this paper, we discuss the aspects of the building data model, introduce critical concepts such as grid objects and structured floor plan, and present a prototype system called GPLAN. The system is implemented in the framework of our building data model, and it provides a host of intelligent features that have been proved useful for house design automation.

Automatic document generation for effective project management plan specification (효과적인 프로젝트 관리 계획서위한 프로젝트 문서 생성 자동화)

  • Kang, Geon-Hee;Son, Hyun Seung;Yi, Geun Sang;Kim, R. Young Chul;Lee, Sang Eun
    • Proceedings of the Korea Information Processing Society Conference / 2015.10a / pp.959-961 / 2015
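  • Many software development companies today concentrate solely on product development and have no capacity to produce the documentation deliverables needed for quality maintenance and management. Project document management itself is ad hoc, and documents tend to be produced only later, at evaluation or acceptance inspection time. To solve this problem, we propose a method that automatically generates documents in step with the real-time progress of a project. The method combines XML and XSLT techniques with an existing integrated management (CI) system so that project plan documents can be extracted automatically. As a result, even small businesses with few staff can manage projects efficiently, and we expect the mismatch between the actual documents and the progress of the project to be greatly reduced.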

Best Practice on Software Traceability Environment based on PaaS Cloud Service

  • Jang, Woo Sung;Kim, Janghwan;Kim, R. Young Chul
    • International journal of advanced smart convergence / v.9 no.4 / pp.149-155 / 2020
  • In the software industry of Korean small and medium-sized enterprises (SMEs), the development process is often immature. This can lead to failures in quality control and output management and, as a result, to degraded software quality. To address the problem, the software visualization technique from the Software Engineering Center of the National IT Industry Promotion Agency can be applied. We have experience mentoring SMEs not only in visualizing the software development process but also in various other visualization processes. However, the existing software visualization environment was difficult to install and costly in time. This paper proposes a software visualization environment delivered through a cloud service, along with a case of building such an environment. We expect this method to make it easier to build a visualization environment and to improve the quality of SME software.

A Knowledge-based Wrapper Learning Agent for Semi-Structured Information Sources (준구조화된 정보소스에 대한 지식기반의 Wrapper 학습 에이전트)

  • Seo, Hee-Kyoung;Yang, Jae-Young;Choi, Joong-Min
    • Journal of KIISE:Software and Applications / v.29 no.1_2 / pp.42-52 / 2002
  • Information extraction (IE) is the process of recognizing and fetching particular information fragments from a document. In previous work, most IE systems generated their extraction rules, called wrappers, manually. Although manual wrapper generation may achieve more accurate extraction, it has problems in flexibility, extensibility, and efficiency. Other studies that generate wrappers automatically have difficulty acquiring and representing useful domain knowledge and coping with the structural heterogeneity among different information sources; as a result, real-world information sources with complex document structures could not be analyzed correctly. To resolve these problems, this paper presents an agent-based information extraction system named XTROS that exploits domain knowledge to learn from documents in a semi-structured information source. The system generates a wrapper for each information source automatically and performs information extraction and information integration by applying this wrapper to the corresponding source. In XTROS, both the domain knowledge and the wrapper are represented as XML-type documents. The wrapper generation algorithm first recognizes the meaning of each logical line of a sample document by using the domain knowledge, and then finds the most frequent pattern in the sequence of semantic representations of the logical lines. Finally, the location and structure of this pattern, represented as an XML document, become the wrapper. Based on tests of XTROS on several real-estate information sites, we claim that it creates correct wrappers for most Web sources and consequently facilitates effective information extraction and integration for heterogeneous and complex information sources.
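
The two-step idea in this abstract (label each logical line with domain knowledge, then find the most frequent label pattern) can be sketched roughly as follows. This is an illustrative toy, not XTROS itself: the `DOMAIN_KNOWLEDGE` table, the function names, and the fixed pattern length are all assumptions made for the example.

```python
import re
from collections import Counter

# Hypothetical domain knowledge: semantic label -> tokens that signal it.
DOMAIN_KNOWLEDGE = {
    "PRICE": {"$"},
    "BED": {"bed", "br"},
    "BATH": {"bath", "ba"},
}

def label_line(line):
    """Return the first semantic label whose keyword appears as a token, else None."""
    tokens = set(re.findall(r"[a-z]+|\$", line.lower()))
    for label, keywords in DOMAIN_KNOWLEDGE.items():
        if tokens & keywords:
            return label
    return None

def most_frequent_pattern(lines, length):
    """Count every fully labeled window of `length` lines and return the most common one."""
    labels = [label_line(line) for line in lines]
    windows = Counter()
    for i in range(len(labels) - length + 1):
        window = tuple(labels[i:i + length])
        if None not in window:  # skip windows containing unrecognized lines
            windows[window] += 1
    return windows.most_common(1)[0] if windows else None
```

A real wrapper would also record where the pattern occurs in the page structure; this sketch only recovers the label sequence and its frequency.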

Prototype Kite Development for Wind Power Generation (고공풍력 발전용 시제품 Kite 비행체 개발)

  • Kwon, Jae-Wook;Kim, Jong-Chul;Moon, Sang-Man;Choi, Ji-Ung
    • Proceedings of the Korean Institute of Navigation and Port Research Conference / 2012.10a / pp.259-260 / 2012
  • With increasing interest in alternative energy sources for green growth, this paper presents a study of kite flight control based on the concept that an autonomously flying kite can harvest energy. A prototype kite has been designed, and a flight test under manual control with a remote controller was performed to assess the feasibility of fully automatic flight control. For future research, test data should be collected through many flight tests under various environments.

Automatic Text Summarization based on Selective Copy mechanism against for Addressing OOV (미등록 어휘에 대한 선택적 복사를 적용한 문서 자동요약)

  • Lee, Tae-Seok;Seon, Choong-Nyoung;Jung, Youngim;Kang, Seung-Shik
    • Smart Media Journal / v.8 no.2 / pp.58-65 / 2019
  • Automatic text summarization is the process of shortening a text document by either extraction or abstraction. Recent work applies abstraction approaches inspired by deep learning methods that scale to large document collections. Abstractive text summarization relies on pre-generated word embedding information. Low-frequency but salient words, such as technical terms, are seldom included in dictionaries; this is the so-called out-of-vocabulary (OOV) problem. OOV words degrade the performance of the neural encoder-decoder model. To address OOV words in abstractive text summarization, we propose a copy mechanism that facilitates copying new words in the target document while generating summary sentences. Unlike previous studies, the proposed approach combines accurate pointing information with a selective copy mechanism based on a bidirectional RNN and a bidirectional LSTM. In addition, a neural network gate model that estimates the generation probability and a loss function that optimizes the entire abstraction model have been applied. The dataset was constructed from a collection of abstracts and titles of journal articles. Experimental results show that both ROUGE-1 (based on word recall) and ROUGE-L (based on the longest common subsequence) of the proposed encoder-decoder model improved, to 47.01 and 29.55, respectively.
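
How a generation-probability gate lets a model emit OOV words can be illustrated with the generic pointer-generator style mixture below. This is a sketch under assumptions, not the paper's exact formulation: the function name, the plain-dictionary representation, and the toy numbers are all hypothetical.

```python
def mix_distributions(p_gen, vocab_probs, attention, source_tokens):
    """Blend generating from the vocabulary with copying from the source.

    P(w) = p_gen * P_vocab(w) + (1 - p_gen) * (attention mass on source positions holding w)
    """
    # Scale the vocabulary distribution by the generation probability.
    final = {word: p_gen * prob for word, prob in vocab_probs.items()}
    # Add copy probability for each source token, including OOV tokens.
    for token, weight in zip(source_tokens, attention):
        final[token] = final.get(token, 0.0) + (1.0 - p_gen) * weight
    return final

# "graphene" is absent from the toy vocabulary but can still be produced by copying.
final = mix_distributions(
    p_gen=0.5,
    vocab_probs={"the": 0.6, "model": 0.4},
    attention=[0.25, 0.75],
    source_tokens=["the", "graphene"],
)
```

Because both input distributions sum to one, the blended distribution does as well, so it can be used directly for decoding or in a likelihood loss.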

The Design and Implementation of the System for Processing Well-Formed XML Document on the Client-side (클라이언트 상의 Well-Formed XML 문서 처리 시스템의 설계 및 구현)

  • Song, Jong-Chul;Moon, Byung-Joo;Hong, Gi-Chai;Cheong, Hyun-Soo;Kim, Gyu-Tae;Lee, Soo-Youn
    • The Transactions of the Korea Information Processing Society / v.7 no.10 / pp.3236-3246 / 2000
  • XML is a meta-language like SGML and can be regarded as a simplified, Internet-oriented version of SGML that is used in conjunction with XLL, XPointer, and XSL. The W3C also defined the DTD-less well-formed XML document so that XML documents can be used on the Web. However, no existing system combines browsing, linking, and DTD generation facilities to process DTD-less well-formed XML documents efficiently. This paper studies the design and implementation of a client-side system for processing DTD-less well-formed XML documents. The system consists of a well-formed XML viewer that displays well-formed XML documents, an XLL processor that processes XLL, and an Auto DTD generator that automatically constructs DTDs from multiple documents of the same class. The study focuses on automatic DTD generation during hyperlink navigation and on an implementation of extended links based on XLL and XPointer. ID and XPointer location addresses are used as the addressing modes in links. The implementation confirms the validity of the extended link facilities, extracts DTDs from well-formed XML documents of the same class that share the same root element, and constructs a generalized DTD.
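
Constructing a generalized DTD from several well-formed documents of the same class can be sketched as below. This is a deliberately coarse illustration, not the paper's Auto DTD generator: the function name `infer_dtd` is hypothetical, attributes are ignored, and every content model is relaxed to a repeatable choice so that all sample documents remain valid.

```python
import xml.etree.ElementTree as ET

def infer_dtd(xml_documents):
    """Infer relaxed <!ELEMENT> declarations from a list of XML strings."""
    children = {}  # element name -> child names in first-seen order
    has_text = {}  # element name -> True if any instance carries character data
    for doc in xml_documents:
        root = ET.fromstring(doc)
        for elem in root.iter():
            kids = children.setdefault(elem.tag, [])
            for child in elem:
                if child.tag not in kids:
                    kids.append(child.tag)
            text_present = bool((elem.text or "").strip()) or any(
                (child.tail or "").strip() for child in elem
            )
            has_text[elem.tag] = has_text.get(elem.tag, False) or text_present
    declarations = []
    for name, kids in children.items():
        if kids and has_text[name]:
            model = "(#PCDATA|" + "|".join(kids) + ")*"  # mixed content
        elif kids:
            model = "(" + "|".join(kids) + ")*"          # element-only content
        elif has_text[name]:
            model = "(#PCDATA)"
        else:
            model = "EMPTY"
        declarations.append(f"<!ELEMENT {name} {model}>")
    return "\n".join(declarations)

docs = [
    "<memo><to>Kim</to><body>Hi</body></memo>",
    "<memo><to>Lee</to><cc>Park</cc><body>Bye</body></memo>",
]
print(infer_dtd(docs))
```

The repeatable-choice model trades precision for safety: it accepts every observed document but does not capture ordering or cardinality, which a fuller generator would try to recover.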

A Model of Natural Language Information Retrieval Using Main Keywords and Sub-keywords (주 키워드와 부 키워드를 이용한 자연언어 정보 검색 모델)

  • Kang, Hyun-Kyu;Park, Se-Young
    • The Transactions of the Korea Information Processing Society / v.4 no.12 / pp.3052-3062 / 1997
  • Information retrieval (IR) aims to retrieve relevant information that satisfies a user's information needs. However, a major role of IR systems is not just to generate sets of relevant documents, but to help determine which documents are most likely to be relevant to the given requirements. Various attempts have been made in the recent past to use syntactic analysis methods to generate the complex constructions that are essential for content identification in automatic text analysis systems. Unfortunately, methods based on syntactic understanding alone are known not to be powerful enough to produce complete analyses of arbitrary text samples. In this paper, we present a document ranking method based on two-level ranking. The first level is used to retrieve the documents, and the second level to reorder the retrieved documents. The main keywords used in the first level are defined as nouns and/or compound nouns that have good document discrimination power. The sub-keywords used in the second level are defined as adjectives, adverbs, and/or verbs that are not main keywords, together with function words. An empirical study was conducted on a Korean encyclopedia with 23,113 entries and 161 Korean natural language queries collected from end users. 85.0% of the natural language queries contained sub-keywords. The two-level document ranking method provides a significant improvement in retrieval effectiveness over traditional ranking methods.
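
The two-level scheme described here can be sketched as a retrieve-then-reorder function. The scoring below is an assumption made for illustration (counts of distinct matching keywords, with sub-keywords used only to break ties among documents with equal main-keyword scores); the abstract does not specify the paper's actual weighting, and the function name is hypothetical.

```python
def two_level_rank(documents, main_keywords, sub_keywords):
    """Rank documents: level 1 retrieves by main keywords, level 2 reorders by sub-keywords.

    documents: dict mapping document id -> text
    main_keywords, sub_keywords: sets of lowercase words
    """
    scored = []
    for doc_id, text in documents.items():
        words = set(text.lower().split())
        main_score = len(words & main_keywords)
        if main_score == 0:
            continue  # level 1: documents without any main keyword are not retrieved
        sub_score = len(words & sub_keywords)
        scored.append((doc_id, main_score, sub_score))
    # Level 2: reorder retrieved documents, higher scores first, ids as a stable tie-break.
    scored.sort(key=lambda item: (-item[1], -item[2], item[0]))
    return [doc_id for doc_id, _, _ in scored]

documents = {
    "d1": "korean encyclopedia entry",
    "d2": "large korean encyclopedia quickly updated",
    "d3": "weather report",
}
ranking = two_level_rank(documents, {"korean", "encyclopedia"}, {"large", "quickly"})
# ranking == ["d2", "d1"]: both match the main keywords, and d2 also matches the sub-keywords.
```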

ER2XML: An Implementation of XML Schema Generator based on the Entity-Relationship Model (ER2XML :개체-관계 모델을 기반으로한 XML Schema 생성기의 구현)

  • Kim Chang Suk;Son Dong-Cheul
    • The KIPS Transactions:PartD / v.12D no.1 s.97 / pp.1-12 / 2005
  • XML is emerging as the standard language for data exchange on the Web, so demand for XML Schema (W3C XML Schema Spec.), which validates XML documents, is increasing. However, despite its variety of data types and rich expressiveness, XML Schema is hard to design with because of its complexity. This paper presents a simple way to design XML Schema using a fundamental tool of database design, the Entity-Relationship model. The Entity-Relationship model cannot be converted directly to XML Schema because of mismatches between the two models, so we present algorithms that generate XML Schema from the Entity-Relationship model. The algorithms produce XML Schema code using a hierarchical view representation. An important objective of this automatic generation is to preserve XML Schema's characteristics, such as reusability, global and local ability, extensibility, and various type changes.
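
As a rough picture of what generating XML Schema from an ER description involves, the sketch below maps a single entity and its attributes to an `xs:element` with a `complexType`. It is not the ER2XML algorithm: relationships, keys, and the hierarchical view representation are left out, and the function name and input format are assumptions.

```python
def entity_to_xsd(entity, attributes):
    """Emit a minimal XML Schema for one entity.

    attributes: list of (attribute_name, xsd_type) pairs, e.g. ("name", "xs:string")
    """
    lines = [
        '<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema">',
        f'  <xs:element name="{entity}">',
        "    <xs:complexType>",
        "      <xs:sequence>",
    ]
    # Each ER attribute becomes a child element with a built-in simple type.
    for name, xsd_type in attributes:
        lines.append(f'        <xs:element name="{name}" type="{xsd_type}"/>')
    lines += [
        "      </xs:sequence>",
        "    </xs:complexType>",
        "  </xs:element>",
        "</xs:schema>",
    ]
    return "\n".join(lines)

print(entity_to_xsd("Student", [("id", "xs:integer"), ("name", "xs:string")]))
```

Declaring the entity as a global element keeps it reusable from other schemas; mapping relationships would require additional nesting or key references that this sketch does not attempt.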