• Title/Summary/Keyword: Documents Generation

Search Result 155, Processing Time 0.028 seconds

Design and Implementation of Concept Information Based Universal DTD Generator (개념정보를 포함한 포괄적 DTD 생성기의 설계 및 구현)

  • 최인석;공용해
    • Journal of the Korea Computer Industry Society
    • /
    • v.3 no.7
    • /
    • pp.831-836
    • /
    • 2002
  • There are various information resources on the Internet and people are taking more interest in XML day by day. In XML, the structure of information can be freely defined so that the standardization of documents can be hardly made. If DTD, which is applied to an XML Document representing specific information, is including concept information, it can be freely applied to the structure of document and also contributes to the convenience in information retrieval. In this study, we developed universal DTD Generator in order to automatically generate DTD including concept information. For the generation of universal DTD, the conceptualization of information is required; to conceptualize information, the hierarchical structuring and propertizing are required. The hierarchical structuring represents the inclusive relation of routine concepts for representing information in hierarchical structure, and the propertizing represents the property and mutual relation that the each concept represented in hierarchical structure can have. The defined hierarchical structure and propertization come to generate the universal DTD Generator. The universal DTD generated by DTD Generator can be applied to all the XML Documents representing the same information in different structure. However, the most ideal way is that the information of universal DTD, which can be applied to various documents, is including all the cases. Therefore, the study for designing correct concept information is necessary.

  • PDF

U Based Form Document Generation System for e-Business Sung-Han (XML 기반의 e-비즈니스 문서 생성을 위한 폼 생성시스템)

  • Kim, Seong-Han;Kim, Chang-Su;Jeong, Hoe-Gyeong
    • The KIPS Transactions:PartD
    • /
    • v.9D no.4
    • /
    • pp.713-722
    • /
    • 2002
  • In this paper, XML form generator is designed and implemented on the basis of e-business's DTD (Document Type Definition) document. Rapid evolving for internet services and information infrastructure give many impacts on the e-business, it need to make a new kinds of web-based or electronic-based document formats for e-business transaction trading. In current situations, there are many kinds of document formats on conventional business documents for each companies. And, it has many problems on the aspects of the document reusability and cost to support interoperability between documents for the trading partners. To solve this interoperability of documents, the constructed XML form generator is changing XML form document into HTML (HyperText Markup Language) based web document by XSLT. And it also generates XML business message validating for e-Business DTD by user Inputs.

Development of a Regulatory Q&A System for KAERI Utilizing Document Search Algorithms and Large Language Model (거대언어모델과 문서검색 알고리즘을 활용한 한국원자력연구원 규정 질의응답 시스템 개발)

  • Hongbi Kim;Yonggyun Yu
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.28 no.5
    • /
    • pp.31-39
    • /
    • 2023
  • The evolution of Natural Language Processing (NLP) and the rise of large language models (LLM) like ChatGPT have paved the way for specialized question-answering (QA) systems tailored to specific domains. This study outlines a system harnessing the power of LLM in conjunction with document search algorithms to interpret and address user inquiries using documents from the Korea Atomic Energy Research Institute (KAERI). Initially, the system refines multiple documents for optimized search and analysis, breaking the content into managable paragraphs suitable for the language model's processing. Each paragraph's content is converted into a vector via an embedding model and archived in a database. Upon receiving a user query, the system matches the extracted vectors from the question with the stored vectors, pinpointing the most pertinent content. The chosen paragraphs, combined with the user's query, are then processed by the language generation model to formulate a response. Tests encompassing a spectrum of questions verified the system's proficiency in discerning question intent, understanding diverse documents, and delivering rapid and precise answers.

Feature Generation of Dictionary for Named-Entity Recognition based on Machine Learning (기계학습 기반 개체명 인식을 위한 사전 자질 생성)

  • Kim, Jae-Hoon;Kim, Hyung-Chul;Choi, Yun-Soo
    • Journal of Information Management
    • /
    • v.41 no.2
    • /
    • pp.31-46
    • /
    • 2010
  • Now named-entity recognition(NER) as a part of information extraction has been used in the fields of information retrieval as well as question-answering systems. Unlike words, named-entities(NEs) are generated and changed steadily in documents on the Web, newspapers, and so on. The NE generation causes an unknown word problem and makes many application systems with NER difficult. In order to alleviate this problem, this paper proposes a new feature generation method for machine learning-based NER. In general features in machine learning-based NER are related with words, but entities in named-entity dictionaries are related to phrases. So the entities are not able to be directly used as features of the NER systems. This paper proposes an encoding scheme as a feature generation method which converts phrase entities into features of word units. Futhermore, due to this scheme, entities with semantic information in WordNet can be converted into features of the NER systems. Through our experiments we have shown that the performance is increased by about 6% of F1 score and the errors is reduced by about 38%.

A Study on Ontology Instance Generation Using Keywords (키워드를 활용한 온톨로지 인스턴스 생성에 관한 연구)

  • Han, Kwang-Rok;Kang, Hyun-Min;Sohn, Surg-Won
    • Journal of the Korea Society of Computer and Information
    • /
    • v.15 no.5
    • /
    • pp.1-11
    • /
    • 2010
  • The success of semantic web depends largely on the semantic annotation which systematizes knowledge for the construction and production of ontology. Therefore, the efficiency of semantic annotation is very important in order to change many knowledge expressions and generate into ontology instances. In this paper, we presents a generation system of rule-based ontology instances which are produced accurately and efficiently via semantic annotation in conventional web sites. In conventional studies, the manual process is necessary for finding relevant information, comparing it with ontology, and entering information. We propose a new method that manages keyword data regarding extracted information and rule information separately. Thus, it is quite practical to extract information efficiently from various web documents by adding a small number of keywords and rules. The proposed method shows the possibility of ontology instance generation which reuses the rules and keywords from the various websites.

A Study on Technological Forecasting of Next-Generation Display Technology (차세대 디스플레이 기술의 예측에 관한 연구)

  • Nam, Ki-Woong;Park, Sang-Sung;Shin, Young-Geun;Jung, Won-Gyo;Jang, Dong-Sik
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.10 no.10
    • /
    • pp.2923-2934
    • /
    • 2009
  • This paper presents study on technological forecasting of Next-Generation Display technology. Next-Generation Display technology is one of the emerging technologies lately. So databases on patent documents of this technology were analyzed first. And patent analysis was performed for finding out present technology trend. And the forecast for this technology was made by growth curves which were obtained from forecast models using patent documents. In previous study, Gompertz, Logistic, Bass were used for forecasting diffusion of demand in market. Gompertz, Logistic models which were often used for technological forecasting, too. So, two models were applied in this study. But Gompertz, Logistic models only consider internal effect of diffusion. And it is difficult to estimate maximum value of growth in two models. So, Bass model which considers both internal effect and external effect of diffusion was also applied. And maximum value of growth in Gompertz, Logistic models was estimated by Bass model.

COBie Based Maintenance Document Generation of Railway Track (COBie 기반 철도 선로유지관리 문서 생성)

  • Seo, Kyung-Wan;Kwon, Tae-Ho;Lee, Sang-Ho
    • Journal of the Computational Structural Engineering Institute of Korea
    • /
    • v.30 no.4
    • /
    • pp.307-312
    • /
    • 2017
  • In this study, we proposed a method to generate a maintenance documents for railway track through Construction Operations Building information exchange(COBie) which is a subset of Industry Foundation Classes(IFC), a data model for Building Information Modeling(BIM). In order to define the items necessary for railway track maintenance document generation, we analyzed the guideline of maintenance and management to track by the Ministry of Land, Infrastructure and Transport(MLTM), and defined the way to refer to the information items in the COBie spreadsheet. The additional properties not supported in IFC, were created for generation of an Information model that reflecting maintenance information items of railway track by applying user-defined property set within the IFC framework. An IFC-based Information model reflecting the user-defined property was implemented through BIM software, and rail track maintenance information items were transferred to COBie spreadsheet according to the defined approach. It is tested that the information can be transferred from the IFC-based as-built model to the COBie spreadsheet, which can be used to generate the necessary documents for railway facility maintenance work.

Facilitating Web Service Taxonomy Generation : An Artificial Neural Network based Framework, A Prototype Systems, and Evaluation (인공신경망 기반 웹서비스 분류체계 생성 프레임워크의 실증적 평가)

  • Hwang, You-Sub
    • Journal of Intelligence and Information Systems
    • /
    • v.16 no.2
    • /
    • pp.33-54
    • /
    • 2010
  • The World Wide Web is transitioning from being a mere collection of documents that contain useful information toward providing a collection of services that perform useful tasks. The emerging Web service technology has been envisioned as the next technological wave and is expected to play an important role in this recent transformation of the Web. By providing interoperable interface standards for application-to-application communication, Web services can be combined with component based software development to promote application interaction both within and across enterprises. To make Web services for service-oriented computing operational, it is important that Web service repositories not only be well-structured but also provide efficient tools for developers to find reusable Web service components that meet their needs. As the potential of Web services for service-oriented computing is being widely recognized, the demand for effective Web service discovery mechanisms is concomitantly growing. A number of public Web service repositories have been proposed, but the Web service taxonomy generation has not been satisfactorily addressed. Unfortunately, most existing Web service taxonomies are either too rudimentary to be useful or too hard to be maintained. In this paper, we propose a Web service taxonomy generation framework that combines an artificial neural network based clustering techniques with descriptive label generating and leverages the semantics of the XML-based service specification in WSDL documents. We believe that this is one of the first attempts at applying data mining techniques in the Web service discovery domain. We have developed a prototype system based on the proposed framework using an unsupervised artificial neural network and empirically evaluated the proposed approach and tool using real Web service descriptions drawn from operational Web service repositories. We report on some preliminary results demonstrating the efficacy of the proposed approach.

Automatic Generation of Information Extraction Rules Through User-interface Agents (사용자 인터페이스 에이전트를 통한 정보추출 규칙의 자동 생성)

  • 김용기;양재영;최중민
    • Journal of KIISE:Software and Applications
    • /
    • v.31 no.4
    • /
    • pp.447-456
    • /
    • 2004
  • Information extraction is a process of recognizing and fetching particular information fragments from a document. In order to extract information uniformly from many heterogeneous information sources, it is necessary to produce information extraction rules called a wrapper for each source. Previous methods of information extraction can be categorized into manual wrapper generation and automatic wrapper generation. In the manual method, since the wrapper is manually generated by a human expert who analyzes documents and writes rules, the precision of the wrapper is very high whereas it reveals problems in scalability and efficiency In the automatic method, the agent program analyzes a set of example documents and produces a wrapper through learning. Although it is very scalable, this method has difficulty in generating correct rules per se, and also the generated rules are sometimes unreliable. This paper tries to combine both manual and automatic methods by proposing a new method of learning information extraction rules. We adopt the scheme of supervised learning in which a user-interface agent is designed to get information from the user regarding what to extract from a document, and eventually XML-based information extraction rules are generated through learning according to these inputs. The interface agent is used not only to generate new extraction rules but also to modify and extend existing ones to enhance the precision and the recall measures of the extraction system. We have done a series of experiments to test the system, and the results are very promising. We hope that our system can be applied to practical systems such as information-mediator agents.

A Study on the Distinction of Registration Regulations for Herbal Medicines (생약제제의 등록규정 차별화에 관한 연구)

  • Joo, Yun Jung;Oh, Jung Mi;Han, Byong Hyon;Hong, Sung Sun
    • Korean Journal of Clinical Pharmacy
    • /
    • v.11 no.2
    • /
    • pp.68-77
    • /
    • 2001
  • Herbal medicines have been used since ancient times as medicines to treat and relieve the symptoms of many different human diseases. However, so far, relatively few herbal medicines have been evaluated scientifically to prove their safety, potential benefits and effectiveness. This study was conducted to provide the groundwork for improving the current registration regulations for herbal medicines in distinction from synthetic medicines. The study was performed based on the literature research and individual interviews with 5 experts who had extensive experience in registration of herbal medicines. When compared with synthetic drugs, herbal medicines exhibit some marked differences, namely the active principles are frequently unknown, standardization, stability and quality control are not easy, they are usually mixtures of complex compounds. Second, the current regulations for herbal medicines are reviewed by comparison of foreign regulation systems like the one in China. The regulation requirements of herbal medicine in China are in distinction from synthetic drugs. The authors conclude that new registration requirements for the herbal medicines should be changed as follows; the toxicity and efficacy data should be submitted as mixed herbal preparation and the documents and other research on the reproduction and generation toxicity need to be shown for the proof of reproduction and generation toxicity, if needed.

  • PDF