• Title/Summary/Keyword: Korean document

Search Result 2,875, Processing Time 0.036 seconds

Accelerating Keyword Search Processing over XML Documents using Document-level Ranking (문서 단위 순위화를 통한 XML 문서에 대한 키워드 검색 성능 향상)

  • Lee, Hyung-Dong;Kim, Hyoung-Joo
    • Journal of KIISE:Databases
    • /
    • v.33 no.5
    • /
    • pp.538-550
    • /
    • 2006
  • XML Keyword search enables us to get information easily without knowledge of structure of documents and returns specific and useful partial document results instead of whole documents. Element level query processing makes it possible, but computational complexity, as the number of documents grows, increases significantly overhead costs. In this paper, we present document-level ranking scheme over XML documents which predicts results of element-level processing to reduce processing cost. To do this, we propose the notion of 'keyword proximity' - the correlation of keywords in a document that affects the results of element-level query processing using path information of occurrence nodes and their resemblances - for document ranking process. In benefit of document-centric view, it is possible to reduce processing time using ranked document list or filtering of low scored documents. Our experimental evaluation shows that document-level processing technique using ranked document list is effective and improves performance by the early termination for top-k query.

Document Replacement Policy by Web Site Popularity (웹 사이트의 인기도에 의한 도큐먼트 교체정책)

  • Yoo, Hang-Suk;Chang, Tae-Mu
    • Journal of the Korea Society of Computer and Information
    • /
    • v.13 no.1
    • /
    • pp.227-232
    • /
    • 2008
  • General web caches save documents temporarily into themselves on the basis of those documents. And when a corresponding document exists within the cache on user's request. web cache sends the document to corresponding user. On the contrary. when there is not any document within the cache, web cache requests a new document to the related server to copy the document into the cache and then turn it back to user. Here, web cache uses a replacement policy to change existing document into a new one due to exceeded capacity of cache. Typical replacement policy includes document-based LRU or LFU technique and other various replacement policies are used to replace the documents within cache effectively. However. these replacement policies function only with regard to the time and frequency of document request. not considering the popularity of each web site. Based on replacement policies with regard to documents on frequent requests and the popularity of each web site, this paper aims to present the document replacement policies with regard to the popularity of each web site, which are suitable for latest network environments to enhance the hit-ratio of cache and efficiently manage the contents of cache by effectively replacing documents on intermittent requests by new ones.

  • PDF

A Study of Document Creation and Management in Braille Libraries (점자도서관의 문서 생산과 관리에 관한 연구)

  • Seok, Jeong Eun
    • The Korean Journal of Archival Studies
    • /
    • no.40
    • /
    • pp.181-223
    • /
    • 2014
  • This study aims to present a Braille Library document creation and management status, and to identify ways to improve. This research field surveys and interviews were conducted three institutions, the quality requirements of the ISO 15489 standards. As a result, the Braille Library document management improvement plan is as follows. First, the policy and regulatory maintenance is needed. Copy of the regulations on the creation and management, access rights and related document management policies should be developed. Second, the document creation process needs to be improved. Electronic approval system responsible for the creation of persons who are visually impaired, visually impaired people can read documents created during document creation, and introduced measures and braille labels on the files attached to will have to be prepared. Third, the document management process needs to be improved. Changes in the creation copy of the records, and preserved along with the preservation of the original and the copy to have the same period, appointed to manage one set of all copies of the authentic copy of the plan is also required. Finally, for document management system should be introduced. Systematic document management system that can be introduced is required. This system will be designed to be accessible to the visually impaired, the search.

The Chosun Governor General Office's Administration regarding Official Documents (조선총독부 공문서(公文書) 제도 -기안(起案)에서 성책(成冊)까지의 과정을 중심으로-)

  • Lee, Seung-il
    • The Korean Journal of Archival Studies
    • /
    • no.9
    • /
    • pp.3-40
    • /
    • 2004
  • In this article, the elements usually included in the official documents issued by the Chosun Governor General office, the process of a certain document being put together and legally authorized, and its path of circulation and preservation are all examined. In order to create an official document of the Governor General office with legal authorization, a draft of a bill had to go through several discussions and a subsequent agreement before it was finally approved. Personnels involved in the discussion stage had the authority to ask for modifications and retouching of the draft, and the modifying process were all recorded in order to make clear who was responsible for a certain change or who objected to what at any given stage of the process. The approved version of an official document was called the 'Completed one(成案), and it was issued after the contents were turned into a fair copy by the office that originated the draft in the first place. With the original finalized version left in custody of that office, the fair copy was handed over to the Document department which was responsible for issuing outgoing documents. After the document was issued and the contained orders were carried out, the originally involved offices began to classify the documents according to their own standards and measures for safekeeping, but it was the Document department that was mainly responsible for document preservation. The Document department classified the documents according to related offices, nature of the documents(편찬류별), and most suitable preservation methods(보존종별). The documents were made into books, and documents to be permanently destroyed were handed over to the Account office where they would be demolished. The manners of document processing of the Chosun Governor General office was in fact a modified version of the manners of the Japanese government. Modifications were made so that the process would be more suitable to the situations and environment of the Chosun society. The office's managing process was inherited by the Chosun government after the Liberation, and cast a significant impact upon the document managing manners of the Korean authorities. The official document administration of the Chosun Governor General office marked both the beginning of the colony document administration, and also the beginning of a modernized document managing system.

The Engineering Change Document Management using SGML in PDM (SGML을 활용한 PDM에서의 설계변경문서관리)

  • Kim, Joon-Oh;Kim, Sunn-Ho
    • IE interfaces
    • /
    • v.10 no.2
    • /
    • pp.79-90
    • /
    • 1997
  • Documents in a traditional PDM(Product Data Management) system have been managed in a form of scanned document files or electronic documents developed by specific tools. Though each tool manages documents with its own systematical methods, it has drawbacks in data search, data integration and interchange, etc. For this reason, in this research we propose an efficient document management system for PDM by using the SGML(Standard Generalized Markup Language), one of CALS and ISO standards for document interchanges. Among documents to be managed in PDM, the engineering change notification (ECN) is taken into account. The DTD (Document Type Definition) has been constucted based on the logical analysis of the documents format, In addition, based on the DTD, DB classes have been designed by object-oriented paradigms and a prototype for document input/output and search has been developed using UniSQL ORDBMS (Object-Relational DBMS) and PowerBuilder under the client/server environment.

  • PDF

A Research on Origin of Provisions in Samhwaja-hyangyakbang(三和子鄕藥方) noted in Hyangyakjipseongbang(鄕藥集成方) (향약집성방(鄕藥集成方)에 나타난 삼화자향약방(三和子鄕藥方) 조문(條文)의 연원(淵源)에 대한 연구)

  • Sheen Yeong-il
    • Herbal Formula Science
    • /
    • v.5 no.1
    • /
    • pp.85-98
    • /
    • 1997
  • Samhwaja-hyangyakbang has been known as the book of herbalogy published by the man, Samhwaja. But there are no records about the additor, and the absence of the document, it is so difficult to be informed about the time when it published and other details. But in this document and Hyangyakgugeubbang, there are similar prescriptions to Sublingual Inflammation, Aphtose, Anthrax, Furoncle, Ulcer, Dysentery, etc. So the time it published is estimated to Goryo-dynasty Gojong epoch(1232-1251) when the Hyangyakgugeubbang was published. In addition, this document seems to be basis of Hyangyakjipseongbang, Because, Hyangyakjipseongbang quoted more than 140 provisions from this document. Prescriptions that are different from other books in dosage or taking method, took those of Biyebaekyobang. In explanation or classification of disease, this document almost copied those of Biyebaekyobang, so this document take Biyebaekyobang for origin and take Sikeuisimkam, Taepyungseonghyebang, etc, for reference.

  • PDF

Document Layout Analysis Based on Fuzzy Energy Matrix

  • Oh, KangHan;Kim, SooHyung
    • International Journal of Contents
    • /
    • v.11 no.2
    • /
    • pp.1-8
    • /
    • 2015
  • In this paper, we describe a novel method for document layout analysis that is based on a Fuzzy Energy Matrix (FEM). A FEM is a two-dimensional matrix that contains the likelihood of text and non-text and is generated through the use of Fuzzy theory. The key idea is to define an Energy map for the document to categorize text and non-text. The proposed mechanism is designed for execution with a low-resolution document image, and hence our method has a fast processing speed. The proposed method has been tested on public ICDAR 2009 datasets to conduct a comparison against other state-of-the-art methods, and it was also tested with Korean documents. The results of the experiment indicate that this scheme achieves superior segmentation accuracy, in terms of both precision and recall, and also requires less time for computation than other state-of-the-art document image analysis methods.

The Development of Web Browsed Electronic Document Interchanges System (초고속정보통신망상에서 웹 기반의 전자문서교환(EDI) 시스템 구현)

  • Kim, Nak-Hyun;Roh, Myung-Ho
    • IE interfaces
    • /
    • v.13 no.2
    • /
    • pp.258-265
    • /
    • 2000
  • EDI(Electronic Data Interchange) allows the exchange of business information and computer-processable data in a standard, structured format electronically between organizational entities. EDI handles the restructuring of a business document into the standard format so that it can be transmitted from one computer to another. This paper identifies features and technologies of web browsed electronic document exchange system as follows 1) the fundamental technologies that consists of the EDI technologies, the Internet/Web technologies, the security/authentication techniques, and the XML implementation technologies. 2) the functions that consists of the document standards, transfer technology of the document, encryption and authentication 3) the implemented Web-EDI systems that consists of document generation module, encryption and authentication module, transfer module, acknowledgement module, administration module. In this paper, the Web-based EDI system implemented from the researched technologies will be installed on the EDI servers owned by corporate customers and enable the exchange of documents between each installed companies.

  • PDF

Document Clustering based on Level-wise Stop-word Removing for an Efficient Document Searching (효율적인 문서검색을 위한 레벨별 불용어 제거에 기반한 문서 클러스터링)

  • Joo, Kil Hong;Lee, Won Suk
    • The Journal of Korean Association of Computer Education
    • /
    • v.11 no.3
    • /
    • pp.67-80
    • /
    • 2008
  • Various document categorization methods have been studied to provide a user with an effective way of browsing a large scale of documents. They do compares set of documents into groups of semantically similar documents automatically. However, the automatic categorization method suffers from low accuracy. This thesis proposes a semi-automatic document categorization method based on the domains of documents. Each documents is belongs to its initial domain. All the documents in each domain are recursively clustered in a level-wise manner, so that the category tree of the documents can be founded. To find the clusters of documents, the stop-word of each document is removed on the document frequency of a word in the domain. For each cluster, its cluster keywords are extracted based on the common keywords among the documents, and are used as the category of the domain. Recursively, each cluster is regarded as a specified domain and the same procedure is repeated until it is terminated by a user. In each level of clustering, a user can adjust any incorrectly clustered documents to improve the accuracy of the document categorization.

  • PDF

A Study on Written Year and Contents of 『Naeuiweonshikryef』 (내의원 편 『내의원(內醫院) 식례(式例)』의 저술 시기와 내용 연구)

  • Park, Hun-Pyeng
    • The Journal of Korean Medical History
    • /
    • v.28 no.1
    • /
    • pp.39-51
    • /
    • 2015
  • Naeuiweon (內醫院) is the royal medical office of Joseon Dynasty. "Naeuiweonshikrye (內醫院式例)" contains various regulations of Naeuiweon in early 19th century. Therefore, attention has been utilized by several researchers. However, these studies show partial side of this document. The purpose of this study is to introduce and analyze contents of "Naeuiweonshikrye". Additionally, as through the body of the written time of the contents was analyzed this document. The authors of this study found that. First, "Naeuiweonshikrye" is estimated in the volume whether modified or supplemented prior "Naeuiweonji (內醫院志)". Second, the written time of this document is about Sunjo'10(1810). Third, "Naeuiweonshikrye" is the primary document that provides a wealth of information about the actual operational and regulatory Naeuiweon (內醫院) in the early 19th century. There is no other material information has been recorded only in the literature. For example, there are several building names in the Naeuiweon. Finally, this document informs the concept of pharmaceutical terminology used in Joseon Dynasty.