• Title/Summary/Keyword: 문서지


Distortion Corrected Black and White Document Image Generation Based on Camera (카메라기반의 왜곡이 보정된 흑백 문서 영상 생성)

  • Kim, Jin-Ho
    • The Journal of the Korea Contents Association / v.15 no.11 / pp.18-26 / 2015
  • Document images captured with a camera instead of a scanner can contain geometric distortion and shadows caused by the capture angle. This paper proposes a camera-based algorithm that generates a clean black-and-white document image by correcting distortion and eliminating shadows. To correct geometric distortion, that is, to straighten boundary lines bent by radial lens distortion and to remove the outlying area included because of the camera direction, a document boundary detection method based on a second-derivative filter is developed. Black-and-white images are then generated by an adaptive binarization method that eliminates the shadow effect. Experiments on document images captured with a smartphone camera show very good results for both geometric distortion recovery and shadow elimination.
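
The abstract does not say which adaptive binarization rule the paper uses. As a rough illustration only, the sketch below assumes a simple local-mean threshold computed with an integral image; the function name `adaptive_binarize` and the parameters `window` and `offset` are hypothetical and are not taken from the paper.

```python
def adaptive_binarize(gray, window=3, offset=10):
    """Binarize a grayscale image (list of rows, values 0-255) with a local-mean threshold.

    Assumption for illustration: a pixel becomes black (0) when it is darker
    than the mean of its window x window neighbourhood minus `offset`;
    otherwise it becomes white (255).
    """
    h, w = len(gray), len(gray[0])
    # Integral image with a zero border: integ[y][x] = sum of gray[0:y][0:x].
    integ = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row_sum = 0
        for x in range(w):
            row_sum += gray[y][x]
            integ[y + 1][x + 1] = integ[y][x + 1] + row_sum
    r = window // 2
    out = [[255] * w for _ in range(h)]
    for y in range(h):
        y0, y1 = max(0, y - r), min(h, y + r + 1)
        for x in range(w):
            x0, x1 = max(0, x - r), min(w, x + r + 1)
            # Sum of the neighbourhood, clipped at the image border.
            total = integ[y1][x1] - integ[y0][x1] - integ[y1][x0] + integ[y0][x0]
            mean = total / ((y1 - y0) * (x1 - x0))
            if gray[y][x] < mean - offset:
                out[y][x] = 0
    return out
```

Because each pixel is compared with its own neighbourhood rather than with one global value, dark strokes can still be separated from the page in a region that a gradual shadow has darkened, which is the effect the abstract attributes to adaptive binarization.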

A Study on the Effective Strategy of Electronic Paper Resources Management (전자문서자원의 효율적 관리방안에 관한 연구)

  • 김은정
    • Journal of the Korea Society of Computer and Information / v.3 no.4 / pp.145-153 / 1998
  • Electronic paper resources management has become an important alternative for overcoming the inefficiency, dysfunctions, and pathologies that stem from bureaucratic paperwork and for enhancing organizational competitiveness. The Korean government and Korean enterprises have not sufficiently recognized the importance of electronic document management and have ignored the merits of introducing information technology into organizations. However, through electronic exchange of documents and papers, decision making and signatures, mail and bulletin board systems, and information disclosure and sharing, an electronic document management system can considerably enhance organizational performance. In this sense, the system should be introduced as soon as possible. Several measures are required: revising related laws and regulations, reforming the attitudes and behavior of personnel, establishing the technological basis of the system, and standardizing word processors and document formats.


EDI Document Processing System for Port Logistics (항만 물류처리를 위한 EDI 문서 처리 시스템)

  • Ham, Jong-Wan;Ban, Tae-Hak;Jung, Hoe-Kyung
    • Journal of the Korea Institute of Information and Communication Engineering / v.15 no.5 / pp.1081-1086 / 2011
  • Recently, the number of requests handled through EDI (Electronic Data Interchange) document processing systems for port logistics has increased rapidly. The existing system processes EDI documents with scripts, but because scripts are complex to write and document processing efficiency is low, it has not kept up with the increased processing demand. We therefore designed and implemented a system that processes documents in binary form instead of with scripts. We also developed 12 types of EDI documents used in port logistics. As a result, document processing speed improved twelvefold compared with the existing method, and the system is expected to be used for processing EDI documents in port logistics.

Sentence Cohesion & Subject driving Keywords Extraction for Document Classification (문서 분류를 위한 문장 응집도와 주어 주도의 주제어 추출)

  • Ahn Heui-Kook;Roh Hi-Young
    • Proceedings of the Korean Information Science Society Conference / 2005.07b / pp.463-465 / 2005
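  • In document classification, term frequency information, which is used as a feature to represent document content, is weak at representing the topic words of a document. It cannot express the purpose (meaning) for which a keyword is used in a sentence, nor whether the keyword was extracted from a sentence that is strongly cohesive with other sentences. Document classification based on this information is therefore limited in accuracy. To solve this document representation problem, this paper extracts the role of a keyword in its sentence (subject) as a feature when selecting keywords and derives a subject-driven information amount through a weighting scheme. In addition, the co-occurrence frequency of keywords within a sentence is extracted as a feature, and the degree of association between keywords across sentences is stored in a thesaurus, from which cohesion information is extracted. By determining a document's topic words from the combination of these two kinds of information, we aim to reduce the insertion of unnecessary keywords when extracting topic words for document classification and to provide a selection criterion for co-occurring keywords. Experiments confirmed that even a keyword that appears only once is more likely to be selected as a topic word when it is used as the subject driving a sentence and when its cohesion weight is high, and that finer-grained keyword scoring for document classification is possible. The selected topic words are therefore expected to improve the accuracy of document classification.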

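
The abstract gives no formulas, so the following is only a minimal sketch of how a subject-role weight and a co-occurrence-based cohesion score might be combined into one keyword score. The input format (sentences as lists of `(keyword, is_subject)` pairs), the function name `score_keywords`, and the weights `subject_weight` and `cohesion_weight` are all assumptions made for illustration, not the authors' scheme.

```python
from collections import Counter
from itertools import combinations


def score_keywords(sentences, subject_weight=2.0, cohesion_weight=0.5):
    """Score keywords from sentences given as lists of (keyword, is_subject) pairs."""
    role_score = Counter()  # occurrence count, boosted when the keyword is a subject
    cooc = Counter()        # sentence-level co-occurrence count per keyword pair
    for sent in sentences:
        for word, is_subject in sent:
            role_score[word] += subject_weight if is_subject else 1.0
        # Count each unordered pair of distinct keywords once per sentence.
        for a, b in combinations(sorted({w for w, _ in sent}), 2):
            cooc[(a, b)] += 1
    cohesion = Counter()    # total co-occurrence strength of each keyword
    for (a, b), n in cooc.items():
        cohesion[a] += n
        cohesion[b] += n
    return {w: role_score[w] + cohesion_weight * cohesion[w] for w in role_score}


# Hypothetical input: three sentences, with "parser" used once as a subject.
sentences = [
    [("parser", True), ("document", False)],
    [("document", False), ("format", False)],
    [("index", False)],
]
scores = score_keywords(sentences)
```

With these assumed weights, a keyword that occurs once as a subject ("parser") ends up with a higher score than one that occurs once in a non-subject role ("format"), which mirrors the tendency the experiments report.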

A New Approach to Active Documents and its Application (능동문서에 대한 새로운 접근법과 그 응용)

  • 남철기;배재학;장길상
    • Journal of KIISE:Software and Applications / v.30 no.3_4 / pp.347-357 / 2003
  • The Web is an important source of information, and most Web applications are based on form documents. HTML-based form documents, however, only play the role of user interfaces; they do not carry the procedures or rules of the business process that form designers assume. Yet form documents imply methods for treating documents, and this embedded procedural knowledge can be actively used to automate business processes. In this respect, we investigate the activeness of documents from the viewpoint of cognitive science in order to automate business processes based on form documents. Through this, we present a new concept of active documents and their applicability. Our active documents include business rules and declarative knowledge to support the automation of document processing. We also propose a processing framework for active documents. The framework has two phases: build-time and run-time. To demonstrate the usefulness of the proposed framework, a prototype called ActiveForm is designed and implemented for requisition processing. [...] them in an inference engine can enhance the intelligence of Internet applications.

Design of Security Mechanism for Electronic Document Repository System (전자문서 보관 시스템을 위한 보안 메커니즘 설계)

  • Kim, Jeom-Goo;Kim, Sang-Choon
    • Convergence Security Journal / v.11 no.3 / pp.99-111 / 2011
  • The costs of managing and storing paper documents are gradually increasing. In particular, keeping paper documents in a warehouse is too expensive. Paper-based document systems are also exposed to several security problems. A process for transforming paper documents into electronic ones is therefore much needed. An electronic document repository system is one of the best solutions to the problems of paper-based document systems: it can reduce overall costs and offers several advantages over a paper-based system. However, there is no formal methodology for guaranteeing the safety of an electronic document repository system. We therefore suggest a security mechanism for establishing an electronic document repository system. The suggested security methodology can help in designing a more secure electronic document repository system.

Ontology-based Method for automatic Knowledge Reasoning (온톨로지 기반 지식추론 기법)

  • 이정원;박세형;이언경;방건동;백두권
    • Proceedings of the Korean Information Science Society Conference / 2001.10a / pp.292-294 / 2001
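  • In departments responsible for product development, key personnel leave for various reasons. Each time this happens, a serious leakage of knowledge occurs at the product development site. One way to prevent this leakage is to store and manage the know-how of key personnel within the company as explicit knowledge. Departments responsible for product development hold a large number of documents. Quality management documents in particular are knowledge in which the know-how of key product development personnel is concentrated. Many companies have therefore introduced document management systems to manage and reuse the knowledge generated inside the company. However, even when a system that supports the sharing of design knowledge is in place, it often merely stores the design knowledge, so developers have to select the materials they need all over again. This is a burden for developers and is one factor that lowers the utilization of the available knowledge. To solve this problem, this paper classifies documents based on an ontology and automatically infers new knowledge from the keywords defined in the ontology, so that users can search with technical knowledge about the product, minimizing unnecessary search results and raising designers' knowledge utilization.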


Practical Page Segmentation using Connected Components and Color Information (연결요소와 색상정보를 이용한 실제적 문서영상 분할)

  • Kim, Pyeoung-Kee
    • The Transactions of the Korea Information Processing Society / v.7 no.1 / pp.273-285 / 2000
  • Although page segmentation is an important step in document recognition, there has not been much research on it. Further improvement is still needed in segmenting document elements in complicated or color documents. In this paper, I present a new page segmentation method that can segment pages with multiple columns, dotted lines, graphics, and photographs. I extract all connected components using contour following and combine them according to their size and positional information. Separate text location is performed on non-text color regions to extract possible text lines. To evaluate the proposed method, experiments were conducted on 180 documents. Four commercial OCR programs were also tested, and the proposed method showed the best result.

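
The paper extracts connected components by contour following and merges them using size and position. The sketch below is a simplified stand-in, not the paper's method: it labels 8-connected components with a breadth-first flood fill and then greedily merges horizontally adjacent bounding boxes. The function names and the `max_gap` parameter are hypothetical.

```python
from collections import deque


def connected_components(binary):
    """Return bounding boxes (top, left, bottom, right) of 8-connected foreground regions."""
    h, w = len(binary), len(binary[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for y in range(h):
        for x in range(w):
            if not binary[y][x] or seen[y][x]:
                continue
            # Flood-fill one component, tracking its bounding box.
            seen[y][x] = True
            queue = deque([(y, x)])
            top, left, bottom, right = y, x, y, x
            while queue:
                cy, cx = queue.popleft()
                top, bottom = min(top, cy), max(bottom, cy)
                left, right = min(left, cx), max(right, cx)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = cy + dy, cx + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and binary[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def merge_horizontally(boxes, max_gap=1):
    """Greedily merge boxes that overlap vertically and lie within max_gap columns.

    Simplification: each box is compared only with the most recently merged box.
    """
    merged = []
    for box in sorted(boxes, key=lambda b: b[1]):
        if merged:
            t, l, b, r = merged[-1]
            if box[1] - r - 1 <= max_gap and box[0] <= b and box[2] >= t:
                merged[-1] = (min(t, box[0]), l, max(b, box[2]), max(r, box[3]))
                continue
        merged.append(box)
    return merged
```

Merging by gap and vertical overlap is one plausible reading of "size and positional information"; a practical segmenter would likely also use component height and width to tell characters from graphics.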

Development of an Integrated EDI Document Generation System (통합적 EDI 문서생성 시스템의 개발)

  • Lee, Seung-Ik;Cho, Sung-Bae
    • Journal of KIISE:Computing Practices and Letters / v.6 no.3 / pp.339-347 / 2000
  • With the rapid progress in the construction of communication networks, there is an increasing need for individuals and enterprises to interchange documents electronically over these networks to enhance business efficiency. The UN has established and distributed standards for electronic documents to facilitate rapid and accurate business processing. In this paper, we present the design and implementation of an integrated system for processing documents that conform to the electronic document standards. The system consists of three subsystems: an EDI parser for checking syntax, an EDI document editor, and an EDI directory viewer for referencing the syntax of an EDI document. Using these tools together allows correct EDI documents to be composed more rapidly and conveniently.

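
The abstract does not describe how the EDI parser checks syntax. As a hedged illustration, the sketch below assumes the UN standard in question is UN/EDIFACT with its default service characters (`'` ends a segment, `+` separates data elements, `?` is the release character) and checks only the UNH/UNT message envelope. The function names and the sample message are hypothetical.

```python
def split_escaped(text, sep, release="?"):
    """Split text on sep, skipping separators escaped by the release character."""
    parts, buf, i = [], [], 0
    while i < len(text):
        ch = text[i]
        if ch == release and i + 1 < len(text):
            buf.append(text[i:i + 2])  # keep the escape pair intact for later levels
            i += 2
            continue
        if ch == sep:
            parts.append("".join(buf))
            buf = []
        else:
            buf.append(ch)
        i += 1
    parts.append("".join(buf))
    return parts


def parse_message(raw):
    """Parse one message into [tag, element, ...] lists and check the UNH/UNT envelope."""
    segments = [split_escaped(s.strip(), "+")
                for s in split_escaped(raw, "'") if s.strip()]
    errors = []
    if not segments or segments[0][0] != "UNH":
        errors.append("message must start with UNH")
    if not segments or segments[-1][0] != "UNT":
        errors.append("message must end with UNT")
    elif len(segments[-1]) < 3:
        errors.append("UNT needs a segment count and a reference")
    else:
        # UNT carries the number of segments in the message and the UNH reference.
        if segments[-1][1] != str(len(segments)):
            errors.append("UNT segment count mismatch")
        if (segments[0][0] == "UNH" and len(segments[0]) > 1
                and segments[-1][2] != segments[0][1]):
            errors.append("UNT reference does not match UNH")
    return segments, errors


# Illustrative message only; "PO?+42" contains an escaped "+" inside one element.
sample = "UNH+1+ORDERS:D:96A:UN'BGM+220+PO?+42+9'UNT+3+1'"
segments, errors = parse_message(sample)
```

A full parser would also validate each segment against the message directory, which is the role the abstract assigns to the EDI directory viewer.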

XML document editing system that is creation for structural digital document (구조화된 전자문서 생성을 위한 사용자 중심의 XML 문서편집 시스템)

  • 최일선;이용준;정회경
    • Journal of the Korea Institute of Information and Communication Engineering / v.7 no.3 / pp.513-518 / 2003
  • XML was established by the W3C in February 1998 as a solution to the shortcomings in document processing, exchange, and reusability that arose because the early Web used unstructured documents. Existing electronic transactions are changing into a form of e-business between companies based on XML message exchange. This has raised the need for a solution that can create the structured, XML-based electronic transaction documents used between companies. Accordingly, this paper studies an XML document editing system that integrates an XML Schema editor, for creating and editing the XML Schema documents that define the structure of XML documents, with a user-centered XML instance editor, so that structured XML-based electronic transaction documents can be written efficiently.