• Title/Summary/Keyword: Document Processing System

Search Result 398, Processing Time 0.023 seconds

Automated Essay Grading: An Application For Historical Malay Text

  • Syed Mustapha, S.M.F.D;Idris, N.
    • Proceedings of the Korea Inteligent Information System Society Conference
    • /
    • 2001.01a
    • /
    • pp.237-245
    • /
    • 2001
  • Automated essay grading has been proposed for over thirty years. Only recently have practical implementations been constructed and tested. This paper investigated the role of the nearest-neighbour algorithm within the information retrieval as a way of grading the essay automatically called Automated Essay Grading System. It intended to offer teachers an individualized assistance in grading the student\`s essay. The system involved several processes, which are the indexing, the structuring of the model answer and the grade processing. The indexing process comprised the document indexing and query processing which are mainly used for representing the documents and the query. Structuring the model answer is actually preparing the marking scheme and the grade processing is the process of assessing the essay. To test the effectiveness of the developed algorithms, the algorithms are tested against the History text in Malay. The result showed that th information retrieval and the nearest-neighbour algorithm are practical combination that offer acceptable performance for grading the essay.

  • PDF

Development of Intelligent OCR Technology to Utilize Document Image Data (문서 이미지 데이터 활용을 위한 지능형 OCR 기술 개발)

  • Kim, Sangjun;Yu, Donghui;Hwang, Soyoung;Kim, Minho
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.05a
    • /
    • pp.212-215
    • /
    • 2022
  • In the era of so-called digital transformation today, the need for the construction and utilization of big data in various fields has increased. Today, a lot of data is produced and stored in a digital device and media-friendly manner, but the production and storage of data for a long time in the past has been dominated by print books. Therefore, the need for Optical Character Recognition (OCR) technology to utilize the vast amount of print books accumulated for a long time as big data was also required in line with the need for big data. In this study, a system for digitizing the structure and content of a document object inside a scanned book image is proposed. The proposal system largely consists of the following three steps. 1) Recognition of area information by document objects (table, equation, picture, text body) in scanned book image. 2) OCR processing for each area of the text body-table-formula module according to recognized document object areas. 3) The processed document informations gather up and returned to the JSON format. The model proposed in this study uses an open-source project that additional learning and improvement. Intelligent OCR proposed as a system in this study showed commercial OCR software-level performance in processing four types of document objects(table, equation, image, text body).

  • PDF

XML Document Editing System for Structural Processing of the Digital Document to Including Mathematical Formula (수식을 포함한 전자문헌의 구조적 처리를 위한 XML 문서편집시스템)

  • 윤화묵;유범종;김창수;정회경
    • Journal of the Korean Society for information Management
    • /
    • v.19 no.4
    • /
    • pp.96-111
    • /
    • 2002
  • A lot of accumulated data of many quantity exist within a institution or an organization, but most data is remained in form of standardization as each institution or organization. There are difficulty in exchange and share of information. New concept of knowledge information resource management to overcome this disadvantage was introduced, and the digitization of knowledge information resources to share and manage accumulated data is been doing. Specially, in science technic or education scholarship it, the tendency that importing XML to process necessary data to exchange and share of knowledge information resources structurally, and limitation of back for search and indexing or reusability is happened according as expression of great many mathematics used inside electron document of these sphere is processed to nonstructural data of image or text and so on. There is interest converged in processing of mathematics that use MathML to overcome this, and we require the solution to be able to process MathML easily and efficiently on structural document. In this paper, designed and implemented of XML document editing system which easy structural process of electronic document for knowledge information resources, and create and express MathML easily on structural document without expert knowledge about MathML.

The Data Processing System Development of Telemetry Ground System for Korean Space Launch Vehicle-1 (KSLV-1의 신호 수신.처리를 위한 원격측정 지상국시스템의 자료처리시스템 개발)

  • Ma, Jin-A;Kwon, Soon-Ho;Oh, Chang-Yul;Lee, Hyo-Keun
    • Aerospace Engineering and Technology
    • /
    • v.6 no.1
    • /
    • pp.245-254
    • /
    • 2007
  • The on-board telemetry system of KSLV-1 transmits telemetry signal for the launch vehicle and satellite to ground telemetry system in real time. In ground telemetry system, antenna system acquires telemetry signals and transfers these to data processing system. Data processing system processes and recordes telemetry data and distributes it to each mission operator in order to monitor it the operation goes well or not. This document describes the configurations and functions of data processing system designed for efficient and appropriate processing of telemetry data.

  • PDF

Retrieval Scheme of XML Documents Using Link Queries (링크 질의를 통한 XML 문서의 검색 기법)

  • Mun, Chan-Ho;Gang, Hyeon-Cheol
    • The KIPS Transactions:PartD
    • /
    • v.8D no.4
    • /
    • pp.313-326
    • /
    • 2001
  • The XML that was proposed as a next-generation standard for describing Web documents is widely used in various Web-based applications. In addition, XML documents on the Web link each other by hyperlinks. The current works on XML focus on the XML storage system that can efficiently store, manage, and retrieve XML documents. However, the research on the query language that supports the XML links and on the XML retrieval systems to process the XML links, is little conducted until now. In this paper, we propose an extension of an XML query language for expressing the XML link query and its processing scheme. A link query is to retrieve contents from an XML document (a query document) and from the XML documents (referenced documents) that are referred to by the links in the query document. As far as retrieving from the referenced documents is concerned, the current practice is to manually generate queries to get the partial results, and to repeat such a procedure. The purpose of link query processing in this paper is to eliminate the manual work altogether in getting the complete query result. The performance analysis shows that our link query processing strategy outperforms the conventional approach including the manual tasks. The more links to the referenced documents and the more referenced documents there are in the site storing the query document, the more query processing time decreases.

  • PDF

Purchase Information Extraction Model From Scanned Invoice Document Image By Classification Of Invoice Table Header Texts (인보이스 서류 영상의 테이블 헤더 문자 분류를 통한 구매 정보 추출 모델)

  • Shin, Hyunkyung
    • Journal of Digital Convergence
    • /
    • v.10 no.11
    • /
    • pp.383-387
    • /
    • 2012
  • Development of automated document management system specified for scanned invoice images suffers from rigorous accuracy requirements for extraction of monetary data, which necessiate automatic validation on the extracted values for a generative invoice table model. Use of certain internal constraints such as "amount = unit price times quantity" is typical implementation. In this paper, we propose a noble invoice information extraction model with improved auto-validation method by utilizing table header detection and column classification.

Design and Implementation of the REL Document Processing System Based on MPEG-21 Framework for Rights Management (저작권 관리를 위한 MPEG-21 프레임워크 기반의 REL 문서 처리 시스템 설계 및 구현)

  • Ko Kwang-San;Song Jong-Chul;Jung Hoe-Kyung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.9 no.3
    • /
    • pp.555-560
    • /
    • 2005
  • Problem of compatibility, contents protection, contents circulation etc. was caused between contents offerer and consumers by indiscreet rights technical development about digital contents. Therefore, MPEG-21 framework proposed the REL(Right Expression Language) based on XML(extensible Markup Language) by rights expression language for rights management. We designed and implemented of the REL document editing system that rights creation and edit are available efficiently about contents easily without comprehension of MPEG-21 framework and client system that present consumption method of REL document.

A Review on Needs and Related Law of Authorized Electronic Data Depot for the Diffusion in the Use of Electronic Document (전자문서 이용확산을 위한 공인 전자문서보관소의 도입 필요성과 입법내용)

  • Kim, Sun-Kwang
    • International Commerce and Information Review
    • /
    • v.6 no.1
    • /
    • pp.191-210
    • /
    • 2004
  • There are increasing needs for an individual or enterprise to interchange documents electronically through communication network to enhance the efficiency of business, owing to rapid process of transactions. But e-commerce encounters the problems regarding the handling the electronic documents, that is to say, deposit and proof of the electronic documents. This paper deals with Authorized Electronic Data Depot as an integrated system for processing, relaying and proving documents that. Authorized Electronic Data Depot operates as e-enabler in exchanging documents in trust among administrative agencies and a comprehensive government directory, digital government seal certification system and DNS system. Authorized Electronic Data Depot leads public and private sectors to save the paper-related costs. But the regulations concerning an authorized electronic data depot is introduced in the course of revision of Electronic Transactions Act. The purpose of this paper is to suggest some guidelines in legalizing the authorized electronic data depot.

  • PDF

Development of Production Management System for Medium and Small Companies (중소제조업을 위한 생산관리시스템의 개발)

  • 임수경;오근태
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.18 no.36
    • /
    • pp.193-203
    • /
    • 1995
  • The production management system software which can be applied In the medium and small companies is developed. Since the medium and small companies hesitate to install the computer system for production management owing to the lack of funds and the complexities of operation, the system is made to be used in PC and only requires minimum input data. This system is composed of interrelated modules for receive/order release, inventory management MRP, new product registration, document management and production scheduling. In this paper the architecture of the system, functions of each module, and information processing procedures of each funtion are discussed.

  • PDF

Design and Implementation of a Query Processor for Document Management Systems (문서관리시스템을 위한 질의처리기 설계 및 구현)

  • U, Jong-Won;Yun, Seung-Hyeon;Yu, Jae-Su
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.6
    • /
    • pp.1419-1432
    • /
    • 1999
  • The Document Management System(DMS) is a system which retrieves and manages library information efficiently. Since DMS manages the information using only one table, it does not need to provide join and view operations that spend high cost in traditional DBMS. In addition, DMs requires new operations because of their property. the operation has not been supported in existing DBMSs. In this paper we define a data language which represents the structure definition and process of data on the DMS. Especially we define Ranking and Proximity operation which is needed in Document Retrieval,. We also design and implement a query processor to process the query constructed with the data language. When the exiting query processors of relational DBMS are used as a query processor of DMS, they degrade the whole system performance. The proposed query processor not only overcomes such a problem but also supports new operation which is needed in DMS.

  • PDF