• Title/Summary/Keyword: character module

Development of Intelligent OCR Technology to Utilize Document Image Data (문서 이미지 데이터 활용을 위한 지능형 OCR 기술 개발)

  • Kim, Sangjun;Yu, Donghui;Hwang, Soyoung;Kim, Minho
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2022.05a / pp.212-215 / 2022
  • In today's era of digital transformation, the need to build and use big data has grown across many fields. Most data today is produced and stored in forms suited to digital devices and media, but for a long time in the past data was produced and stored mainly in printed books. Optical Character Recognition (OCR) technology is therefore needed to turn the vast body of printed books accumulated over time into big data. This study proposes a system that digitizes the structure and content of the document objects in a scanned book image. The proposed system consists of three main steps: 1) recognizing the region of each document object (table, equation, picture, text body) in the scanned book image; 2) running OCR on each region with the text-body, table, or formula module that matches the recognized object type; 3) gathering the processed document information and returning it in JSON format. The proposed model is built on an open-source project with additional training and improvements. The proposed intelligent OCR system showed performance on par with commercial OCR software in processing four types of document objects (table, equation, image, text body).
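
The three steps in the abstract above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the region dictionaries, the `MODULES` table, and the four placeholder module functions are all hypothetical, and a real system would call a layout detector and OCR engines where the placeholders return fixed values.

```python
import json

# Hypothetical stand-ins for the per-object modules of step 2.
# Each would call a real recognizer; here they return fixed placeholders.
def read_text_body(bbox):
    return {"text": "<recognized text>"}

def read_table(bbox):
    return {"cells": [["<cell>"]]}

def read_formula(bbox):
    return {"latex": "<recognized formula>"}

def keep_picture(bbox):
    # Assumption: pictures are not OCR'd, only their location is kept.
    return {}

MODULES = {
    "text": read_text_body,
    "table": read_table,
    "equation": read_formula,
    "picture": keep_picture,
}

def digitize_page(regions):
    """Steps 2 and 3: dispatch each detected region, then gather as JSON.

    `regions` plays the role of step 1's output: a list of dicts with a
    "type" key and a "bbox" of (x, y, width, height).
    """
    objects = []
    for region in regions:
        entry = {"type": region["type"], "bbox": list(region["bbox"])}
        entry.update(MODULES[region["type"]](region["bbox"]))
        objects.append(entry)
    return json.dumps({"objects": objects}, ensure_ascii=False, indent=2)

page = [
    {"type": "text", "bbox": (40, 60, 500, 120)},
    {"type": "table", "bbox": (40, 200, 500, 180)},
]
print(digitize_page(page))
```

Keeping the dispatch in a table means a new object type only needs one more entry, which matches the abstract's description of separate modules per object type.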

Multi-modal Image Processing for Improving Recognition Accuracy of Text Data in Images (이미지 내의 텍스트 데이터 인식 정확도 향상을 위한 멀티 모달 이미지 처리 프로세스)

  • Park, Jungeun;Joo, Gyeongdon;Kim, Chulyun
    • Database Research / v.34 no.3 / pp.148-158 / 2018
  • Optical character recognition (OCR) is a technique for extracting and recognizing text from images. It is an important preprocessing step in data analysis, since most real-world text information is embedded in images. Many OCR engines achieve high recognition accuracy on images where the text is clearly separable from the background, such as black lettering on a white background. However, their accuracy is low on images where the text is not easily separable from a complex background. To improve accuracy on such images, the input image must be transformed to make the text more noticeable. In this paper, we propose a method that segments an input image into text lines so that OCR engines can recognize each line more efficiently, and that determines the final output by comparing the recognition rates of a CLAHE module and a Two-step module, both of which distinguish text from background regions using image processing techniques. Through thorough experiments comparing against the well-known OCR engines Tesseract and ABBYY, we show that the proposed method has the best recognition accuracy on images with complex backgrounds.
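
The selection step described above, choosing per text line between the outputs of two preprocessing branches, might look like the sketch below. It assumes that the "recognition rate" is available as a confidence score between 0 and 1 for each branch; the branch names and the data layout are hypothetical, and the CLAHE and Two-step preprocessing themselves are not shown.

```python
def pick_best(candidates):
    """Return (branch, text) for the branch with the highest confidence.

    `candidates` maps a branch name to a (text, confidence) pair.
    """
    branch = max(candidates, key=lambda name: candidates[name][1])
    return branch, candidates[branch][0]

def recognize_lines(lines):
    """Apply the selection independently to every segmented text line."""
    return [pick_best(candidates) for candidates in lines]

# Hypothetical OCR results for two text lines of one image.
lines = [
    {"clahe": ("SALE 50%", 0.91), "two_step": ("SALE 5O%", 0.62)},
    {"clahe": ("0pen daily", 0.48), "two_step": ("Open daily", 0.87)},
]
for branch, text in recognize_lines(lines):
    print(branch, text)
```

Selecting per line rather than per image lets each branch win where its preprocessing suits the local background.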

Design and Implementation of Visual Information Extraction System for Education (학습용 시각 정보 인식 시스템의 설계 및 구현)

  • Shin, Hyunkyung
    • Journal of The Korean Association of Information Education / v.16 no.4 / pp.483-488 / 2012
  • As mobile smart devices become widespread, they are increasingly used in school programs, and they are expected to become a very important part of educational equipment in the near future. For this reason, the department of education and science technology has announced a medium- and long-term project on education with smart devices, which is currently in the preparation stage, and various academic and industrial institutes have actively produced related research results and application prototypes. In this paper we propose a framework for the design and implementation of a visual context recognition system for educational use in school programs, built on a module that recognizes text embedded in images captured by the video camera of a mobile smart device. The proposed system consists of four modules (image acquisition, image processing, information extraction, and knowledge representation), which are explained in detail with practical examples.

A Study on the Production Planning and Management for Automated Clothing Manufacture (의류산업의 생산 자동화 현황과 그에 따른 생산기획 및 관리에 관한 연구)

  • 박진아;조진숙
    • Journal of the Korean Society of Clothing and Textiles / v.21 no.1 / pp.19-34 / 1997
  • The goals of this study are to offer guidance for automated clothing manufacture by analyzing the technology of automated manufacturing facilities, and to propose how to improve the efficiency of production planning and management for automated clothing manufacture. The study examined automated clothing manufacturing machines and analyzed the modules and functions of apparel information systems. To understand factory automation at larger clothing firms, the case study method was used, with three clothing firms as samples. The results and suggestions are as follows: 1. Information technology for automated clothing manufacture has enabled computer-integrated manufacturing systems to connect production planning and management with each workstation on the factory floor. 2. The production planning department uses an apparel information system, which integrates and manages manufacturing information from each workstation, and an apparel CAD system. The cutting room has automated machines such as an automatic spreading system and an automatic cutting system. The sewing room has a computer-controlled unit production system and semi-automated sewing machines. In the finishing room, an automatic packing machine and a press system are used, and a warehousing system has also been developed. Given these available technologies, improving production efficiency requires considering the specific character of each automatic machine and computer system and whether it suits each product style. 3. Most clothing manufacturers are at the stage of semi-automated manufacture. To improve the manufacturing environment, automation should proceed gradually, taking into account the firm's financial condition, existing facilities, and the staff who operate the machines.
The case study firms have a high degree of manufacturing automation. They can achieve a flexible manufacturing system by linking the information system with the manufacturing system of each workstation through computerized control. For firms that already use a computer-integrated manufacturing and management system, a function for handling drawing information needs to be added to the existing module of the apparel system.

A Study on Stereo Matching Algorithm using Disparity Space Image (시차공간영상을 이용한 스테레오 영상 정합에 관한 연구)

  • Lee, Jong-Min;Kim, Dae-Hyun;Choi, Jong-Soo
    • Journal of the Institute of Electronics Engineers of Korea SP / v.41 no.6 / pp.9-18 / 2004
  • This paper proposes a new and simple stereo matching algorithm using the disparity space image (DSI) technique. First, we detect salient feature points on each scan-line of the image pair, set the matching area using those points, and define a simple cost matrix. We match pixel by pixel instead of using a matching window. Pixel-by-pixel matching speeds up the process but, because it uses no neighborhood information, may reduce matching accuracy. To compensate, we expand the matching path using the characteristics of the disparity space image so that neighborhood information is taken into account. In addition, we devise a compensated matching module that uses the volume of the disparity space image to improve matching accuracy. Consequently, we can reduce mismatches at disparity discontinuities and obtain a more detailed and accurate disparity map.
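
As a rough illustration of the disparity space image mentioned above, the sketch below builds a DSI for a single scan-line pair using an absolute-difference pixel cost, then picks the cheapest disparity per pixel. The cost function and the winner-take-all selection are assumptions made for brevity; the paper's feature-point matching areas, path expansion, and compensated matching module are not reproduced here.

```python
def disparity_space_image(left, right, max_disp):
    """Build dsi[d][x] = |left[x] - right[x - d]| for one scan-line.

    Entries where x - d falls outside the right scan-line stay infinite.
    """
    width = len(left)
    dsi = [[float("inf")] * width for _ in range(max_disp + 1)]
    for d in range(max_disp + 1):
        for x in range(d, width):
            dsi[d][x] = abs(left[x] - right[x - d])
    return dsi

def winner_take_all(dsi):
    """Pick, for every pixel, the disparity with the lowest cost."""
    width = len(dsi[0])
    return [min(range(len(dsi)), key=lambda d: dsi[d][x]) for x in range(width)]

# A bright feature at x = 2..3 in the left line appears at x = 0..1 in the right.
left = [10, 10, 80, 90, 10, 10]
right = [80, 90, 10, 10, 10, 10]
disparities = winner_take_all(disparity_space_image(left, right, max_disp=2))
print(disparities)
```

Because each pixel is decided on its own, this baseline shows the weakness the abstract points out: without neighborhood information, flat regions can be matched arbitrarily.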

A Study on Montage and Expression Styles in Cut-scenes of Mobile Game (모바일 게임 컷신의 몽타주와 표현 양식 연구)

  • Park, Jin-Ok
    • Journal of Digital Contents Society / v.18 no.1 / pp.55-62 / 2017
  • The background of this study is the trend in which mobile game genres are diversifying on the basis of advanced smart devices. However, since cut-scenes are still regarded as somewhat unnecessary elements in digital games, this study seeks to suggest the functions and range of uses of cut-scenes. The scope of the study is to classify the range of uses of cut-scenes in commercialized mobile games by genre, and to investigate the characteristics of visual expression in order to create a frame for the communicative styles of mobile games. The study found that different styles of cut-scene use can be identified in mobile games, just as in existing digital games; however, diverse attempts are not yet being made as in early digital games. Future study is needed on communication styles that match the characteristics of mobile game platforms, for which a module that can be applied when a new platform is introduced is required.

A Study on Korean-Style Motion Prototype and Animation (한국적 애니메이션과 Motion Prototype 연구)

  • Koh Jae-Sung;Bae Soo-Am;Cho Dae-Jea
    • Proceedings of the Korea Contents Association Conference / 2005.05a / pp.285-288 / 2005
  • If motion is universal across the differences between ancient and modern times and between primitiveness and civilization, it needs to be studied, since such study makes it possible to discover universally valid interpretive elements of various societies and cultural phenomena. The study of motion is considered a very important part of the foundation for universal motion studies in animation and character modelling. The sampled motion prototypes were classified by continuity and synchronicity, which form the basis of modular analysis and of motion flow. Koguryo's tomb murals are a heritage and symbol of the nation's historical identity and pride in Korean history. Koguryo is clearly a part of Korean history, and its elements are Korean. Accordingly, the mural paintings explicated in this study are the origin as well as the history of Korean visual animation, while the analysed and restored motion prototype is a module of Korean motion with golden section proportion.

Object detection in financial reporting documents for subsequent recognition

  • Sokerin, Petr;Volkova, Alla;Kushnarev, Kirill
    • International journal of advanced smart convergence / v.10 no.1 / pp.1-11 / 2021
  • Document page segmentation is an important step in building a quality optical character recognition module. The study examined existing work on page segmentation and focused on developing a segmentation model that has greater functional significance for use in an organization, as well as broad capabilities for managing the quality of the model. The main problems of document segmentation were highlighted, including complex backgrounds with intersecting objects. The detection classes included not only the classic text, table, and figure, but also additional types such as signature, logo, and table without borders (or with partially missing borders). This made it possible to pose the non-trivial task of detecting non-standard document elements. The authors compared existing neural network architectures for object detection based on published research data. The most suitable architecture was RetinaNet. To allow quality control of the model, a method based on neural network modeling using the RetinaNet architecture is proposed. During the study, several models were built, and their quality was assessed on the test sample using the mean Average Precision metric. The best result was shown by a model made up of four neural networks: the first focuses on detecting tables and tables without borders, the second on seals and signatures, the third on pictures and logos, and the fourth on text. The analysis showed that the approach based on four neural networks gave the best results on the test sample for most detection classes, in line with the objectives of the study. The method proposed in the article can be used to recognize other objects.
A promising direction for continuing the analysis is the segmentation of tables, where the functionally distinct areas of a table act as classes: heading, cell with a name, cell with data, empty cell.
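
One way the outputs of four specialized detectors could be combined is sketched below. The detector names, label strings, detection format, and score threshold are all hypothetical, and the detectors themselves (RetinaNet models in the paper) are represented only by their already-computed outputs.

```python
# Hypothetical mapping from each specialized detector to the classes it owns.
RESPONSIBLE = {
    "tables": {"table", "borderless_table"},
    "stamps": {"seal", "signature"},
    "graphics": {"picture", "logo"},
    "text": {"text"},
}

def merge_detections(outputs, min_score=0.5):
    """Keep each detector's boxes only for its own classes, best score first.

    `outputs` maps a detector name to a list of dicts with "label",
    "score", and "box" keys. `min_score` is an assumed cut-off.
    """
    merged = []
    for name, detections in outputs.items():
        for det in detections:
            if det["label"] in RESPONSIBLE[name] and det["score"] >= min_score:
                merged.append(det)
    return sorted(merged, key=lambda det: det["score"], reverse=True)

outputs = {
    "tables": [{"label": "table", "score": 0.93, "box": (30, 400, 560, 700)}],
    "stamps": [{"label": "signature", "score": 0.71, "box": (400, 720, 540, 780)},
               {"label": "text", "score": 0.88, "box": (0, 0, 10, 10)}],
    "graphics": [{"label": "logo", "score": 0.35, "box": (20, 20, 120, 80)}],
    "text": [{"label": "text", "score": 0.97, "box": (30, 100, 560, 380)}],
}
for det in merge_detections(outputs):
    print(det["label"], det["score"])
```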

Question Similarity Measurement of Chinese Crop Diseases and Insect Pests Based on Mixed Information Extraction

  • Zhou, Han;Guo, Xuchao;Liu, Chengqi;Tang, Zhan;Lu, Shuhan;Li, Lin
    • KSII Transactions on Internet and Information Systems (TIIS) / v.15 no.11 / pp.3991-4010 / 2021
  • The Question Similarity Measurement of Chinese Crop Diseases and Insect Pests (QSM-CCD&IP) aims to judge the user's tendency to ask questions regarding input problems. This measurement is the basis of agricultural knowledge question answering (Q&A) systems, information retrieval, and other tasks. However, the corpora and measurement methods available in this field have some deficiencies. In addition, error propagation may occur when general sentence-embedding methods ignore word boundary features and local context information. These factors make the task challenging. To address these problems, a corpus on Chinese crop diseases and insect pests (CCDIP), which contains 13 categories, was established. Then, taking the CCDIP as the research object, this study proposes a Chinese agricultural text similarity matching model, AgrCQS, based on mixed information extraction. Specifically, the hybrid embedding layer enriches character information and improves the model's ability to recognize word boundaries. Multi-scale local information is extracted by a multi-core convolutional neural network based on multi-weight (MM-CNN). The self-attention mechanism enhances the model's ability to fuse global information. The performance of AgrCQS is verified on the CCDIP and on three benchmark datasets, namely AFQMC, LCQMC, and BQ. The accuracy rates are 93.92%, 74.42%, 86.35%, and 83.05%, respectively, which are higher than those of baseline systems without using any external knowledge. Additionally, the modules of the proposed method can be extracted separately and applied to other models, providing a reference for related research.

Embedded Multi-LED Display System based on Wireless Internet using Otsu Algorithm (오츠 알고리즘을 활용한 무선인터넷 기반 임베디드 다중 LED 전광판 시스템)

  • Jang, Ho-Min;Kim, Eui-Ryong;Oh, Se-Chun;Kim, Sin-Ryeong;Kim, Young-Gon
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.16 no.6 / pp.329-336 / 2016
  • In outdoor advertising and at industrial sites, there are efforts to implement LED electronic display board systems based on image processing in order to convey a variety of messages in real time. Recently, in various fields, intuitive communication using images has become more important than simple text. Thus, rather than simply outputting entered information, a system that can output information in real time is being sought. The system therefore aims to overcome the problem that many conventional LED displays cannot show images, by converting images into a format that those displays can output. Using low-power LEDs, it was developed to output messages and images efficiently within limited resources. This paper presents a system capable of managing LED displays over a wireless network. Built from an ATmega2560, a Wi-Fi module, a server, and an Android client application, the system outputs more than text alone, and because the server manages the process of converting images for character-style output, the load generated by image output is reduced.
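
The title names the Otsu algorithm, which picks a global threshold for turning a grayscale image into a two-level one. The sketch below is a generic, pure-Python version of Otsu's method followed by a binarization step; how the paper actually applies it on the server or the ATmega2560 is not stated in the abstract, so the `to_led_frame` helper and the on/off LED mapping are assumptions.

```python
def otsu_threshold(pixels):
    """Return the threshold t that maximizes between-class variance.

    `pixels` is a flat list of integers in 0..255; values <= t form one
    class and values > t the other.
    """
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(i * h for i, h in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0 = 0      # number of pixels at or below the candidate threshold
    sum0 = 0    # intensity sum of those pixels
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        w1 = total - w0
        if w0 == 0:
            continue
        if w1 == 0:
            break
        m0 = sum0 / w0
        m1 = (sum_all - sum0) / w1
        var = w0 * w1 * (m0 - m1) ** 2
        if var > best_var:
            best_var, best_t = var, t
    return best_t

def to_led_frame(rows):
    """Binarize a grayscale image (list of rows) into 1 = LED on, 0 = off."""
    t = otsu_threshold([p for row in rows for p in row])
    return [[1 if p > t else 0 for p in row] for row in rows]

image = [[10, 12, 200],
         [11, 210, 205]]
print(otsu_threshold([p for row in image for p in row]))  # 12
print(to_led_frame(image))  # [[0, 0, 1], [0, 1, 1]]
```

A single pass over a 256-bin histogram keeps memory use small, which suits devices with limited resources.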