• Title/Summary/Keyword: Optical Character Recognition

Search Result 184, Processing Time 0.022 seconds

Implementation of Pre-Post Process for Accuraty Improvement of OCR Recognition Engine Based on Deep-Learning Technology (딥러닝 기반 OCR 인식 엔진의 정확도 향상을 위한 전/후처리기 기술 구현)

  • Jang, Chang-Bok;Kim, Ki-Bong
    • Journal of Convergence for Information Technology
    • /
    • v.12 no.1
    • /
    • pp.163-170
    • /
    • 2022
  • With the advent of the 4th Industrial Revolution, solutions that apply AI technology are being actively developed. Since 2017, the introduction of business automation solutions using AI-based Robotic Process Automation (RPA) has begun in the financial sector and insurance companies, and recently, it is entering a time when it spreads past the stage of introducing RPA solutions. Among the business automation using these RPA solutions, it is very important how accurately textual information in the document is recognized for business automation using various documents. Such character recognition has recently increased its accuracy by introducing deep learning technology, but there is still no recognition model with perfect recognition accuracy. Therefore, in this paper, we checked how much accuracy is improved when pre- and post-processor technologies are applied to deep learning-based character recognition engines, and implemented RPA recognition engines and linkage technologies.

Optical Recognition of Credit Card Numbers (신용카드 번호의 광학적 인식)

  • Jung, Min Chul
    • Journal of the Semiconductor & Display Technology
    • /
    • v.13 no.1
    • /
    • pp.57-62
    • /
    • 2014
  • This paper proposes a new optical recognition method of credit card numbers. Firstly, the proposed method segments numbers from the input image of a credit card. It uses the significant differences of standard deviations between the foreground numbers and the background. Secondly, the method extracts gradient features from the segmented numbers. The gradient features are defined as four directions of grayscale pixels for 16 regions of an input number. Finally, it utilizes an artificial neural network classifier that uses an error back-propagation algorithm. The proposed method is implemented using C language in an embedded Linux system for a high-speed real-time image processing. Experiments were conducted by using real credit card images. The results show that the proposed algorithm is quite successful for most credit cards. However, the method fails in some credit cards with strong background patterns.

Human Interface Software for Wireless and Mobile Devices (무선 이동 통신 기기용 휴먼인터페이스 소프트웨어)

  • Kim, Se-Ho;Lee, Chan-Gun
    • Journal of KIISE:Information Networking
    • /
    • v.37 no.1
    • /
    • pp.57-65
    • /
    • 2010
  • Recently, the character recognization technique is strongly needed to enable the mobile communication devices with cameras to gather input information from the users. In general, it is not easy to reuse a CBOCR(Camera Based Optical Character Recognizer) module because of its dependency on a specific platform. In this paper, we propose a software architecture for CBOCR module providing the easy adaptability to various mobile communication platforms. The proposed architecture is composed of the platform dependency support layer, the interface layer, the engine support layer, and the engine layer. The engine layer adopts a plug-in data structure to support various hardware endian policies. We show the effectiveness of the proposed method by applying the architecture to a practical product.

User Authentication System using OCR (광학문자인식을 이용한 사용자 인증 시스템)

  • Jeong, Pil-Seong;Cho, Yang-Hyun
    • Journal of the Korea Convergence Society
    • /
    • v.9 no.9
    • /
    • pp.15-22
    • /
    • 2018
  • As smart devices become popular, users can use authentication services in various methods. Authentication services include authentication using an ID and a password, authentication using a sms, and authentication using an OTP(One Time Password). This paper proposed an authentication system that solves the security problem of knowledge-based authentication using optical character recognition and can easily and quickly authenticate users. The proposed authentication system extracts a character from an uploaded image by a user and authenticates the user using the extracted character information. The proposed authentication system has the advantage of not using a password or an OTP that are easily exposed or lost, and can not be authenticated without using accurate photographs. The proposed authentication system is platform independent and can be used for user authentication, file encryption and decryption.

HANDWRITTEN HANGUL RECOGNITION MODEL USING MULTI-LABEL CLASSIFICATION

  • HANA CHOI
    • Journal of the Korean Society for Industrial and Applied Mathematics
    • /
    • v.27 no.2
    • /
    • pp.135-145
    • /
    • 2023
  • Recently, as deep learning technology has developed, various deep learning technologies have been introduced in handwritten recognition, greatly contributing to performance improvement. The recognition accuracy of handwritten Hangeul recognition has also improved significantly, but prior research has focused on recognizing 520 Hangul characters or 2,350 Hangul characters using SERI95 data or PE92 data. In the past, most of the expressions were possible with 2,350 Hangul characters, but as globalization progresses and information and communication technology develops, there are many cases where various foreign words need to be expressed in Hangul. In this paper, we propose a model that recognizes and combines the consonants, medial vowels, and final consonants of a Korean syllable using a multi-label classification model, and achieves a high recognition accuracy of 98.38% as a result of learning with the public data of Korean handwritten characters, PE92. In addition, this model learned only 2,350 Hangul characters, but can recognize the characters which is not included in the 2,350 Hangul characters

Automatic Generation of Training Character Samples for OCR Systems

  • Le, Ha;Kim, Soo-Hyung;Na, In-Seop;Do, Yen;Park, Sang-Cheol;Jeong, Sun-Hwa
    • International Journal of Contents
    • /
    • v.8 no.3
    • /
    • pp.83-93
    • /
    • 2012
  • In this paper, we propose a novel method that automatically generates real character images to familiarize existing OCR systems with new fonts. At first, we generate synthetic character images using a simple degradation model. The synthetic data is used to train an OCR engine, and the trained OCR is used to recognize and label real character images that are segmented from ideal document images. Since the OCR engine is unable to recognize accurately all real character images, a substring matching method is employed to fix wrongly labeled characters by comparing two strings; one is the string grouped by recognized characters in an ideal document image, and the other is the ordered string of characters which we are considering to train and recognize. Based on our method, we build a system that automatically generates 2350 most common Korean and 117 alphanumeric characters from new fonts. The ideal document images used in the system are postal envelope images with characters printed in ascending order of their codes. The proposed system achieved a labeling accuracy of 99%. Therefore, we believe that our system is effective in facilitating the generation of numerous character samples to enhance the recognition rate of existing OCR systems for fonts that have never been trained.

A Fast Algorithm for Korean Text Extraction and Segmentation from Subway Signboard Images Utilizing Smartphone Sensors

  • Milevskiy, Igor;Ha, Jin-Young
    • Journal of Computing Science and Engineering
    • /
    • v.5 no.3
    • /
    • pp.161-166
    • /
    • 2011
  • We present a fast algorithm for Korean text extraction and segmentation from subway signboards using smart phone sensors in order to minimize computational time and memory usage. The algorithm can be used as preprocessing steps for optical character recognition (OCR): binarization, text location, and segmentation. An image of a signboard captured by smart phone camera while holding smart phone by an arbitrary angle is rotated by the detected angle, as if the image was taken by holding a smart phone horizontally. Binarization is only performed once on the subset of connected components instead of the whole image area, resulting in a large reduction in computational time. Text location is guided by user's marker-line placed over the region of interest in binarized image via smart phone touch screen. Then, text segmentation utilizes the data of connected components received in the binarization step, and cuts the string into individual images for designated characters. The resulting data could be used as OCR input, hence solving the most difficult part of OCR on text area included in natural scene images. The experimental results showed that the binarization algorithm of our method is 3.5 and 3.7 times faster than Niblack and Sauvola adaptive-thresholding algorithms, respectively. In addition, our method achieved better quality than other methods.

Understanding Smartphone-based Online Shopping Experiences and Behaviors of Blind Users

  • Park, Jihyuk;Han, Yeji;Oh, Uran
    • International journal of advanced smart convergence
    • /
    • v.9 no.3
    • /
    • pp.260-271
    • /
    • 2020
  • Smartphones provide blind users with screenreader as an accessibility tool. However, blind users often experience difficulties accessing online shopping malls via smartphones due to their inconsistent and image-based layouts. To enable screenreader users to get access to the detailed information about products while they are shopping online, we have developed BarrierFreeShop, an accessible mobile shopping application for people with visual impairments. BarrierFreeShop has three accessibility features: (1) layout automation, (2) review summarization, and (3) optical character recognition. We conducted a user study with 80 participants with visual impairments where they were asked to use BarrierFreeShop for a month. The findings revealed the effectiveness of our app in terms of speed and post interview feedback. We have also discovered typical shopping experiences that participants had during the test. This research suggests that computer vision technologies can improve accessibility issues in online shopping malls. In addition, we have confirmed that extracting contents from images help people with visual impairments to get better access to product information.

Optical Character Recognition System Using The Document Form Identification (문서 양식 식별을 이용한 광학 문자 인식 시스템)

  • Jung, Won-Gyo;Park, Sang-Sung;Shin, Young-Geun;Ahn, Dong-Kyu;Jang, Dong-Sik
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2008.06a
    • /
    • pp.155-161
    • /
    • 2008
  • 최근 들어 문서나 서류 등의 보관에 대한 중요성이 커짐에 따라 기존에 종이 형태로 관리하던 문서나 서류들을 편리하게 관리하기 위해 문서 전자화 시스템을 도입하고 있는 기업 및 기관들이 많아지고 있다. 과거에는 종이로 되어 있는 서류들을 전자화시키기 위해서 사람들이 해당 서류를 보고 컴퓨터에 데이터를 수작업으로 일일이 입력해야 하는 번거로움이 있었다. 현재는 이러한 번거로움을 줄이기 위해 문서나 서류를 스캔하고, 스캔한 이미지에서 광학문자 인식(OCR: Optical Character Recognition) 기술을 이용한 방법으로 종이 형태의 문서들을 전자화하고 있다. 그러나 OCR을 통해 문자 인식을 한 이후에도 인식된 전체 문서에서 필요한 부분과 필요하지 않은 부분을 일일이 수작업으로 분류해야 하는 번거로움이 있다는 것이 문제점으로 부각되고 있다. 본 논문에서는 이와 같은 문제점을 해결하기 위해 문서 양식과 인식이 필요한 부분을 미리 지정해 놓고 문자 인식을 하는 방법 및 시스템을 제안한다. 제안된 시스템은 문자 인식 속도를 향상시키고 보다 정확한 문자 인식이 가능하게 하여, 전체적으로 문자 인식의 효율을 향상시킬 수 있을 것이다. 또한 대량의 정형화된 문서의 문자 인식에도 효과적일 것으로 기대한다.

  • PDF

Design for Automation System for Pharmaceutical Prescription Using Arduino and Optical Character Recognition

  • Lim, Myung-Jae;Jung, Dong-Kun;Kim, Kyu-Dong;Kwon, Young-Man
    • International journal of advanced smart convergence
    • /
    • v.10 no.3
    • /
    • pp.66-71
    • /
    • 2021
  • Recent healthcare environments have characteristics of expanding the scope of healthcare-impacting healthcare, complexity resulting from diversification of components, and accelerating the pace of change. Drugs are used for the prevention, mitigation, and treatment of diseases, so they can inevitably cause harm, while they have efficacy and effectiveness, which are key elements of health recovery. Therefore, many countries regulate permits for safe and effective medicines, and also designate essential drugs directly related to life as pay targets and guarantee health insurance. Especially Pharmacist relying on manpower for composition medicine is liable for mal-manufacture due to combination of toxic medical substances or other chemical usage. In this paper, we focus on using Kiosk and Optical Character Recognition (OCR) for automated pharmacy to level up medical service and create labor friendly environment for pharmacist themselves through maintenance of prescription data and automated manufacturing solution. Presentation of drug substances and precautions will lead to efficient drug prescription and prevent misuse of information while auto manufacturing system efficiently maintain labor force and raise patient satisfaction level by reduction of waiting time.