• Title/Summary/Keyword: Optical character recognition

Search Result 181, Processing Time 0.02 seconds

Character Segmentation with Segmentation Cost in Optical Character Recognition (문자 인식에서 분할 비용에 따른 문자 분할 연구)

  • Jung Minchul
    • Proceedings of the KAIS Fall Conference
    • /
    • 2004.06a
    • /
    • pp.179-181
    • /
    • 2004
  • 인쇄체 문자 인식에서 접합 문자는 주요한 에러 발생의 원인이다. 본 논문에서는 접합 문자를 분할하기 위해 두 개의 분할 비용을 정의한다. 첫째, 절단 비용은 한 패턴을 분할하는 데 얼마나 많은 블랙픽셀이 분리되어야 하는가이다. 둘째, 접선 비용은 분할선이 얼마나 많은 블랙 픽셀과 화이트 픽셀사이를 지나가는가이다. 폰트 분류기는 접합 문자의 후보 문자를 제공한다. 후보 문자의 문자 폭은 접합 문자를 분리하기 위한 기준선을 제공하며, 그 기준선 부근의 픽셀들이 분할 가능 영역을 나타낸다. 절단 비용의 최소값과 접선 비용의 최대값이 되는 지점이 최종적으로 접합 문자를 분할하는 위치이다. 이렇게 정의된 절단 비용과 접선 비용을 가지고 접합 문자를 분할하면 보다 정확한 문자 분할을 하여 문자 인식에서 에러 발생을 줄일 수 있다.

  • PDF

Manchu Script Letters Dataset Creation and Labeling

  • Aaron Daniel Snowberger;Choong Ho Lee
    • Journal of information and communication convergence engineering
    • /
    • v.22 no.1
    • /
    • pp.80-87
    • /
    • 2024
  • The Manchu language holds historical significance, but a complete dataset of Manchu script letters for training optical character recognition machine-learning models is currently unavailable. Therefore, this paper describes the process of creating a robust dataset of extracted Manchu script letters. Rather than performing automatic letter segmentation based on whitespace or the thickness of the central word stem, an image of the Manchu script was manually inspected, and one copy of the desired letter was selected as a region of interest. This selected region of interest was used as a template to match all other occurrences of the same letter within the Manchu script image. Although the dataset in this study contained only 4,000 images of five Manchu script letters, these letters were collected from twenty-eight writing styles. A full dataset of Manchu letters is expected to be obtained through this process. The collected dataset was normalized and trained using a simple convolutional neural network to verify its effectiveness.

Development of the automated gate system based on RFID/OCR in a container terminal (RFID/OCR 기반의 자동화 게이트시스템 개발)

  • Choi, Hyung-Rim;Park, Byung-Joo;Shin, Joong-Jo;Keceli, Yavuz;Lee, Jung-Hee
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.12 no.2
    • /
    • pp.37-48
    • /
    • 2007
  • In order to become a mega hub port, major ports all over the world are making every effort to enhance their productivity through efficiency of internal operation and introduction of the state-of-the-art technologies. They are not only installing various kinds of high-technology equipments but also introducing advanced technologies for the development of an effective gate system. Recently thanks to the appearance of RFID (radio frequency identification) and OCR (optical character recognition) technology, major container terminals are stewing up the automation of truck and container identification at the container luminal gate. This study aim to develop an automated gate system for identification task based on RFID and OCR technology. It will make mn effective gate operations in a container terminal.

  • PDF

The Verification System of the Customer Barcode for the Advanced Automatic Processing of the Mail Items (우편물 자도처리 촉진을 위한 우편용 고객 바코드 검증 시스템)

  • Park, Mun-Seong;Song, Jae-Gwan;U, Dong-Jin
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.4
    • /
    • pp.968-976
    • /
    • 1999
  • Currently, in the most mail automatic processing centers, after facing and canceling, envelope mail is passed through an Optical Character Recognition/Barcode Sorter(OCR/BS) to read the address and 3 of 5 fluorescent(luminescent) barcode is applied. Normally, 30%∼35% of this mail is rejected. The usual reasons for read failure are poor printing quality of address and barcode, script printing and failure to locate the address. This paper describes a verification system of the postal 3 of 5 customer barcode for solving this problem. The certification system of the 3 of 5 customer barcode consists of barcode verification system and postal address database. The purpose of certification system of the customer barcode verifies the postal 3 of 5 customer barcode and tests matching of mail piece postal address, and retrieves postal code.

  • PDF

Development of Korean-to-English and English-to-Korean Mobile Translator for Smartphone (스마트폰용 영한, 한영 모바일 번역기 개발)

  • Yuh, Sang-Hwa;Chae, Heung-Seok
    • Journal of the Korea Society of Computer and Information
    • /
    • v.16 no.3
    • /
    • pp.229-236
    • /
    • 2011
  • In this paper we present light weighted English-to-Korean and Korean-to-English mobile translators on smart phones. For natural translation and higher translation quality, translation engines are hybridized with Translation Memory (TM) and Rule-based translation engine. In order to maximize the usability of the system, we combined an Optical Character Recognition (OCR) engine and Text-to-Speech (TTS) engine as a Front-End and Back-end of the mobile translators. With the BLEU and NIST evaluation metrics, the experimental results show our E-K and K-E mobile translation equality reach 72.4% and 77.7% of Google translators, respectively. This shows the quality of our mobile translators almost reaches the that of server-based machine translation to show its commercial usefulness.

Expiration Date Notification System Based on YOLO and OCR algorithms for Visually Impaired Person (YOLO와 OCR 알고리즘에 기반한 시각 장애우를 위한 유통기한 알림 시스템)

  • Kim, Min-Soo;Moon, Mi-Kyung;Han, Chang-Hee
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.16 no.6
    • /
    • pp.1329-1338
    • /
    • 2021
  • There are rarely effective methods to help visually impaired people when they want to know the expiration date of products excepted to only Braille. In this study, we developed an expiration date notification system based on YOLO and OCR for visually impaired people. The handicapped people can automatically know the expiration date of a specific product by using our system without the help of a caregiver, fast and accurately. The proposed system is worked by four different steps: (1) identification of a target product by scanning its barcode; (2) segmentation of an image area with the expiration date using YOLO; (3) classification of the expiration date by OCR: (4) notification of the expiration date by TTS. Our system showed an average classification accuracy of about 86.00% when blindfolded subjects used the proposed system in real-time. This result validates that the proposed system can be potentially used for visually impaired people.

Implementation of ROS-Based Intelligent Unmanned Delivery Robot System (ROS 기반 지능형 무인 배송 로봇 시스템의 구현)

  • Seong-Jin Kong;Won-Chang Lee
    • Journal of IKEEE
    • /
    • v.27 no.4
    • /
    • pp.610-616
    • /
    • 2023
  • In this paper, we implement an unmanned delivery robot system with Robot Operating System(ROS)-based mobile manipulator, and introduce the technologies employed for the system implementation. The robot consists of a mobile robot capable of autonomous navigation inside the building using an elevator and a Selective Compliance Assembly Robot Arm(SCARA)-Type manipulator equipped with a vacuum pump. The robot can determines the position and orientation for picking up a package through image segmentation and corner detection using the camera on the manipulator. The proposed system has a user interface implemented to check the delivery status and determine the real-time location of the robot through a web server linked to the application and ROS, and recognizes the shipment and address at the delivery station through You Only Look Once(YOLO) and Optical Character Recognition(OCR). The effectiveness of the system is validated through delivery experiments conducted within a 4-story building.

Development of a Ship's Logbook Data Extraction Model Using OCR Program (OCR 프로그램을 활용한 선박 항해일지 데이터 추출 모델 개발)

  • Dain Lee;Sung-Cheol Kim;Ik-Hyun Youn
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.30 no.1
    • /
    • pp.97-107
    • /
    • 2024
  • Despite the rapid advancement in image recognition technology, achieving perfect digitization of tabular documents and handwritten documents still challenges. The purpose of this study is to improve the accuracy of digitizing the logbook by correcting errors by utilizing associated rules considered during logbook entries. Through this, it is expected to enhance the accuracy and reliability of data extracted from logbook through OCR programs. This model is to improve the accuracy of digitizing the logbook of the training ship "Saenuri" at the Mokpo Maritime University by correcting errors identified after Optical Character Recognition (OCR) program recognition. The model identified and corrected errors by utilizing associated rules considered during logbook entries. To evaluate the effect of model, the data before and after correction were divided by features, and comparisons were made between the same sailing number and the same feature. Using this model, approximately 10.6% of errors out of the total estimated error rate of about 11.8% were identified, and 56 out of 123 errors were corrected. A limitation of this study is that it only focuses on information from Dist.Run to Stand Course sections of the logbook, which contain navigational information. Future research will aim to correct more information from the logbook, including weather information, to overcome this limitation.

A Methodology for Urdu Word Segmentation using Ligature and Word Probabilities

  • Khan, Yunus;Nagar, Chetan;Kaushal, Devendra S.
    • International Journal of Ocean System Engineering
    • /
    • v.2 no.1
    • /
    • pp.24-31
    • /
    • 2012
  • This paper introduce a technique for Word segmentation for the handwritten recognition of Urdu script. Word segmentation or word tokenization is a primary technique for understanding the sentences written in Urdu language. Several techniques are available for word segmentation in other languages but not much work has been done for word segmentation of Urdu Optical Character Recognition (OCR) System. A method is proposed for word segmentation in this paper. It finds the boundaries of words in a sequence of ligatures using probabilistic formulas, by utilizing the knowledge of collocation of ligatures and words in the corpus. The word identification rate using this technique is 97.10% with 66.63% unknown words identification rate.

Korean Character Recognition with Tree Structure Using Representative Images (대표영상을 이용한 나무구조의 한글문자 인식)

  • 김정우;정수길;조웅호;김성용;김수중
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.31B no.4
    • /
    • pp.18-29
    • /
    • 1994
  • For the efficient recognition of Korean Alphabets, we proposed the tree structure algorithm which was based on K-tuple NRF-SDF using representative images as training images. Representative images consisted of ECP-SDF images of several consonants or vowels. To reduce the effect of sidelobe in the output correlation plane, we used the representative images as training images and obtained the elements of a vector inner product matrix using the peak value of AMPOF correlation of training images with one another. The proposed algorithm consisted of three main-step containing several substeps. In filter synthesis of each step, representative images were used as training images in the first and the second main-step and each consonant or vowel was used as training images in the third main-step. The performance of this algorithm is demonstrated by computer simulation and optical experiment.

  • PDF