• Title/Summary/Keyword: optical character

Search Result 279, Processing Time 0.033 seconds

Development of Smart Household Ledger based on OCR (OCR 기반 스마트 가계부 구현)

  • Chae, Sung-eun;Jung, Ki-seok;Lee, Jeong-yeol;Rho, Young-J.
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.18 no.6
    • /
    • pp.269-276
    • /
    • 2018
  • OCR(Optical Character Recognition) using computers has been developed for 20 years and applied to various fields such as parking management based on the recognition of license plates of cars. This technology was also used in the development of our smart OCR-based household ledger. In order to improve filling the purchase history into a smartphone based household account book, we can take pictures of receipts with the smarphone camera and automatically organize the purchase list. In this process, the recognition rate of the characters of the receipt image is not high enough with OCR technology. We could improve the rate by applying the image processing technology and adjusting the contrast of the receipt image. The rate improved from 89% to 92.5%.

Research and development of haptic simulator for Dental education using Virtual reality and User motion

  • Lee, Sang-Hyun
    • International Journal of Advanced Culture Technology
    • /
    • v.6 no.4
    • /
    • pp.52-57
    • /
    • 2018
  • The purpose of this paper is to develop simulations that can be used for virtual education in dentistry. The virtual education to be developed will be developed with clinical training and actual case data of tooth extraction. This development goal is to allow dental students to learn the necessary surgical techniques at the point of their choice, not going into the operating room, away from time, space, and physical limits. I want to develop content using VR. Oculus Rift HMD, Optical Based Outside-in Tracking System, Oculus Touch Motion Controller, and Headset as Input / Output Device. In this configuration, the optimization method is applied convergent, and when the operation of the VR contents is performed, the content data is extracted from the interaction analysis formed in the VR engine, and the data is processed by the content algorithm. It also computes events and dental operations generated within the 3D engine programming and generates corresponding events through data processing according to the input signal. The visualization information is output to the HMD using the rendering information. In addition, the operating room environment was constructed by studying lighting and material for actual operating room environment. We applied the ratio of actual space to virtual space and the ratio between character and actual person to create a spatial composition at a similar rate to actual space.

A Study on the Application of Knowledge-based Service in Procurement Engineering (구매엔지니어링을 위한 지식기반 서비스 적용 방안에 관한 연구)

  • Kim, Jinil;Cha, Jaemin;Shin, Joonguk;Yeum, Choongseup
    • Journal of the Korean Society of Systems Engineering
    • /
    • v.14 no.2
    • /
    • pp.67-72
    • /
    • 2018
  • In the EPC(Engineering Procurement and Construction) project of the plant, procurement engineering has a profound effect on the profitability of the project. It is important that the procurement specifications are well written to ensure that procurement engineering works properly. In the meantime, the procurement specifications have been created by the experience of the person in charge because there was no system for helping procurement engineering. To cope with this situation, we are developing a procurement engineering management support system (PeMSS). This paper describes how to implement a knowledge-based service in the procurement engineering management support system. First, we briefly introduce the PeMSS, the knowledge base application field, and how to apply it. The parts that requires knowledge-based service are parsing the requirements in the PDF (Portable Document Format) file and management of the document provided by the supplier of the equipment.

Credit Card Number Recognition for People with Visual Impairment (시력 취약 계층을 위한 신용 카드 번호 인식 연구)

  • Park, Dahoon;Kwon, Kon-Woo
    • Journal of IKEEE
    • /
    • v.25 no.1
    • /
    • pp.25-31
    • /
    • 2021
  • The conventional credit card number recognition system generally needs a card to be placed in a designated location before its processing, which is not an ideal user experience especially for people with visual impairment. To improve the user experience, this paper proposes a novel algorithm that can automatically detect the location of a credit card number based on the fact that a group of sixteen digits has a fixed aspect ratio. The proposed algorithm first performs morphological operations to obtain multiple candidates of the credit card number with >4:1 aspect ratio, then recognizes the card number by testing each candidate via OCR and BIN matching techniques. Implemented with OpenCV and Firebase ML, the proposed scheme achieves 77.75% accuracy in the credit card number recognition task.

An End-to-End Sequence Learning Approach for Text Extraction and Recognition from Scene Image

  • Lalitha, G.;Lavanya, B.
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.7
    • /
    • pp.220-228
    • /
    • 2022
  • Image always carry useful information, detecting a text from scene images is imperative. The proposed work's purpose is to recognize scene text image, example boarding image kept on highways. Scene text detection on highways boarding's plays a vital role in road safety measures. At initial stage applying preprocessing techniques to the image is to sharpen and improve the features exist in the image. Likely, morphological operator were applied on images to remove the close gaps exists between objects. Here we proposed a two phase algorithm for extracting and recognizing text from scene images. In phase I text from scenery image is extracted by applying various image preprocessing techniques like blurring, erosion, tophat followed by applying thresholding, morphological gradient and by fixing kernel sizes, then canny edge detector is applied to detect the text contained in the scene images. In phase II text from scenery image recognized using MSER (Maximally Stable Extremal Region) and OCR; Proposed work aimed to detect the text contained in the scenery images from popular dataset repositories SVT, ICDAR 2003, MSRA-TD 500; these images were captured at various illumination and angles. Proposed algorithm produces higher accuracy in minimal execution time compared with state-of-the-art methodologies.

A Study on the Prediction for the OCR Technology Development Trajectory based on the Patent and Article Information (특허와 논문정보를 활용한 OCR 기술발전 동향예측에 관한 연구)

  • Won Jun, Kim;Sang Kon, Lee;Sung Kuk, Pyo
    • Journal of Information Technology Services
    • /
    • v.21 no.6
    • /
    • pp.39-51
    • /
    • 2022
  • As the 4th Industrial Revolution emerged as a key to improving national competitiveness, OCR technology, one of the major technologies in the 4th industry is in the spotlight. Since characters in various images contain a lot of information, OCR technology for recognizing these characters has evolved into technology used in many industries. In this paper, trends in OCR technology were identified and predicted using thesis data published in 'RISS' and patent data by International patent classification (IPC) under the theme of Optical character recognition (OCR). For patent data 20,000 patents related to OCR technology from 2002 to 2020 were used as data, and 432 papers from 2012 to 2022 were used as data. Through time-series analysis, each patent data and thesis data were investigated since when OCR technology has developed, and various keyword analysis predicted which technology will be used in the future. Finally, the direction of future OCR technology development was presented through network association analysis with patent data and thesis data.

A Real-time Bus Arrival Notification System for Visually Impaired Using Deep Learning (딥 러닝을 이용한 시각장애인을 위한 실시간 버스 도착 알림 시스템)

  • Seyoung Jang;In-Jae Yoo;Seok-Yoon Kim;Youngmo Kim
    • Journal of the Semiconductor & Display Technology
    • /
    • v.22 no.2
    • /
    • pp.24-29
    • /
    • 2023
  • In this paper, we propose a real-time bus arrival notification system using deep learning to guarantee movement rights for the visually impaired. In modern society, by using location information of public transportation, users can quickly obtain information about public transportation and use public transportation easily. However, since the existing public transportation information system is a visual system, the visually impaired cannot use it. In Korea, various laws have been amended since the 'Act on the Promotion of Transportation for the Vulnerable' was enacted in June 2012 as the Act on the Movement Rights of the Blind, but the visually impaired are experiencing inconvenience in using public transportation. In particular, from the standpoint of the visually impaired, it is impossible to determine whether the bus is coming soon, is coming now, or has already arrived with the current system. In this paper, we use deep learning technology to learn bus numbers and identify upcoming bus numbers. Finally, we propose a method to notify the visually impaired by voice that the bus is coming by using TTS technology.

  • PDF

A study on the trader-centered blockchain-based bill of lading (거래자 중심의 블록체인 기반 선하증권 연구)

  • Lee, Ju-Young;Kim, Hyun-A;Sung, Chae-Min;Kim, Joung-Min
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2021.11a
    • /
    • pp.1353-1356
    • /
    • 2021
  • 블록체인은 다수의 노드 네트워크 내에서 거래내역을 분산 저장함으로써 투명성을 확보하는 기술이다. 최근에는 금전적 가치를 지닌 선하증권(Bill of Lading, B/L 서류)에 블록체인을 적용하여 무결성을 확보하고 거래 과정을 간소화 하기위한 연구가 진행되고 있다. 본 논문에서는 거래자 중심의 블록체인 기반의 선하증권 시스템을 제안한다. 수출자는 발행 받은 선하증권을 AI(Artificial intelligence)기반의 OCR(Optical character recognition)기능을 통해 블록체인에 등록하고, 각국 은행에서 열람하여 신용장거래를 진행한다. 수입자는 선하증권 정보를 담은 QR(Quick Response code)코드로 자기증명을 하여 물품을 인도 받게 된다. 이는 수출자 측에서는 선적서류를 우편으로 보낼 시간과 비용을 단축하고, 서류의 무결성을 입증할 수 있다는 점에서 큰 효과를 얻을 수 있다. 수입자 측에서는 서류가 등록됨과 동시에 확인할 수 있고, 해당 거래를 신뢰할 수 있다는 이점을 갖는다. 마지막으로 은행 측에서는 선적서류에 대해 보안성을 갖출 수 있고 검증이 더 신속하게 이루어질 수 있다.

Drug identification application for aged group (노년층을 위한 의약품 식별 애플리케이션)

  • Cho, Hyunjun;Seo, Hyemin;Jung, Hwanhoon;Lim, Hyuk;Joo, Jong Wha J.
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2022.11a
    • /
    • pp.673-675
    • /
    • 2022
  • 우리 사회에서 개인이 복용하고 있는 약물의 종류와 수가 점점 늘어나고 있다. 약물의 사용이 증가하면서 때로는 치명적일 수 있는 약물 오남용 또한 빈번히 발생하고 있으며 특히 노년층과 같이 약품을 정확하게 구별할 수 없는 사람들은 더욱더 그 위험에 노출되어있다. 본 논문에서는 사용자가 간단한 사진을 찍는 행위를 거치면 약물의 정보를 제공하고, 복용법을 알 수 있는 모바일 애플리케이션에 관하여 기술한다. 이를 구현하기 위하여 세밀한 시각적 분류 (Fine-Grained Visual Categorization, FGVC) 기법과 광학 문자 인식 (Optical Character Recognition, OCR) 기법을 결합한 인공지능 모델을 사용하였으며, React Native 를 사용하여 운영체제에 종속되지 않도록 애플리케이션을 제안한다. 이 애플리케이션은 노년층에 친화된 UI/UX 로 디자인되었으며, 약물의 정보 제공 이외에도 개인 약물 관리, 주변 약국 길 찾기 등의 편의 기능을 통해 노년층에 삶의 질 향상을 기대할 수 있을 것이다.

Development of a Vegan Decipher System for the Social Vulnerable, such as the Low Vision Person and the Visually Impaired Person Using Optical Character Recognition (OCR) (광학 문자 인식(OCR)을 활용한 저시력자 및 시각장애인 등 사회적 약자를 위한 비건 판독 시스템 개발)

  • Hye-Rim OH;Ye-Na Kong;Jeong-Min Kim;Jea-Jun Choi
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2023.11a
    • /
    • pp.990-991
    • /
    • 2023
  • 커져만 가는 비건 시장에 비해서 비건 제품의 가격은 높고, 한정되어 있다. 성분표만을 보고 비건 여부를 파악하기에는 어렵고, 저시력자 및 시각장애인에게는 더욱 어려운 일이다. 치주 질환이나 당뇨를 포함한 크고 작은 다양한 질병으로 인해 육식 섭취 대신 불가피하게 채식을 실천해야 하는 경우 또는 가격 부담이 크고 찾기 어렵다. 그래서 비건 인증을 받은 제품 대신 일반 제품들 사이에서 비건에 적합한 제품을 찾는 데 도움이 되는 시스템을 개발하고자 한다. 본 논문에서는 저시력자 및 시각장애인을 위한 큰 글씨 화면, 음성 입출력 시스템 제공과 성분표 촬영을 통해 비건 적합 여부 및 알레르기 정보 제공, 사용자 특성 분석을 통한 UI 구성의 서비스를 제공한다. 성분표 촬영에 어려움을 겪는 저시력자 및 시각장애인에게 편리를 제공하기 위해 소프트웨어 뿐만 아니라 하드웨어를 구성한다.