• Title/Summary/Keyword: 광학문자 인식

Search Result 82, Processing Time 0.031 seconds

Improving Korean Character Recognition Rate based on the Cell Clustering Information (셀들의 군집 정보를 이용한 한글 문자 인식률 향상 기법 연구)

  • Shin, Woojun;Ko, Yoonsik;Lim, Youngtaek;Yoon, Youngsu;Park, Heewan
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2015.04a
    • /
    • pp.810-812
    • /
    • 2015
  • 문자인식 즉 OCR(Optical Character Recognition)기술은 광학적으로 인식할 수 있는 문자를 컴퓨터가 읽을 수 있도록 하는 기술을 뜻한다. 문자인식의 근간이 되는 방법은 스트링 매칭 기법이 사용되어 왔지만 한글의 경우 자음, 모음, 자음 조합으로 만 가지 유형이 넘고, 더욱이 상용한자와 영어를 섞어 쓰기 때문에 오인식되는 경우가 많다. 본 논문에서는 한글이 수직선, 수평선, 사선과 같이 방향성이 강한 선소들로 구성되어 있다는 점을 이용하여 한글의 인식률을 높이는 방법을 제안하였다.

Automatic Notification System of Expiration Date Based on YOLO and OCR algorithm for Blind Person (시각 장애우를 위한 YOLO와 OCR 알고리즘 기반의 유통기한 자동 알림 시스템)

  • Kim, Min-Soo;Moon, Mi-kyung;Han, Chang-hee
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2021.07a
    • /
    • pp.697-698
    • /
    • 2021
  • 본 논문에서는 시각 장애우의 식품 안전성 증진을 위해 광학 문자 인식 (optical character recognition, OCR) 및 실시간 객체 인식 (you only look once, YOLO) 알고리즘에 기반한 식품의 유통기한 자동 알림 시스템을 제안한다. 제안하는 시스템은 1) 스마트폰 카메라를 통해 실시간으로 입력되는 영상에서 YOLO 알고리즘을 활용하여 유통기한으로 예측되는 이미지 영역을 검출하고, 2) 검출된 영역에서 OCR 알고리즘을 활용하여 유통기한 데이터를 추출하며, 3) 최종 추출된 유통기한 데이터를 음성으로 변환하여 시각 장애우에게 전달한다. 개발된 시스템은 유통기한 정보를 추출해서 사용자에게 전달하기까지 평균 약 7초 이내의 빠른 응답 속도를 보였으며, 62.8%의 객체 인식 정확도와 93.6%의 문자 인식 정확도를 보였다. 이러한 결과들은 제안하는 시스템을 시각 장애우들이 실용적으로 활용할 수 있다는 가능성을 보여준다.

  • PDF

Artificial intelligence wearable platform that supports the life cycle of the visually impaired (시각장애인의 라이프 사이클을 지원하는 인공지능 웨어러블 플랫폼)

  • Park, Siwoong;Kim, Jeung Eun;Kang, Hyun Seo;Park, Hyoung Jun
    • Journal of Platform Technology
    • /
    • v.8 no.4
    • /
    • pp.20-28
    • /
    • 2020
  • In this paper, a voice, object, and optical character recognition platform including voice recognition-based smart wearable devices, smart devices, and web AI servers was proposed as an appropriate technology to help the visually impaired to live independently by learning the life cycle of the visually impaired in advance. The wearable device for the visually impaired was designed and manufactured with a reverse neckband structure to increase the convenience of wearing and the efficiency of object recognition. And the high-sensitivity small microphone and speaker attached to the wearable device was configured to support the voice recognition interface function consisting of the app of the smart device linked to the wearable device. From experimental results, the voice, object, and optical character recognition service used open source and Google APIs in the web AI server, and it was confirmed that the accuracy of voice, object and optical character recognition of the service platform achieved an average of 90% or more.

  • PDF

An Implementation of a System for Video Translation on Window Platform Using OCR (윈도우 기반의 광학문자인식을 이용한 영상 번역 시스템 구현)

  • Hwang, Sun-Myung;Yeom, Hee-Gyun
    • Journal of Internet of Things and Convergence
    • /
    • v.5 no.2
    • /
    • pp.15-20
    • /
    • 2019
  • As the machine learning research has developed, the field of translation and image analysis such as optical character recognition has made great progress. However, video translation that combines these two is slower than previous developments. In this paper, we develop an image translator that combines existing OCR technology and translation technology and verify its effectiveness. Before developing, we presented what functions are needed to implement this system and how to implement them, and then tested their performance. With the application program developed through this paper, users can access translation more conveniently, and also can contribute to ensuring the convenience provided in any environment.

Trends in Deep Learning-based Medical Optical Character Recognition (딥러닝 기반의 의료 OCR 기술 동향)

  • Sungyeon Yoon;Arin Choi;Chaewon Kim;Sumin Oh;Seoyoung Sohn;Jiyeon Kim;Hyunhee Lee;Myeongeun Han;Minseo Park
    • The Journal of the Convergence on Culture Technology
    • /
    • v.10 no.2
    • /
    • pp.453-458
    • /
    • 2024
  • Optical Character Recognition is the technology that recognizes text in images and converts them into digital format. Deep learning-based OCR is being used in many industries with large quantities of recorded data due to its high recognition performance. To improve medical services, deep learning-based OCR was actively introduced by the medical industry. In this paper, we discussed trends in OCR engines and medical OCR and provided a roadmap for development of medical OCR. By using natural language processing on detected text data, current medical OCR has improved its recognition performance. However, there are limits to the recognition performance, especially for non-standard handwriting and modified text. To develop advanced medical OCR, databaseization of medical data, image pre-processing, and natural language processing are necessary.

A Study on Improved Label Recognition Method Using Deep Learning. (딥러닝을 활용한 향상된 라벨인식 방법에 관한 연구)

  • Yoo, Sung Geun;Cho, Sung Man;Song, Minjeong;Jeon, Soyeon;Lim, Song Won;Jung, Seokyung;Park, Sangil;Park, Gooman;Kim, Heetae;Lee, Daesung
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2018.05a
    • /
    • pp.447-448
    • /
    • 2018
  • 라벨인식과 같은 광학 문자 인식은 영상처리를 활용한 컴퓨터 비전의 대표적인 연구분야이다. 본 연구에서는 딥러닝 기반의 라벨인식 시스템을 고안하였다, 생산 라인에 적용되는 라벨인식 시스템은 인식 속도가 중요하기 때문에 기존의 R-CNN기반의 딥러닝 신경망보다 월등히 빠른 오브젝트 검출 시스템 YOLO를 활용하여 문자를 학습 및 인식 시스템을 개발하였다. 본 시스템은 기존 시스템에 근접하는 문자인식 정확도를 제공하고 자동으로 문자영역을 검출 가능하며, 라벨의 인쇄불량을 판독하도록 하였다. 또한 개발, 배포, 적용이 한번에 가능한 프레임워크를 통하여 생산현장에서 발생하는 다양한 이미지 처리에 활용될 전망이다.

Research on Korea Text Recognition in Images Using Deep Learning (딥 러닝 기법을 활용한 이미지 내 한글 텍스트 인식에 관한 연구)

  • Sung, Sang-Ha;Lee, Kang-Bae;Park, Sung-Ho
    • Journal of the Korea Convergence Society
    • /
    • v.11 no.6
    • /
    • pp.1-6
    • /
    • 2020
  • In this study, research on character recognition, which is one of the fields of computer vision, was conducted. Optical character recognition, which is one of the most widely used character recognition techniques, suffers from decreasing recognition rate if the recognition target deviates from a certain standard and format. Hence, this study aimed to address this limitation by applying deep learning techniques to character recognition. In addition, as most character recognition studies have been limited to English or number recognition, the recognition range has been expanded through additional data training on Korean text. As a result, this study derived a deep learning-based character recognition algorithm for Korean text recognition. The algorithm obtained a score of 0.841 on the 1-NED evaluation method, which is a similar result to that of English recognition. Further, based on the analysis of the results, major issues with Korean text recognition and possible future study tasks are introduced.

A Study on Optical Condition and preprocessing for Input Image Improvement of Dented and Raised Characters of Rubber Tires (고무타이어 문자열 입력영상 개선을 위한 전처리와 광학조건에 관한 연구)

  • 류한성;최중경;권정혁;구본민;박무열
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.6 no.1
    • /
    • pp.124-132
    • /
    • 2002
  • In this paper, we present a vision algorithm and method for input image improvement and preprocessing of dented and raised characters on the sidewall of tires. we define optical condition between reflect coefficient and reflectance by the physical vector calculate. On the contrary this work will recognize the engraved characters using the computer vision technique. Tire input images have all most same grey levels between the characters and backgrounds. The reflectance is little from a tire surface. therefore, it's very difficult segment the characters from the background. Moreover, one side of the character string is raised and the other is dented. So, the captured images are varied with the angle of camera and illumination. For optimum Input images, the angle between camera and illumination was found out to be with in 90$^{\circ}$. In addition, We used complex filtering with low-pass and high-pass band filters to improve input images, for clear input images. Finally we define equation reflect coefficient and reflectance. By doing this, we obtained good images of tires for pattern recognition.

Optical Character Recognition based Security Document Image File Management System (광학문자인식 기반 보안문서 이미지 파일 관리 시스템)

  • Jeong, Pil-Seong;Cho, Yang-Hyun
    • Journal of the Korea Convergence Society
    • /
    • v.10 no.3
    • /
    • pp.7-14
    • /
    • 2019
  • With the development of information and communication technology, we have been able to access and manage documents containing corporate information anytime and anywhere using smart devices. As the work environment changes to smart work, the scope of information distribution is expanded, and more efforts are needed to manage security. This paper proposes a file sharing system that enables users who have smart devices to manage and share files through mutual cooperation. Proposed file sharing system, the user can add a partner to share files with each other when uploading files kept by spliting the part of the file and the other uses an algorithm to store on the server. After converting the file to be uploaded to base64, it splits it into encrypted files among users, and then transmits it to the server when it wants to share. It is easy to manage and control files using dedicated application to view files and has high security. Using the system developed with proposed algorithm, it is possible to build a system with high efficiency even for SMEs(small and medium-sized enterprises) that can not pay much money for security.

Implementation of Deep Learning-Based Vehicle Model and License Plate Recognition System (딥러닝 기반 자동차 모델 및 번호판 인식 시스템 구현)

  • Ham, Kyoung-Youn;Kang, Gil-Nam;Lee, Jang-Hyeon;Lee, Jung-Woo;Park, Dong-Hoon;Ryoo, Myung-Chun
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2022.07a
    • /
    • pp.465-466
    • /
    • 2022
  • 본 논문에서는 딥러닝 영상인식 기술을 활용한 객체검출 모델인 YOLOv4를 활용하여 차량의 모델과 번호판인식 시스템을 제안한다. 본 논문에서 제안하는 시스템은 실시간 영상처리기술인 YOLOv4를 사용하여 차량모델 인식과 번호판 영역 검출을 하고, CNN(Convolutional Neural Network)알고리즘을 이용하여 번호판의 글자와 숫자를 인식한다. 이러한 방법을 이용한다면 카메라 1대로 차량의 모델 인식과 번호판 인식이 가능하다. 차량모델 인식과 번호판 영역 검출에는 실제 데이터를 사용하였으며, 차량 번호판 문자 인식의 경우 실제 데이터와 가상 데이터를 사용하였다. 차량 모델 인식 정확도는 92.3%, 번호판 검출 98.9%, 번호판 문자 인식 94.2%를 기록하였다.

  • PDF