A Study on the Improvement of Tesseract-based OCR Model Recognition Rate using Ontology

Hwang, Chi-gon; Yun, Dai Yeol; Yoon, Chang-Pyo

Proceedings of the Korean Institute of Information and Communication Sciences Conference (한국정보통신학회:학술대회논문집)

2021.05a / Pages 438-440 / 2021

The Korea Institute of Information and Communication Engineering (한국정보통신학회)


Hwang, Chi-gon (Kwangwoon University) ;
Yun, Dai Yeol (Kwangwoon University) ;
Yoon, Chang-Pyo (GyeongGi University of Science and Technology)


Published : 2021.05.03


Abstract

With advances in machine learning, artificial intelligence techniques are being applied in a growing range of fields. One of them is optical character recognition (OCR), which converts characters in images into text. Tesseract, developed by HP, is one such OCR engine. However, its recognition rate for characters in images is still low. To address this, we aim to improve the text conversion rate for images through a post-processing step that uses an ontology to recognize context.
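The abstract does not describe how the ontology-based post-processing works, so the following is only a minimal illustrative sketch and not the authors' method. It assumes a toy ontology represented as a dictionary from concept to related terms (`ONTOLOGY`, its concepts, and its terms are all hypothetical), infers the context from the tokens that OCR recognized correctly, and replaces out-of-vocabulary tokens with the closest term from that context using the standard-library `difflib.get_close_matches`. The function name `correct_tokens` and the `cutoff` value are likewise assumptions chosen for illustration.

```python
from difflib import get_close_matches

# Hypothetical toy ontology: concept -> terms related to that concept.
ONTOLOGY = {
    "invoice": {"invoice", "total", "amount", "tax", "date"},
    "medicine": {"tablet", "dose", "daily"},
}


def infer_context(tokens, ontology):
    """Return the concept whose terms match the most tokens exactly, or None."""
    best_concept, best_hits = None, 0
    for concept, terms in ontology.items():
        hits = sum(1 for token in tokens if token in terms)
        if hits > best_hits:
            best_concept, best_hits = concept, hits
    return best_concept


def correct_tokens(tokens, ontology, cutoff=0.7):
    """Replace tokens that are unknown to the ontology with the closest
    term from the inferred context; leave everything else unchanged."""
    vocabulary = set().union(*ontology.values())
    context = infer_context(tokens, ontology)
    # Prefer terms from the inferred context; fall back to the whole vocabulary.
    candidates = sorted(ontology[context]) if context else sorted(vocabulary)

    corrected = []
    for token in tokens:
        if token in vocabulary:
            corrected.append(token)  # already a known term
            continue
        matches = get_close_matches(token, candidates, n=1, cutoff=cutoff)
        corrected.append(matches[0] if matches else token)
    return corrected


if __name__ == "__main__":
    # "tota1" stands in for an OCR misreading of "total" (digit 1 for letter l).
    print(correct_tokens(["invoice", "tota1", "amount"], ONTOLOGY))
```

In a real pipeline the token list would come from the OCR engine's output, and the ontology would typically be loaded from an OWL or RDF source rather than hard-coded. String similarity is only one possible matching criterion; it is used here because it needs nothing beyond the standard library.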
