Search | Korea Science

A Study of Image Enhancement Processing for Letter Extraction of Image Using Terahertz Signal (테라헤르츠 신호를 이용한 영상의 글자 추출을 위한 화질 개선처리에 대한 연구)

Kim, Seongyoon;Choi, Hyunkeun;Park, Inho;Kim, Youngseop;Lee, Yonghwan
- Journal of the Semiconductor & Display Technology
- /
- v.16 no.3
- /
- pp.111-115
- /
- 2017
Terahertz waves are superior to conventional X-ray or Magnetic Resonance Tomography(MRI), and the amount of information that can be transmitted is as large as thousands of times that conventional X-ray or MRI. In addition, Terahertz waves have great performance in analyzing an object which have some layered structure. By using this advantage, we can extract the letters of a page by analyzing information such as absorption amount and reflection amount by irradiating a closed book with pulses of various frequencies within gap of a terahertz wave. However, in the image of each page using the Terahertz wave might be obtained various kinds of noise and the different character occlusion region. So, to extract letters from the terahertz image, we must take the noise and occlusion region away. We have been working to enhancement the image quality in various ways, and keep on studying de-noising processing for enhancement about the image quality and high resolution. Finally, we also keep on studying about OCR(Optical Character Recognition) technology, which based on pattern matching technique, to read letters.
PDF

Development of Intelligent OCR Technology to Utilize Document Image Data (문서 이미지 데이터 활용을 위한 지능형 OCR 기술 개발)

Kim, Sangjun;Yu, Donghui;Hwang, Soyoung;Kim, Minho
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2022.05a
- /
- pp.212-215
- /
- 2022
In the era of so-called digital transformation today, the need for the construction and utilization of big data in various fields has increased. Today, a lot of data is produced and stored in a digital device and media-friendly manner, but the production and storage of data for a long time in the past has been dominated by print books. Therefore, the need for Optical Character Recognition (OCR) technology to utilize the vast amount of print books accumulated for a long time as big data was also required in line with the need for big data. In this study, a system for digitizing the structure and content of a document object inside a scanned book image is proposed. The proposal system largely consists of the following three steps. 1) Recognition of area information by document objects (table, equation, picture, text body) in scanned book image. 2) OCR processing for each area of the text body-table-formula module according to recognized document object areas. 3) The processed document informations gather up and returned to the JSON format. The model proposed in this study uses an open-source project that additional learning and improvement. Intelligent OCR proposed as a system in this study showed commercial OCR software-level performance in processing four types of document objects(table, equation, image, text body).
PDF

Recognition of Korean Menu for Online to Offline Stores : VGG-ResNet Fusion Model with Attention Mechanism (Online to Offline 상점을 위한 한글 메뉴판 인식 : 어텐션 메커니즘을 적용한 VGG-ResNet 융합 모델)

Jongwook Si;Sangjin Lee;Sungyoung Kim
- The Journal of Korea Institute of Information, Electronics, and Communication Technology
- /
- v.17 no.4
- /
- pp.190-197
- /
- 2024
The O2O store model dissolves the boundaries between online and offline platforms, providing significant convenience to customers. To effectively operate such platforms, small business owners must provide necessary information in digital format. Specifically, the process of digitizing Korean menus manually can lead to multiple issues, and the use of OCR technology often results in high error rates due to the low accuracy in recognizing Korean. In response, this paper proposes an enhanced OCR model based on the popular EasyOCR framework, aimed at improving the recognition accuracy of Korean. The proposed model integrates the structural advantages of VGG and ResNet, and incorporates an attention mechanism to significantly improve the recognition performance of Korean. Moreover, experimental results indicate that the proposed model achieved approximately a 3.5% improvement in accuracy and around a 1% improvement in both confidence score and normalized edit distance compared to EasyOCR. Therefore, this demonstrates that the proposed method effectively addresses the existing challenges.
https://doi.org/10.17661/jkiiect.2024.17.4.190 인용 PDF HTML

Character Recognition Based on Adaptive Statistical Learning Algorithm

K.C. Koh;Park, H.J.;Kim, J.S.;K. Koh;H.S. Cho
- 제어로봇시스템학회:학술대회논문집
- /
- 2001.10a
- /
- pp.109.2-109
- /
- 2001
In the PCB assembly lines, as components become more complex and smaller, the conventional inspection method using traditional ICT and function test show their limitations in application. The automatic optical inspection(AOI) gradually becomes the alternative in the PCB assembly line. In Particular, the PCB inspection machines need more reliable and flexible object recognition algorithms for high inspection accuracy. The conventional AOI machines use the algorithmic approaches such as template matching, Fourier analysis, edge analysis, geometric feature recognition or optical character recognition (OCR), which mostly require much of teaching time and expertise of human operators. To solve this problem, in this paper, a statistical learning based part recognition method is proposed. The performance of the ...
PDF

Improvement OCR Algorithm for Efficient Book Catalog RetrievalTechnology (효과적인 도서목록 검색을 위한 개선된 OCR알고리즘에 관한 연구)

HeWen, HeWen;Baek, Young-Hyun;Moon, Sung-Ryong
- Journal of the Institute of Electronics Engineers of Korea CI
- /
- v.47 no.1
- /
- pp.152-159
- /
- 2010
Existing character recognition algorithm recognize characters in simple conditional. It has the disadvantage that recognition rates often drop drastically when input document image has low quality, rotated text, various font or size text because of external noise or data loss. In this paper, proposes the optical character recognition algorithm which using bicubic interpolation method for the catalog retrieval when the input image has rotated text, blurred, various font and size. In this paper, applied optical character recognition algorithm consist of detection and recognition part. Detection part applied roberts and hausdorff distance algorithm for correct detection the catalog of book. Recognition part applied bicubic interpolation to interpolate data loss due to low quality, various font and size text. By the next time, applied rotation for the bicubic interpolation result image to slant proofreading. Experimental results show that proposal method can effectively improve recognition rate 6% and search-time 1.077s process result.
PDF KSCI

Development of a Ship's Logbook Data Extraction Model Using OCR Program (OCR 프로그램을 활용한 선박 항해일지 데이터 추출 모델 개발)

Dain Lee;Sung-Cheol Kim;Ik-Hyun Youn
- Journal of the Korean Society of Marine Environment & Safety
- /
- v.30 no.1
- /
- pp.97-107
- /
- 2024
Despite the rapid advancement in image recognition technology, achieving perfect digitization of tabular documents and handwritten documents still challenges. The purpose of this study is to improve the accuracy of digitizing the logbook by correcting errors by utilizing associated rules considered during logbook entries. Through this, it is expected to enhance the accuracy and reliability of data extracted from logbook through OCR programs. This model is to improve the accuracy of digitizing the logbook of the training ship "Saenuri" at the Mokpo Maritime University by correcting errors identified after Optical Character Recognition (OCR) program recognition. The model identified and corrected errors by utilizing associated rules considered during logbook entries. To evaluate the effect of model, the data before and after correction were divided by features, and comparisons were made between the same sailing number and the same feature. Using this model, approximately 10.6% of errors out of the total estimated error rate of about 11.8% were identified, and 56 out of 123 errors were corrected. A limitation of this study is that it only focuses on information from Dist.Run to Stand Course sections of the logbook, which contain navigational information. Future research will aim to correct more information from the logbook, including weather information, to overcome this limitation.
https://doi.org/10.7837/kosomes.2024.30.1.097 인용 PDF

Simple Frame Marker: Implementation of In-Marker Image and Character Recognition and Tracking Method (심플 프레임 마커: 마커 내부 이미지 및 문자 패턴의 인식 및 추적 기법 구현)

Kim, Hye-Jin;Woo, Woon-Tack
- 한국HCI학회:학술대회논문집
- /
- 2009.02a
- /
- pp.558-561
- /
- 2009
In this paper, we propose Simple Frame Marker(SFMarker) to support recognition of characters and images included in a marker in augmented reality. If characters are inserted inside of marker and are recognised using Optical Character Recognition(OCR), it doesn't need marker learning process before an execution. It also reduces visual disturbance compared to 2D barcode marker due to familarity of characters. Therefore, proposed SFMarker distinguishes Square SFMarker that embeds images from Rectangle SFMarker with characters according to ratio of marker and applies different recognition algorithms. Also, in order to reduce preprocessing of character recognition, SFMarker inserts direction information in border of marker and extracts it to execute character recognition fast and correctly. Finally, since the character recognition for every frame slows down tracking speed, we increase the speed of recognition process using the result of character recognition in previous frame when frame difference is low.
PDF

Detection and Recognition of Vehicle License Plates using Deep Learning in Video Surveillance

Farooq, Muhammad Umer;Ahmed, Saad;Latif, Mustafa;Jawaid, Danish;Khan, Muhammad Zofeen;Khan, Yahya
- International Journal of Computer Science & Network Security
- /
- v.22 no.11
- /
- pp.121-126
- /
- 2022
The number of vehicles has increased exponentially over the past 20 years due to technological advancements. It is becoming almost impossible to manually control and manage the traffic in a city like Karachi. Without license plate recognition, traffic management is impossible. The Framework for License Plate Detection & Recognition to overcome these issues is proposed. License Plate Detection & Recognition is primarily performed in two steps. The first step is to accurately detect the license plate in the given image, and the second step is to successfully read and recognize each character of that license plate. Some of the most common algorithms used in the past are based on colour, texture, edge-detection and template matching. Nowadays, many researchers are proposing methods based on deep learning. This research proposes a framework for License Plate Detection & Recognition using a custom YOLOv5 Object Detector, image segmentation techniques, and Tesseract's optical character recognition OCR. The accuracy of this framework is 0.89.
https://doi.org/10.22937/IJCSNS.2022.22.11.17 인용 PDF KSCI

An Efficient Management Strategy of A Offline Second-Hand Bookstore With Camera Type OCR Technology (카메라형 광학식문자판독기술(OCR)을 활용한 오프라인 중고서점의 장서 디지털 데이터화 관리 방안 제안)

Koo, Ja Min;Ham, Seung Mo;Kim, Woo Je;Shim, Hyun Dong;Ryu, Ki Don
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2014.01a
- /
- pp.283-286
- /
- 2014
본 논문에서는 카메라형 OCR (Optical Character Reader) 기술을 이용해 오프라인 중고서점의 효율적 장서관리 시스템을 구축하기 위한 디지털 데이터화 관리시스템 방안을 제안한다. OCR은 광학적으로 인식할 수 있는 문자를 컴퓨터가 읽을 수 있도록 하는 기술이다. 원리적으로 문자 한 개를 수십 개의 모눈으로 분할해 특정한 모눈의 흑백 또는 자획형상 특징에 의해 문자를 판독한다. 이 논문에서는 OCR 기술을 활용함으로써 디지털 데이터화의 효과는 물론 적용 환경의 개선효과를 기대해 볼 수 있는 오프라인 중고서점 시장을 목표로 했다. 오프라인 중고서점에서 보유하고 있는 장서의 디지털 데이터화는 기업형 중고서점과의 경쟁에 있어서도 생존을 위해 필요한 요소이다. 카메라형 OCR 기술을 활용한 장서 디지털 데이터화는 오프라인 중고서점 판매자가 도서재고 검색 및 판매 관리 효율을 높이도록 도와줄 뿐 아니라, 도서판매 유형, 소비자 분석과 수요 예측을 가능하게 한다. 또한 소비자에게 오프라인 중고서점에서 보유하고 있는 희귀 장서와 중고서적들을 검색해 구입할 수 있는 편의를 제공할 것이다. 오프라인 중고서점 판매를 촉진하고 활성화시킨다면 출판의 선순환적 구조를 만드는 데 기여할 것으로 예상된다.
PDF

Deep Learning OCR based document processing platform and its application in financial domain (금융 특화 딥러닝 광학문자인식 기반 문서 처리 플랫폼 구축 및 금융권 내 활용)

Dongyoung Kim;Doohyung Kim;Myungsung Kwak;Hyunsoo Son;Dongwon Sohn;Mingi Lim;Yeji Shin;Hyeonjung Lee;Chandong Park;Mihyang Kim;Dongwon Choi
- Journal of Intelligence and Information Systems
- /
- v.29 no.1
- /
- pp.143-174
- /
- 2023
With the development of deep learning technologies, Artificial Intelligence powered Optical Character Recognition (AI-OCR) has evolved to read multiple languages from various forms of images accurately. For the financial industry, where a large number of diverse documents are processed through manpower, the potential for using AI-OCR is great. In this study, we present a configuration and a design of an AI-OCR modality for use in the financial industry and discuss the platform construction with application cases. Since the use of financial domain data is prohibited under the Personal Information Protection Act, we developed a deep learning-based data generation approach and used it to train the AI-OCR models. The AI-OCR models are trained for image preprocessing, text recognition, and language processing and are configured as a microservice architected platform to process a broad variety of documents. We have demonstrated the AI-OCR platform by applying it to financial domain tasks of document sorting, document verification, and typing assistance The demonstrations confirm the increasing work efficiency and conveniences.
https://doi.org/10.13088/jiis.2023.29.1.143 인용 PDF

Search Result 136, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)