• Title/Summary/Keyword: OCR

Search Result 476, Processing Time 0.025 seconds

Trends in Deep Learning-based Medical Optical Character Recognition (딥러닝 기반의 의료 OCR 기술 동향)

  • Sungyeon Yoon;Arin Choi;Chaewon Kim;Sumin Oh;Seoyoung Sohn;Jiyeon Kim;Hyunhee Lee;Myeongeun Han;Minseo Park
    • The Journal of the Convergence on Culture Technology
    • /
    • v.10 no.2
    • /
    • pp.453-458
    • /
    • 2024
  • Optical Character Recognition is the technology that recognizes text in images and converts them into digital format. Deep learning-based OCR is being used in many industries with large quantities of recorded data due to its high recognition performance. To improve medical services, deep learning-based OCR was actively introduced by the medical industry. In this paper, we discussed trends in OCR engines and medical OCR and provided a roadmap for development of medical OCR. By using natural language processing on detected text data, current medical OCR has improved its recognition performance. However, there are limits to the recognition performance, especially for non-standard handwriting and modified text. To develop advanced medical OCR, databaseization of medical data, image pre-processing, and natural language processing are necessary.

A Study on Improvement of Korean OCR Accuracy Using Deep Learning (딥러닝을 이용한 한글 OCR 정확도 향상에 대한 연구)

  • Kang, Ga-Hyeon;Ko, Ji-Hyun;Kwon, Yong-Jun;Kwon, Na-Young;Koh, Seok-Ju
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2018.05a
    • /
    • pp.693-695
    • /
    • 2018
  • In this paper, we propose the improvement of Hangul OCR accuracy through deep learning. OCR is a program that senses printed and handwritten characters in an optical way and encodes them digitally. In the case of the most commonly used Tesseract OCR, the accuracy of English recognition is high. However, Hangul has lower accuracy because it has less learning data for a complex structure. Therefore, in this study, we propose a method to improve the accuracy of Hangul OCR by extracting the character region from the desired image through image processing and using deep learning using it as learning data. It is expected that OCR, which has been developed only by existing alphanumeric and several languages, can be applied to various languages.

  • PDF

Assessment of Overconsolidation Ratio by Depth of Soft Ground: A Case Study in South Korea (국내 연약지반의 심도별 과압밀비 산정에 관한 사례연구)

  • Lee, Jong-Young;Han, Jung-Geun
    • Journal of the Korean Geosynthetics Society
    • /
    • v.20 no.4
    • /
    • pp.9-18
    • /
    • 2021
  • In this study, the overconsolidation ratio (OCR) of soft clay soil was calculated by conducting an indoor physical experiment and a dynamics test using undisturbed soil samples from a soft clay soil field in South Korea. The OCR by depth was predicted by comparing the experimental results with the existing empirical equations. Methods using the liquidity index and the existing empirical equation by the Naval Facilities Engineering Systems Command (NAVFAC) were examined, and the results were compared with the actual measured values. The method using the liquidity index was found to be suitable for estimating the rough OCR of the ground. However, the effect of drying was not considered for the ground above the groundwater level. Therefore, an equation for the correlation equation between the depth and OCR of each region, including the ground above the groundwater level, was proposed. The proposed equation was applied to the OCR prediction of the adjacent area. The predicted values in the area composed of clay (CL, CH) were found to be in good agreement with the actual values. In the region composed of silt (ML), however, the predicted values were not consistent with the actual values. This suggests that the sedimentation and compositional characteristics, rather than the engineering characteristics of the soil, are important factors that affect the OCR prediction.

A Study on the Prediction for the OCR Technology Development Trajectory based on the Patent and Article Information (특허와 논문정보를 활용한 OCR 기술발전 동향예측에 관한 연구)

  • Won Jun, Kim;Sang Kon, Lee;Sung Kuk, Pyo
    • Journal of Information Technology Services
    • /
    • v.21 no.6
    • /
    • pp.39-51
    • /
    • 2022
  • As the 4th Industrial Revolution emerged as a key to improving national competitiveness, OCR technology, one of the major technologies in the 4th industry is in the spotlight. Since characters in various images contain a lot of information, OCR technology for recognizing these characters has evolved into technology used in many industries. In this paper, trends in OCR technology were identified and predicted using thesis data published in 'RISS' and patent data by International patent classification (IPC) under the theme of Optical character recognition (OCR). For patent data 20,000 patents related to OCR technology from 2002 to 2020 were used as data, and 432 papers from 2012 to 2022 were used as data. Through time-series analysis, each patent data and thesis data were investigated since when OCR technology has developed, and various keyword analysis predicted which technology will be used in the future. Finally, the direction of future OCR technology development was presented through network association analysis with patent data and thesis data.

Proposal Record Automation Service Based on AI by Using OCR and Pattern Analysis Algorithm (OCR과 패턴분석 알고리즘을 활용한 인공지능 기반 기록 자동화 서비스 제안)

  • Hwang, Yun-Young
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2019.10a
    • /
    • pp.530-532
    • /
    • 2019
  • 제안하는 서비스는 OCR(Optical Character Recognition, 광학문자인식)과 딥러닝 패턴분석 알고리즘을 활용하여 문서를 효율적으로 관리하는 서비스로 필기를 많이 하는 사용자를 위한 기능을 제공한다. 최근 다양한 분야에서의 머신러닝 기반의 OCR의 활용이 증가했지만 기존의 애플리케이션은 패턴 분석 알고리즘과 통계 기반의 OCR을 혼합하여 사용하기 때문에 필기체에 대한 인식률이 높지 않다. 이에 본 논문에서는 OCR과 패턴분석 알고리즘을 활용하여 필기체에 대한 높은 인식률을 제공하는 서비스를 제안한다.

Study on OCR Enhancement of Homomorphic Filtering with Adaptive Gamma Value

  • Heeyeon Jo;Jeongwoo Lee;Hongrae Lee
    • Journal of the Korea Society of Computer and Information
    • /
    • v.29 no.2
    • /
    • pp.101-108
    • /
    • 2024
  • AI-OCR (Artificial Intelligence Optical Character Recognition) combines OCR technology with Artificial Intelligence to overcome limitations that required human intervention. To enhance the performance of AI-OCR, training on diverse data sets is essential. However, the recognition rate declines when image colors have similar brightness levels. To solve this issue, this study employs Homomorphic filtering as a preprocessing step to clearly differentiate color levels, thereby increasing text recognition rates. While Homomorphic filtering is ideal for text extraction because of its ability to adjust the high and low frequency components of an image separately using a gamma value, it has the downside of requiring manual adjustments to the gamma value. This research proposes a range for gamma threshold values based on tests involving image contrast, brightness, and entropy. Experimental results using the proposed range of gamma values in Homomorphic filtering suggest a high likelihood for effective AI-OCR performance.

Text Region Extraction and OCR on Camera Based Images (카메라 영상 위에서의 문자 영역 추출 및 OCR)

  • Shin, Hyun-Kyung
    • The KIPS Transactions:PartD
    • /
    • v.17D no.1
    • /
    • pp.59-66
    • /
    • 2010
  • Traditional OCR engines are designed to the scanned documents in calibrated environment. Three dimensional perspective distortion and smooth distortion in images are critical problems caused by un-calibrated devices, e.g. image from smart phones. To meet the growing demand of character recognition of texts embedded in the photos acquired from the non-calibrated hand-held devices, we address the problem in three categorical aspects: rotational invariant method of text region extraction, scale invariant method of text line segmentation, and three dimensional perspective mapping. With the integration of the methods, we developed an OCR for camera-captured images.

Development of Smart Household Ledger based on OCR (OCR 기반 스마트 가계부 구현)

  • Chae, Sung-eun;Jung, Ki-seok;Lee, Jeong-yeol;Rho, Young-J.
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.18 no.6
    • /
    • pp.269-276
    • /
    • 2018
  • OCR(Optical Character Recognition) using computers has been developed for 20 years and applied to various fields such as parking management based on the recognition of license plates of cars. This technology was also used in the development of our smart OCR-based household ledger. In order to improve filling the purchase history into a smartphone based household account book, we can take pictures of receipts with the smarphone camera and automatically organize the purchase list. In this process, the recognition rate of the characters of the receipt image is not high enough with OCR technology. We could improve the rate by applying the image processing technology and adjusting the contrast of the receipt image. The rate improved from 89% to 92.5%.

Implementation and test results of on-channel repeater for ATSC 3.0 systems

  • Ahn, Sungjun;Kwon, Sunhyoung;Kwon, Hae-Chan;Kim, Youngsu;Lee, Jaekwon;Shin, Yoo-Sang;Park, Sung-Ik
    • ETRI Journal
    • /
    • v.44 no.5
    • /
    • pp.715-732
    • /
    • 2022
  • Despite the successful launch of Advanced Television Systems Committee (ATSC) 3.0 broadcasting worldwide, broadcasters are facing obstacles in constructing void-less large-scale single-frequency networks (SFNs). The bottleneck is the absence of decent on-channel repeater (OCR) solutions necessary for SFNs. In the real world, OCRs suffer from the maleficent feedback interference (FI) problem, which overwhelms the desired input signal. Moreover, the undesired multipaths between studio-linked transmitters and the OCR deteriorate the forward signals' quality as well. These problems crucially restrict the feasibility of conventional OCR systems, arousing the strong need for cost-worthy advanced OCR solutions. This paper presents an ATSC 3.0-specific solution of advanced OCR that solves the FI problem and refines the input signal. To this end, the FI canceler and channel equalizer functionalities are carefully implemented into the OCR system. The presented OCR system is designed to be fully compliant with the ATSC 3.0 specifications and performs a fast and efficient signal processing by exploiting the specific frame structure. The real product of ATSC 3.0 OCR is fabricated as well, and its feasibility is verified via field and laboratory experiments. The implemented solution is installed at a commercial on-air site and shown to provide substantial coverage gain in practice.

Development an Android based OCR Application for Hangul Food Menu (한글 음식 메뉴 인식을 위한 OCR 기반 어플리케이션 개발)

  • Lee, Gyu-Cheol;Yoo, Jisang
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.21 no.5
    • /
    • pp.951-959
    • /
    • 2017
  • In this paper, we design and implement an Android-based Hangul food menu recognition application that recognizes characters from images captured by a smart phone. Optical Character Recognition (OCR) technology is divided into preprocessing, recognition and post-processing. In the preprocessing process, the characters are extracted using Maximally Stable Extremal Regions (MSER). In recognition process, Tesseract-OCR, a free OCR engine, is used to recognize characters. In the post-processing process, the wrong result is corrected by using the dictionary DB for the food menu. In order to evaluate the performance of the proposed method, experiments were conducted to compare the recognition performance using the actual menu plate as the DB. The recognition rate measurement experiment with OCR Instantly Free, Text Scanner and Text Fairy, which is a character recognizing application in Google Play Store, was conducted. The experimental results show that the proposed method shows an average recognition rate of 14.1% higher than other techniques.