• Title/Summary/Keyword: WeOCR

Search Result 165, Processing Time 0.025 seconds

Three-Level Color Clustering Algorithm for Binarizing Scene Text Images (자연영상 텍스트 이진화를 위한 3단계 색상 군집화 알고리즘)

  • Kim Ji-Soo;Kim Soo-Hyung
    • The KIPS Transactions:PartB
    • /
    • v.12B no.7 s.103
    • /
    • pp.737-744
    • /
    • 2005
  • In this paper, we propose a three-level color clustering algerian for the binarization of text regions extracted from natural scene images. The proposed algorithm consists of three phases of color segmentation. First, the ordinary images in which the texts are well separated from the background, are binarized. Then, in the second phase, the input image is passed through a high pass filter to deal with those affected by natural or artificial light. Finally, the image Is passed through a low pass filter to deal with the texture in texts and/or background. We have shown that the proposed algorithm is more effective used gray-information binarization algorithm. To evaluate the effectiveness of the proposed algorithm we use a commercial OCR software ARMI 6.0 to observe the recognition accuracies on the binarized images. The experimental results on word and character recognition show that the proposed approach is more accurate than conventional methods by over $35\%$.

Image Denoising Methods based on DAECNN for Medication Prescriptions (DAECNN 기반의 병원처방전 이미지잡음제거)

  • Khongorzul, Dashdondov;Lee, Sang-Mu;Kim, Yong-Ki;Kim, Mi-Hye
    • Journal of the Korea Convergence Society
    • /
    • v.10 no.5
    • /
    • pp.17-26
    • /
    • 2019
  • We aimed to build a patient-based allergy prevention system using the smartphone and focused on the region of interest (ROI) extraction method for Optical Character Recognition (OCR) in the general environment. However, the current ROI extraction method has shown good performance in the experimental environment, but the performance in the real environment was not good due to the noisy background. Therefore, in this paper, we propose the compared methods of reducing noisy background to solve the ROI extraction problem. There five methods used as a SMF, DIN, Denoising Autoencoder(DAE), DAE with Convolution Neural Network(DAECNN) and median filter(MF) with DAECNN (MF+DAECNN). We have shown that our proposed DAECNN and MF+DAECNN methods are 69%, respectively, which is relatively higher than the conventional DAE method 55%. The verification of performance improvement uses MSE, PSNR and SSIM. The system has implemented OpenCV, C++ and Python, including its performance, is tested on real images.

Simple Frame Marker: Implementation of In-Marker Image and Character Recognition and Tracking Method (심플 프레임 마커: 마커 내부 이미지 및 문자 패턴의 인식 및 추적 기법 구현)

  • Kim, Hye-Jin;Woo, Woon-Tack
    • 한국HCI학회:학술대회논문집
    • /
    • 2009.02a
    • /
    • pp.558-561
    • /
    • 2009
  • In this paper, we propose Simple Frame Marker(SFMarker) to support recognition of characters and images included in a marker in augmented reality. If characters are inserted inside of marker and are recognised using Optical Character Recognition(OCR), it doesn't need marker learning process before an execution. It also reduces visual disturbance compared to 2D barcode marker due to familarity of characters. Therefore, proposed SFMarker distinguishes Square SFMarker that embeds images from Rectangle SFMarker with characters according to ratio of marker and applies different recognition algorithms. Also, in order to reduce preprocessing of character recognition, SFMarker inserts direction information in border of marker and extracts it to execute character recognition fast and correctly. Finally, since the character recognition for every frame slows down tracking speed, we increase the speed of recognition process using the result of character recognition in previous frame when frame difference is low.

  • PDF

Design of Postal Address File for Address Interpretation and Retrieval (주소해석 및 검색을 위한 우편주소파일 설계)

  • Chang, Tai-Woo;Kim, Ho-Yon;Lim, Kil-Taek
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.12 no.4
    • /
    • pp.74-88
    • /
    • 2007
  • In order to automate the process of mail sorting by delivery sequence, it is necessary to prepare a postal address database and to interpret written addresses on the mail-pieces with the database and OCR technology. The address database is a critical factor of automation and informatization of postal service since it could be used not only in address recognition but also in various mail processing. In this study, we design the schema of postal address database, design the postal address file based on it and explain the method of address interpretation and retrieval using it. We analyze infonnation requirements for transformation of postal address into the standardized format and consider them in the process of design. The postal address file can be used by address matching or retrieval system as well as by Hangul address recognition system for automation of delivery sequence mail-sorting.

  • PDF

Customer Barcode Support System for the Cost Saving of Mail Items (우편물 처리원가 절감을 위한 고객 바코드 지원 시스템)

  • Hwang, Jae-Gak;Park, Moon-Sung;Song, Jae-Gwan;Woo, Dong-Chin
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.10
    • /
    • pp.2563-2573
    • /
    • 1999
  • In most mail automatic processing centers, after facing and canceling, letter mails are passed through an Optical Character Recognition/Barcode Sorter(OCR/BS) to read the postal code and 3 of 5 fluorescent (luminescent) barcode is applied. Normally, 31%∼35% of this mails are rejected. The main reasons for reading failures are poor printing quality of addresses and barcodes, script printing, writing in a cursive hand, variety fonts, and failure to locate the address. Our goal is to provide mailer with top quality service and customer barcode service as we move toward 100% barcoding automation of letter mail. In this paper, we propose a method of printing 3 of 5 customer barcode, postal code management, and detection of postal code based on postal address for increase the performance of automatic processing system in mail items. Using postal code generating rules, which are automatically extracted from postal addresses and address numbers, creates postal codes. The customer barcode support system is implemented by C++ language and runs on IBM PC under Windows 95.

  • PDF

Word Extraction from Table Regions in Document Images (문서 영상 내 테이블 영역에서의 단어 추출)

  • Jeong, Chang-Bu;Kim, Soo-Hyung
    • The KIPS Transactions:PartB
    • /
    • v.12B no.4 s.100
    • /
    • pp.369-378
    • /
    • 2005
  • Document image is segmented and classified into text, picture, or table by a document layout analysis, and the words in table regions are significant for keyword spotting because they are more meaningful than the words in other regions. This paper proposes a method to extract words from table regions in document images. As word extraction from table regions is practically regarded extracting words from cell regions composing the table, it is necessary to extract the cell correctly. In the cell extraction module, table frame is extracted first by analyzing connected components, and then the intersection points are extracted from the table frame. We modify the false intersections using the correlation between the neighboring intersections, and extract the cells using the information of intersections. Text regions in the individual cells are located by using the connected components information that was obtained during the cell extraction module, and they are segmented into text lines by using projection profiles. Finally we divide the segmented lines into words using gap clustering and special symbol detection. The experiment performed on In table images that are extracted from Korean documents, and shows $99.16\%$ accuracy of word extraction.

Etoposide Induces Mitochondrial Dysfunction and Cellular Senescence in Primary Cultured Rat Astrocytes

  • Bang, Minji;Kim, Do Gyeong;Gonzales, Edson Luck;Kwon, Kyoung Ja;Shin, Chan Young
    • Biomolecules & Therapeutics
    • /
    • v.27 no.6
    • /
    • pp.530-539
    • /
    • 2019
  • Brain aging is an inevitable process characterized by structural and functional changes and is a major risk factor for neurodegenerative diseases. Most brain aging studies are focused on neurons and less on astrocytes which are the most abundant cells in the brain known to be in charge of various functions including the maintenance of brain physical formation, ion homeostasis, and secretion of various extracellular matrix proteins. Altered mitochondrial dynamics, defective mitophagy or mitochondrial damages are causative factors of mitochondrial dysfunction, which is linked to age-related disorders. Etoposide is an anti-cancer reagent which can induce DNA stress and cellular senescence of cancer cell lines. In this study, we investigated whether etoposide induces senescence and functional alterations in cultured rat astrocytes. Senescence-associated ${\beta}$-galactosidase (SA-${\beta}$-gal) activity was used as a cellular senescence marker. The results indicated that etoposide-treated astrocytes showed cellular senescence phenotypes including increased SA-${\beta}$-gal-positive cells number, increased nuclear size and increased senescence-associated secretory phenotypes (SASP) such as IL-6. We also observed a decreased expression of cell cycle markers, including PhosphoHistone H3/Histone H3 and CDK2, and dysregulation of cellular functions based on wound-healing, neuronal protection, and phagocytosis assays. Finally, mitochondrial dysfunction was noted through the determination of mitochondrial membrane potential using tetramethylrhodamine methyl ester (TMRM) and the measurement of mitochondrial oxygen consumption rate (OCR). These data suggest that etoposide can induce cellular senescence and mitochondrial dysfunction in astrocytes which may have implications in brain aging and neurodegenerative conditions.

Classification of Handwritten and Machine-printed Korean Address Image based on Connected Component Analysis (연결요소 분석에 기반한 인쇄체 한글 주소와 필기체 한글 주소의 구분)

  • 장승익;정선화;임길택;남윤석
    • Journal of KIISE:Software and Applications
    • /
    • v.30 no.10
    • /
    • pp.904-911
    • /
    • 2003
  • In this paper, we propose an effective method for the distinction between machine-printed and handwritten Korean address images. It is important to know whether an input image is handwritten or machine-printed, because methods for handwritten image are quite different from those of machine-printed image in such applications as address reading, form processing, FAX routing, and so on. Our method consists of three blocks: valid connected components grouping, feature extraction, and classification. Features related to width and position of groups of valid connected components are used for the classification based on a neural network. The experiment done with live Korean address images has demonstrated the superiority of the proposed method. The correct classification rate for 3,147 testing images was about 98.85%.

Enhancement of Transmittance and Adhesion of Flexible Display Adhesion Surface by Bubble Removing Process (기포 제거 공정을 통한 유연한 디스플레이 합착 면의 투과율 및 접착력 향상)

  • Kim, Jungsoo;Jang, Kyungsoo;Phu, Cam;Park, Heejun;Shin, Donggi;Lee, Younjung;Yi, Junsin
    • Journal of the Korean Institute of Electrical and Electronic Material Engineers
    • /
    • v.31 no.5
    • /
    • pp.330-334
    • /
    • 2018
  • With the development of the Internet of Things, the use of flexible displays has become widespread. In particular, the use of curved, bendable, and rollable displays is increasing. Flexible display production processes include various important components such as lamination material, flexible substrates, and adhesives. Among them, improvement of the lamination process comprises a large proportion of efforts for further development. In this paper, we attempt to improve the transmittance of the display substrate by performing a bubble removal process after adhesion. The transmittance of the glass substrate with the bubble removal process was 5~12% higher than that of the substrate without the bubble removal process. The fill-strength after the bubble removal process was improved by 21.4%, and the shear-strength was improved by 43.9%.

A study on the Character Correction of the Wrongly Recognized Sentence Marks, Japanese, English, and Chinese Character in the Off-line printed Character Recognition (오프라인 인쇄체 문장부호, 일본 문자, 영문자, 한자 인식에서의 오인식 문자 교 정에 관한 연구)

  • Lee, Byeong-Hui;Kim, Tae-Gyun
    • The Transactions of the Korea Information Processing Society
    • /
    • v.4 no.1
    • /
    • pp.184-194
    • /
    • 1997
  • In the recent years number of commercial off-line character recognition systems have been appeared in the Korean market. This paper describes a "self -organizing" data structure for representing a large dictionary which can be searched in real time and uses a practical amount of memory, and presents a study on the character correction for off-line printed sentence marks, Japanese, English, and Chinese character recognition. Self-organizing algorithm can be recommenced as particularly appropriate when we have reasons to suspect that the accessing probabilities for individual words will change with time and theme. The wrongly recognized characters generated by OCR systems are collected and analyzed Error types of English characters are reclassified and 0.5% errors are corrected using an English character confusion table with a self-organizing dictionary containing 25,145 English words. And also error types of Chinese characters are classified and 6.1% errors are corrected using a Chinese character confusion table with a self-organizing dictionary carrying 34,593 Chinese words.ese words.

  • PDF