• Title/Summary/Keyword: Printed Korean characters recognition

Search Result 44, Processing Time 0.027 seconds

A Recognition System for Multi-Form Korean Characters Based on Hierarchical Temporal Memory

  • Haibao, Nan;Bae, Sun-Gap;Bae, Jong-Min;Kang, Hyun-Syug
    • Journal of Korea Multimedia Society
    • /
    • v.12 no.12
    • /
    • pp.1718-1727
    • /
    • 2009
  • Traditional character recognition systems usually aim at characters with simple variation. With the development of multimedia technology, printed characters may appear more diversely. Existing recognition technologies can't deal with Hangul recognition effectively in diverse environments. This paper presents a recognition system for multi-form Korean characters called RSMFK, which is based on the model of Hierarchical Temporal Memory (HTM). Our system can effectively recognize the printed Korean characters of different fonts, scales, rotation, noise and background. HTM is a model which simulates the neocortex of human brain to recognize and memorize intelligently. Experimental results show that RSMFK performs a good recognition rate of 97.8% on average, which is proved to be obviously improved over the conventional methods.

  • PDF

Recognition of the Printed English Sentence by Using Japanese Puzzle

  • Sohn, Young-Sun
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.8 no.3
    • /
    • pp.225-230
    • /
    • 2008
  • In this paper we embody a system that recognizes printed alphabet, numeral figures and symbols written on the keyboard for the recognition of English sentences. The image of the printed sentences is inputted and binarized, and the characters are separated by using histogram method that is the same as the existing character recognition method. During the abstraction of the individual characters, we classify one group that has not numerical information by the projection of the vertical center of the character. In case of another group that has the longer width than the height, we assort them by normalizing the width. The other group normalizes the height of the images. With the reverse application of the basic principle of the Japanese Puzzle to a normalized character image, the proposed system classifies and recognizes the printed numeral figures, symbols and characters, consequently we meet with good result.

Recognition of Characters Printed on PCB Components Using Deep Neural Networks (심층신경망을 이용한 PCB 부품의 인쇄문자 인식)

  • Cho, Tai-Hoon
    • Journal of the Semiconductor & Display Technology
    • /
    • v.20 no.3
    • /
    • pp.6-10
    • /
    • 2021
  • Recognition of characters printed or marked on the PCB components from images captured using cameras is an important task in PCB components inspection systems. Previous optical character recognition (OCR) of PCB components typically consists of two stages: character segmentation and classification of each segmented character. However, character segmentation often fails due to corrupted characters, low image contrast, etc. Thus, OCR without character segmentation is desirable and increasingly used via deep neural networks. Typical implementation based on deep neural nets without character segmentation includes convolutional neural network followed by recurrent neural network (RNN). However, one disadvantage of this approach is slow execution due to RNN layers. LPRNet is a segmentation-free character recognition network with excellent accuracy proved in license plate recognition. LPRNet uses a wide convolution instead of RNN, thus enabling fast inference. In this paper, LPRNet was adapted for recognizing characters printed on PCB components with fast execution and high accuracy. Initial training with synthetic images followed by fine-tuning on real text images yielded accurate recognition. This net can be further optimized on Intel CPU using OpenVINO tool kit. The optimized version of the network can be run in real-time faster than even GPU.

The Recognition of Printed Korean Characters by a Neural Network (신경회로망을 이용한 인쇄체 한글 문자의 인식)

  • Kim, Sang-Woo;Jeon, Yun-Ho;Choi, Chong-Ho
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.27 no.2
    • /
    • pp.65-72
    • /
    • 1990
  • The potential of neural networks for the recognition of the printed Korean characters is examined. In spite of good classification capability of neural networks, it is difficult to train a neural network to recognize Korean characters. The difficulty is due to a large number of Korean characters, the similarities among the characters, and the large number of data from the character images. To reduce the input image data, DC components are extracted from each input images. These preprocessed data are used as input to the neural network. The output nodes are composed to represent the characteristics of Korean characters. A MLP (multilayer perceptron) with one hidden layer was trained with a modified BEP algorithm, This method gives good recognition rate for the standard positioned characters of more than 2,300. The result shows that neural networks are well suited for the recognition of printed Korean characters.

  • PDF

A Study on the Fractal Attractor Creation and Analysis of the Printed Korean Characters

  • Shon, Young-Woo
    • Journal of information and communication convergence engineering
    • /
    • v.1 no.1
    • /
    • pp.53-57
    • /
    • 2003
  • Chaos theory is a study researching the irregular, unpredictable behavior of deterministic and non-linear dynamical system. The interpretation using Chaos makes us evaluate characteristic existing in status space of system by tine series, so that the extraction of Chaos characteristic understanding and those characteristics enables us to do high precision interpretation. Therefore, This paper propose the new method which is adopted in extracting character features and recognizing characters using the Chaos Theory. Firstly, it gets features of mesh feature, projection feature and cross distance feature from input character images. And their feature is converted into time series data. Then using the modified Henon system suggested in this paper, it gets last features of character image after calculating Box-counting dimension, Natural Measure, information bit and information dimension which are meant fractal dimension. Finally, character recognition is performed by statistically finding out the each information bit showing the minimum difference against the normalized pattern database. An experimental result shows 99% character classification rates for 2,350 Korean characters (Hangul) using proposed method in this paper.

An Efficient Character Recognition Algorithm in Printed Korean/English Documents Including Touching Characters (붙은 글자들이 포함된 인쇄체 한.영 혼용 문서에서의 효과적인 문자 인식 알고리즘)

  • 김규경;김진호;진성일;최흥문
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.33B no.11
    • /
    • pp.116-126
    • /
    • 1996
  • In this paper, we present a character recognition algorithm in printed korean and english documents including touching characters. We derived two rules to segment and recognize touching characters in the bilingual documents, one from the shape characteristics of korean and english characters of the writing blocks defined in this paper, and the other from the RF (reliability factor) values generated from the classifiers. Overall classification accuracy for the KITE paper of the proposed algorithm was about 96.8% for the english abstract, and about 97.8% for the bilingual parts. Also we confirmed the proposed algorithm significantly improves the accuracy of character segmentation of the actual mixed korean and english documents including touching characters.

  • PDF

A Study on the Printed Korean and Chinese Character Recognition (인쇄체 한글 및 한자의 인식에 관한 연구)

  • 김정우;이세행
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.17 no.11
    • /
    • pp.1175-1184
    • /
    • 1992
  • A new classification method and recognition algorithms for printed Korean and Chinese character is studied for Korean text which contains both Korean and Chinese characters. The proposed method utilizes structural features of the vertical and horizontal vowel in Korean character. Korean characters are classified into 6 groups. Vowel and consonant are separated by means of different vowel extraction methods applied to each group. Time consuming thinning process is excluded. A modified crossing distance feature is measured to recognize extracted consonant. For Chinese character, an average of stroke crossing number is calculated on every characters, which allows the characters to be classified into several groups. A recognition process is then followed in terms of the stroke crossing number and the black dot rate of character. Classification between Korean and Chinese character was at the rate of 90.5%, and classification rate of Ming-style 2512 Korean characters was 90.0%. The recognition algorithm was applied on 1278 characters. The recognition rate was 92.2%. The densest class after classification of 4585 Chinese characters was found to contain only 124 characters, only 1/40 of total numbers. The recognition rate was 89.2%.

  • PDF

A Recognition of the Printed Alphabet by Using Nonogram Puzzle (노노그램 퍼즐을 이용한 인쇄체 영문자 인식)

  • Sohn, Young-Sun;Kim, Bo-Sung
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.18 no.4
    • /
    • pp.451-455
    • /
    • 2008
  • In this paper we embody a system that recognizes the printed alphabet of two font types (Batang, Dodum) inputted by a black-and-white CCD camera and converts it into an editable text form. The image of the inputted printed sentences is binarized, then the rows of each sentence are separated through the vertical projection using the Histogram method, and the height of the characters are normalized to 48 pixels. With the reverse application of the basic principle of the Nonogram puzzle to the individual normalized character, the character is covered with the pixel-based squares, representing the characteristics of the character as the numerical information of the Nonogram puzzle in order to recognize the character through the comparison with the standard pattern information. The test of 2609 characters of font type Batang and 1475 characters of font type Dodum yielded a 100% recognition rate.

Character Segmentation and Recognition Algorithm for Steel Manufacturing Process Automation (슬라브 제품 정보 인식을 위한 문자 분리 및 문자 인식 알고리즘 개발)

  • Choi, Sung-Hoo;Yun, Jong-Pil;Park, Young-Su;Park, Jee-Hoon;Koo, Keun-Hwi;Kim, Sang-Woo
    • Proceedings of the KIEE Conference
    • /
    • 2007.04a
    • /
    • pp.389-391
    • /
    • 2007
  • This paper describes about the printed character segmentation and recognition system for slabs in steel manufacturing process. To increase the recognition rate, it is important to improve success rate of character segmentation. Since Slabs front area surface are not uniform and surface temperature is very high, marked characters not only undergo damages but also have much noise. On the other hand, since almost marked characters are very thick and the space between characters is only about 10 $^{\sim}$ 15 mm, there are many touching characters. Therefore appropriate character image preprocessing and segmentation algorithm is needed. In this paper we propose a multi-local thresholding method for damaged character restoration, a modified touching character segmentation, algorithm for marked characters. Finally a effective Multi-Class SVM is used to recognize segmented characters.

  • PDF

A Study on the Pre-Classification of Handwritten Hangeul Characters Using Partial Separation and Recognition of Initial Consonants (초성자소분리 인식에 의한 필기 한글문자의 대분류에 관한 연구)

  • 안석출;김명기
    • Journal of the Korean Graphic Arts Communication Society
    • /
    • v.6 no.1
    • /
    • pp.41-57
    • /
    • 1988
  • Recently, it Is required to develop OCR(Optical Character Reader) along with the progress of the information processing system for Hangeul. Characters have to be recognized clearly so that OCR can be applied, Structure analysis method and lump method are used for the recognition of characters, and OCR is now available for the recognition of printed characters and handwritten alphanumeric characters having simple structure by them However, It is known that there should be much more study on the development of handwritten Hangout's OCR. This paper proposed a new method for the handwritten Hangout character recognition. The units of Initial consonant of Hangout are separated and then recognized from the utilization of the position- Information of Hangeul's units from the normalized patterns using the regression line theory. It is carried out for the extraction of the block which exists in the virtual Initial consonant region from the normalized input patterns and the calculation on maximum value (${\beta}$) of likelihood after comparing the features of separated subpattern with the initial consonant dictionary.

  • PDF