• Title/Summary/Keyword: font recognition

Search Result 67, Processing Time 0.025 seconds

A Recognition of the Printed Alphabet by Using Nonogram Puzzle (노노그램 퍼즐을 이용한 인쇄체 영문자 인식)

  • Sohn, Young-Sun;Kim, Bo-Sung
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.18 no.4
    • /
    • pp.451-455
    • /
    • 2008
  • In this paper we embody a system that recognizes the printed alphabet of two font types (Batang, Dodum) inputted by a black-and-white CCD camera and converts it into an editable text form. The image of the inputted printed sentences is binarized, then the rows of each sentence are separated through the vertical projection using the Histogram method, and the height of the characters are normalized to 48 pixels. With the reverse application of the basic principle of the Nonogram puzzle to the individual normalized character, the character is covered with the pixel-based squares, representing the characteristics of the character as the numerical information of the Nonogram puzzle in order to recognize the character through the comparison with the standard pattern information. The test of 2609 characters of font type Batang and 1475 characters of font type Dodum yielded a 100% recognition rate.

A Distinction of the Korean Character, Chinese Character and English Character using the Threshold Stroke Density (임계 획 밀도를 이용한 한글, 한자, 영문구분)

  • 원남식
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.5 no.4
    • /
    • pp.32-38
    • /
    • 2000
  • It is an important factor to distinguish the kind of the character for increasing recognition rate before the character recognition in the document recognition system composed of the multi-font and multi-letter. All the letters of each country have a various men characteristic in the each composition. In this paper, we used the stroke density as a method to distinguish the letter, and it has been adopted Korean, English and Chinese character. Input data is processed by the normalization to adopt multi-font document. Proposed method has been proved by the results of experiment the fact that the distinction probability of the Korean and English is more than 80%.

  • PDF

Machine Printed and Handwritten Text Discrimination in Korean Document Images

  • Trieu, Son Tung;Lee, Guee Sang
    • Smart Media Journal
    • /
    • v.5 no.3
    • /
    • pp.30-34
    • /
    • 2016
  • Nowadays, there are a lot of Korean documents, which often need to be identified in one of printed or handwritten text. Early methods for the identification use structural features, which can be simple and easy to apply to text of a specific font, but its performance depends on the font type and characteristics of the text. Recently, the bag-of-words model has been used for the identification, which can be invariant to changes in font size, distortions or modifications to the text. The method based on bag-of-words model includes three steps: word segmentation using connected component grouping, feature extraction, and finally classification using SVM(Support Vector Machine). In this paper, bag-of-words model based method is proposed using SURF(Speeded Up Robust Feature) for the identification of machine printed and handwritten text in Korean documents. The experiment shows that the proposed method outperforms methods based on structural features.

Distinction of the Korean and English Character Using the Stroke Density (획 밀도를 이용한 한영 구분)

  • Won, Nam-Sik;Jeon, Il-Soo;Lee, Doo-Han
    • The Transactions of the Korea Information Processing Society
    • /
    • v.4 no.7
    • /
    • pp.1873-1880
    • /
    • 1997
  • It is an important factor to distinguish the kind of the character for increasing recognition rate before the character recognition in the document recognition system composed of the multi-font and multi-letters. All the letters of each country have a various unique characteristic in the each composition. In this paper, we used the stroke density as a method to distinguish the letter, and it has been adopted only Korean and English character. Input data is processed by the normalization to adopt multi-font document. Proposed method has been proved by the results of experiment the fact that the distinction probability of the Korean and English is more than 90%.

  • PDF

Development of fashion cultural products utilizing the World Heritage of Korea - Focusing on Hangeul font and architecture - (한국의 세계유산을 활용한 패션문화상품 개발 - 한글 글자꼴과 건축물을 중심으로 -)

  • Song, Jaemin;Kim, Jiyoung;Choi, Jongmyoung
    • The Research Journal of the Costume Culture
    • /
    • v.25 no.5
    • /
    • pp.611-628
    • /
    • 2017
  • As a plan for establishing Korea's cultural identity and its competitive edge in the world market and for enhancing Korea's cultural status, creative and unique high value-added cultural products need to be developed utilizing our inherent cultural assets. Accordingly, this study focused on the development of the design of fashion cultural products that utilize the convergence of Hangeul our peculiar font style and Korea's cultural heritage, which is registered as part of UNESCO's World Heritage. A design method was devised that converges archetypal images of cultural property with the unique Hangeul font in a way that targets Korea's symbolic architectures. The symbolic architecture includes Korea's world-heritage pagoda architecture, such as Seokgatap pagoda and Dabotap pagoda at Bulguksa temple. It also included the architecture of royal palace, such as Injeongjeon hall at Changdeokgung palace. Finally, it also included the architecture of the fortress wall, such as Paldalmun gate in Hwaseong fortress. Thus, by developing cultural assets made from a convergence between architecture and the Hangeul font as a consumer-product image that has universality, the possibility of cultural products was pursued by applying color planning after an analysis that involved extracting the compositional colors of the flags of the world. This research and approach will lead to opportunities for further progress for Korea's cultural products in the global market as a results of additional recognition for their value, excellence, and universal appeal.

Locating Text in Web Images Using Image Based Approaches (웹 이미지로부터 이미지기반 문자추출)

  • Chin, Seongah;Choo, Moonwon
    • Journal of Intelligence and Information Systems
    • /
    • v.8 no.1
    • /
    • pp.27-39
    • /
    • 2002
  • A locating text technique capable of locating and extracting text blocks in various Web images is presented here. Until now this area of work has been ignored by researchers even if this sort of text may be meaningful for internet users. The algorithms associated with the technique work without prior knowledge of the text orientation, size or font. In the work presented in this research, our text extraction algorithm utilizes useful edge detection followed by histogram analysis on the genuine characteristics of letters defined by text clustering region, to properly perform extraction of the text region that does not depend on font styles and sizes. By a number of experiments we have showed impressively acceptable results.

  • PDF

A CHARACTER RECOGNITION SYSTEM BASED ON SYNTACTIC APPROACH (인쇄체 영문의 구문론적 인식)

  • Park, Dong-Choon;Park, Sung-Han
    • Proceedings of the KIEE Conference
    • /
    • 1987.07b
    • /
    • pp.1598-1601
    • /
    • 1987
  • This paper proposes a new set of topological features (primitives) for use with a syntactic recognizer for high-accuracy recognition of printed alphanumeric characters. The recognition is accomplished on nine character groups, where each group has different combinations of four feature points. A skeleton enhancement eliminating isolated points and smoothing irregular points is developed. The tree automata processed in parallel enables the realization of high-recognition speeds and font-type independent recognition. The proposed character recognition system is tested for alphanumeric character fonts of dot matrix printer and plotter using IBM-PC/XT.

  • PDF

High Speed Character Recognition by Multiprocessor System (멀티 프로세서 시스템에 의한 고속 문자인식)

  • 최동혁;류성원;최성남;김학수;이용균;박규태
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.30B no.2
    • /
    • pp.8-18
    • /
    • 1993
  • A multi-font, multi-size and high speed character recognition system is designed. The design principles are simpilcity of algorithm, adaptibility, learnability, hierachical data processing and attention by feed back. For the multi-size character recognition, the extracted character images are normalized. A hierachical classifier classifies the feature vectors. Feature is extracted by applying the directional receptive field after the directional dege filter processing. The hierachical classifier is consist of two pre-classifiers and one decision making classifier. The effect of two pre-classifiers is prediction to the final decision making classifier. With the pre-classifiers, the time to compute the distance of the final classifier is reduced. Recognition rate is 95% for the three documents printed in three kinds of fonts, total 1,700 characters. For high speed implemention, a multiprocessor system with the ring structure of four transputers is implemented, and the recognition speed of 30 characters per second is aquired.

  • PDF

A Study on Printed Hangeul Recognition with Dynamic Jaso Segmentation and Neural Network (동적자소분할과 신경망을 이용한 인쇄체 한글 문자인식기에 관한 연구)

  • 이판호;장희돈;남궁재찬
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.19 no.11
    • /
    • pp.2133-2146
    • /
    • 1994
  • In this paper, we present a method for dynamic Jaso segmentation and Hangeul recognition using neural network. It uses the feature vector which is extracted from the mesh depending on the segmentation result. At first, each character is converted to 256 dimension feature vector by four direction contributivity and $8\times8$ mesh. And then, the character is classified into 6 class by neural network and is segmented into Jaso using the classification result the statistic vowel location information and the structural information. After Jaso segmentation, Hanguel recognition using neural network is performed. We experiment on four font of which three fonts are used for training the neural net and the rest is used of testing. Each font has the 2350 characters which are comprised in KS C 5601. The overall recognition rates for the training data and the testing data are 97,4% and 94&% respectively. This result shows the effectivness of proposed method.

  • PDF