• Title/Summary/Keyword: 문자특징 추출

Search Result 252, Processing Time 0.025 seconds

A Study on Character Segmentation in Car Plates (번호판에서의 문자 세그멘테이션에 관한 연구)

  • Lee, Sang-Hoon;Kim, Kyung-Hyun;Kim, Chun-Lin;Cha, Eui-Young
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2003.05a
    • /
    • pp.623-626
    • /
    • 2003
  • 본 논문에서는 현재 자동차 번호판의 형식이 구 번호판과 신 번호판 두 가지 유형으로 구성되어 있다는 점을 고려하여 번호판의 세부적 세그멘테이션의 성능을 개선하는 방법에 대하여 제시한다. 컴퓨터 비젼을 바탕으로 한 자동차 번호판의 인식방법과 문자인식방법은 비용면이나 간편성에서 맡은 장점을 가지고 있으며 여러 응용분야에서 사용될 수 있기 때문에 다방면에서 시도되고 있다. 본 시스템은 모폴로지 연산과 클러스트링을 이용하여 자동차 번호판 전체 영역을 추출하는 방법을 사용한다. 다음으로 구번호판에서 신번호판으로 넘어가는 과도기적 단계에 있는 번호판들의 특징인 용도기능의 표시문자의 위치 차이를 이용하여 구 번호판과 신번호판을 먼저 분류한다. 분류된 번호판에서 두 번호판의 차이점인 차종기초 표시영역의 숫자를 나누어서 세그멘테이션함으로서 기존의 연구방법보다 개선된 세그멘테이션 능력과 이로 인하여 향상된 번호판 인식결과를 얻을 수 있다.

  • PDF

Automatic gasometer reading system using selective optical character recognition (관심 문자열 인식 기술을 이용한 가스계량기 자동 검침 시스템)

  • Lee, Kyohyuk;Kim, Taeyeon;Kim, Wooju
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.2
    • /
    • pp.1-25
    • /
    • 2020
  • In this paper, we suggest an application system architecture which provides accurate, fast and efficient automatic gasometer reading function. The system captures gasometer image using mobile device camera, transmits the image to a cloud server on top of private LTE network, and analyzes the image to extract character information of device ID and gas usage amount by selective optical character recognition based on deep learning technology. In general, there are many types of character in an image and optical character recognition technology extracts all character information in an image. But some applications need to ignore non-of-interest types of character and only have to focus on some specific types of characters. For an example of the application, automatic gasometer reading system only need to extract device ID and gas usage amount character information from gasometer images to send bill to users. Non-of-interest character strings, such as device type, manufacturer, manufacturing date, specification and etc., are not valuable information to the application. Thus, the application have to analyze point of interest region and specific types of characters to extract valuable information only. We adopted CNN (Convolutional Neural Network) based object detection and CRNN (Convolutional Recurrent Neural Network) technology for selective optical character recognition which only analyze point of interest region for selective character information extraction. We build up 3 neural networks for the application system. The first is a convolutional neural network which detects point of interest region of gas usage amount and device ID information character strings, the second is another convolutional neural network which transforms spatial information of point of interest region to spatial sequential feature vectors, and the third is bi-directional long short term memory network which converts spatial sequential information to character strings using time-series analysis mapping from feature vectors to character strings. In this research, point of interest character strings are device ID and gas usage amount. Device ID consists of 12 arabic character strings and gas usage amount consists of 4 ~ 5 arabic character strings. All system components are implemented in Amazon Web Service Cloud with Intel Zeon E5-2686 v4 CPU and NVidia TESLA V100 GPU. The system architecture adopts master-lave processing structure for efficient and fast parallel processing coping with about 700,000 requests per day. Mobile device captures gasometer image and transmits to master process in AWS cloud. Master process runs on Intel Zeon CPU and pushes reading request from mobile device to an input queue with FIFO (First In First Out) structure. Slave process consists of 3 types of deep neural networks which conduct character recognition process and runs on NVidia GPU module. Slave process is always polling the input queue to get recognition request. If there are some requests from master process in the input queue, slave process converts the image in the input queue to device ID character string, gas usage amount character string and position information of the strings, returns the information to output queue, and switch to idle mode to poll the input queue. Master process gets final information form the output queue and delivers the information to the mobile device. We used total 27,120 gasometer images for training, validation and testing of 3 types of deep neural network. 22,985 images were used for training and validation, 4,135 images were used for testing. We randomly splitted 22,985 images with 8:2 ratio for training and validation respectively for each training epoch. 4,135 test image were categorized into 5 types (Normal, noise, reflex, scale and slant). Normal data is clean image data, noise means image with noise signal, relfex means image with light reflection in gasometer region, scale means images with small object size due to long-distance capturing and slant means images which is not horizontally flat. Final character string recognition accuracies for device ID and gas usage amount of normal data are 0.960 and 0.864 respectively.

A New Extraction Method of the Target Regions for AVI System (AVI 시스템을 위한 목표 영역의 새로운 추출 기법)

  • Cho, Dong Uk;Park, Young;Choi, Dong-Sun
    • The Journal of the Acoustical Society of Korea
    • /
    • v.17 no.5
    • /
    • pp.22-27
    • /
    • 1998
  • 본 논문에서는 차량 자동 인식 시스템(AVI:Automatic Vehicle Identification)구현에 있어 목표 영역이 되는 차량 번호판과 운전자 얼굴의 특진요소를 효율적으로 추출하기 위한 방법에 대해 다루고자 한다. 이를 위해 카메라를 두 대 설치하여 한 대의 카메라로부터는 차량 번호판 영역을 추출하고 또 하나의 카메라로는 운전자의 얼굴영역을 추출한다. 목표가 되는 두 영역의 추출을 위해 환경에 불변인 경계선 추출 방법을 제안하였고, 히스토그램의 특성을 이용하여 목표영역을 추출한다. 최종적으로 차량 번호판의 경우 추출된 번호판 영역 에 다시 X, Y 라인히스토그램을 이용하여 문자영역의 분리를 행하였고, 운전자의 경우 눈, 코, 입 등에 대한 특징을 추출하였다.

  • PDF

An Efficient Slant Correction for Handwritten Hangul Strings using Structural Properties (한글필기체의 구조적 특징을 이용한 효율적 기울기 보정)

  • 유대근;김경환
    • Journal of KIISE:Software and Applications
    • /
    • v.30 no.1_2
    • /
    • pp.93-102
    • /
    • 2003
  • A slant correction method for handwritten Korean strings based on analysis of stroke distribution, which effectively reflects structural properties of Korean characters, is presented in this paper. The method aims to deal with typical problems which have been frequently observed in slant correction of handwritten Korean strings with conventional approaches developed for English/European languages. Extracted strokes from a line of text image are classified into two clusters by applying the K-means clustering. Gaussian modeling is applied to each of the clusters and the slant angle is estimated from the model which represents the vertical strokes. Experimental results support the effectiveness of the proposed method. For the performance comparison 1,300 handwritten address string images were used, and the results show that the proposed method has more superior performance than other conventional approaches.

Hybrid Word-Character Neural Network Model for the Improvement of Document Classification (문서 분류의 개선을 위한 단어-문자 혼합 신경망 모델)

  • Hong, Daeyoung;Shim, Kyuseok
    • Journal of KIISE
    • /
    • v.44 no.12
    • /
    • pp.1290-1295
    • /
    • 2017
  • Document classification, a task of classifying the category of each document based on text, is one of the fundamental areas for natural language processing. Document classification may be used in various fields such as topic classification and sentiment classification. Neural network models for document classification can be divided into two categories: word-level models and character-level models that treat words and characters as basic units respectively. In this study, we propose a neural network model that combines character-level and word-level models to improve performance of document classification. The proposed model extracts the feature vector of each word by combining information obtained from a word embedding matrix and information encoded by a character-level neural network. Based on feature vectors of words, the model classifies documents with a hierarchical structure wherein recurrent neural networks with attention mechanisms are used for both the word and the sentence levels. Experiments on real life datasets demonstrate effectiveness of our proposed model.

A Study on Recognition of Both of PCA and LAD Using Types of Vehicle Plate (PCA와 LDA을 이용한 차량 번호판 통합 인식에 관한 연구)

  • Lee, Jin-Ki;Kim, Hyun-Yul;Lee, Seung-Kyu;Lee, Geon-Wha;Park, Yung-Rok;An, Ki-Nam;Bae, Cheol-Su;Park, Young-Cheol
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.6 no.1
    • /
    • pp.6-17
    • /
    • 2013
  • Recently, the color of vehicle license plate has been changed from green to white. Thus the vehicle plate recognition system used for parking management systems, speed and signal violation detection systems should be robust to the both colors. This paper presents a vehicle license plate recognition system, which works on both of green and white plate at the same time. In the proposed system, the image of license plate is taken from a captured vehicle image by using morphological information. In the next, each character region in the license plate image is extracted based on the vertical and horizontal projection of plate image and the relative position of individual characters. Finally, for the recognition process of extracted characters, PCA(Principal Component Analysis) and LDA(Linear Discriminant Analysis) are sequentially utilized. In the experiment, vehicle license plates of both green background and white background captured under irregular illumination conditions have been tested, and the relatively high extraction and recognition rates are observed.

Stroke Extraction in Phoneme for Off-Line Handwritten Hangul Recognition (오프라인 필기체 한글 인식을 위한 자소 내 자획의 분리)

  • Jung Min-Chul
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.7 no.3
    • /
    • pp.385-392
    • /
    • 2006
  • This paper proposes a new stroke extraction algorithm for phoneme segmentation, which is one of main techniques for off-line handwritten Hangul recognition. The proposed algorithm extracts vertical, slant, and horizontal strokes from phonemes using run-length. The run-length of vertical or slant strokes becomes the width, and also the number of horizontal run-lengths the width. After extracting horizontal strokes from phonemes, the algorithm links two continuous vertical or slant stokes with run-lengths of the strokes' width to represent the features of a character. The extracted strokes can be utilized to recognize a character, using template matching of strokes, which is being adopted in on-line handwritten Hangul recognition.

  • PDF

Recognition of a New Car License Plate Using HSI Information, Fuzzy Binarization and ART2 Algorithm (HSI 정보와 퍼지 이진화 및 ART2 알고리즘을 이용한 신차량 번호판의 인식)

  • Kim, Kwang-Baek;Woo, Young-Woon;Park, Choong-Shik
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.11 no.5
    • /
    • pp.1004-1012
    • /
    • 2007
  • In this paper, we proposed a new car license plate recognition method using an unsupervised ART2 algorithm with HSI color model. The proposed method consists of two main modules; extracting plate area from a vehicle image and recognizing the characters in the plate after that. To extract plate area, hue(H) component of HSI color model is used, and the sub-area containing characters is acquired using modified fuzzy binarization method. Each character is further divided by a 4-directional edge tracking algorithm. To recognize the separated characters, noise-robust ART2 algorithm is employed. When the proposed algorithm is applied to recognize license plate characters, the extraction rate is better than that of existing RGB model and the overall recognition rate is about 97.4%.

Segmentation of Words from the Lines of Unconstrained Handwritten Text using Neural Networks (신경회로망을 이용한 제약 없이 쓰여진 필기체 문자열로부터 단어 분리 방법)

  • Kim, Gyeong-Hwan
    • Journal of the Korean Institute of Telematics and Electronics C
    • /
    • v.36C no.7
    • /
    • pp.27-35
    • /
    • 1999
  • Researches on the recognition of handwritten script have been conducted under the assumption that the isolated recognition units are provided as inputs. However, in practical recognition system designs, providing the isolated recognition unit is an challenge due to various writing syles. This paper proposes an approach for segmenting words from lines of unconstrained handwritten text, without help of recognition. In contrast to the conventional approaches which are based on physical gaps between connected components, clues that reflect the author's writing style, in terms of spacing, are extracted and utilized for the segmentation using a simple neural network. The clues are from character segments and include normalized heights and intervals of the segments. Effectiveness of the proposed approach compared with the conventional connected component based approaches in terms of word segmentation performance was evaluated by experiments.

  • PDF

Nonlinear Shape Normalization Algorithms for Gray-Scale Handwritten Hangul Images (명도 한글 글씨 영상에서의 비선형 형태 정규화 알고리즘)

  • Kim, Sang-Yup;Kim, Dae-In;Lee, Seong-Whan
    • Annual Conference on Human and Language Technology
    • /
    • 1996.10a
    • /
    • pp.98-104
    • /
    • 1996
  • 일반적으로 비선형 형태 정규화 과정은 필기체 문자에서 발생하는 형태 변형을 보상하기 위하여 사용되며, 현재까지 이진 영상에 대한 비선형 형태 정규화 방법들이 제안되었다. 그러나 현존하는 대부분의 문자 인식 시스템은 스캐너를 통하여 입력된 명도 문자영상을 이진화하여 사용하고 있기 때문에 이진화로 인해 야기되는 물자 영상에 대한 정보 유실 및 잡영 첨가 현상이 비선형 형태 정규화 과정에 누적되어 결과적으로 좋은 특징 추출 결과를 기대하기 어려운 실정이다. 본 연구에서는 이진화에 의한 정보의 손실을 최소화시키고, 필기체 문자에서 발생하는 다양한 형태 변형을 효과적으로 보상할 수 있는 명도 영상에서의 비선형 형태 정규화 방법을 제안한다. 제안된 명도 영상에서의 비선형 형태 정규화 방법들의 성능을 객관적으로 검증하기 위하여 처리 시간 및 복잡도 등을 기준으로 평가하였으며, 다양한 명도 한글 글씨 데이터에 대한 실험을 통하여 이진 영상에서의 비선형 형태 정규화 방법에 비해 제안된 방법이 변형이 심한 한글 글씨 데이타의 품질을 개선하는데 있어서 매우 효율적임을 확인할 수 있었다.

  • PDF