• Title/Summary/Keyword: Intelligent Character Recognition

Search Result 62, Processing Time 0.026 seconds

An EIIiptic Approach to Learning Discriminants

  • Karbou, Fatiha;Karbou, Fatima;Karbou, M.
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 1998.06a
    • /
    • pp.153-157
    • /
    • 1998
  • It is wisely stated that the most valuable knowledge that a person cam acquire is the knowledge of how to learn. The human's learning is characterized by the ability to extract relationships between the different characters of a given situation. The ellipse is a first approach of comparison. We assimilate each character to a half axis of the ellipse and the result is a geometrical figure that varies according to values of the two characters. Thus, we take into account the two characters as an alone entity.

  • PDF

Illumination-Robust Foreground Extraction for Text Area Detection in Outdoor Environment

  • Lee, Jun;Park, Jeong-Sik;Hong, Chung-Pyo;Seo, Yong-Ho
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.11 no.1
    • /
    • pp.345-359
    • /
    • 2017
  • Optical Character Recognition (OCR) that has been a main research topic of computer vision and artificial intelligence now extend its applications to detection of text area from video or image contents taken by camera devices and retrieval of text information from the area. This paper aims to implement a binarization algorithm that removes user intervention and provides robust performance to outdoor lights by using TopHat algorithm and channel transformation technique. In this study, we particularly concentrate on text information of outdoor signboards and validate our proposed technique using those data.

Comparing object images using fuzzy-logic induced Hausdorff Distance (퍼지 논리기반 HAUSDORFF 거리를 이용한 물체 인식)

  • 강환일
    • Journal of Intelligence and Information Systems
    • /
    • v.6 no.1
    • /
    • pp.65-72
    • /
    • 2000
  • In this paper we propose the new binary image matching algorithm called the Fuzzy logic induced Hausdorff Distance(FHD) for finding the maximally matched image with the query image. The membership histogram is obtained by normalizing the cardinality of the subset with the corresponding radius after obtaining the distribution of the minimum distance computed by the Hausdroff distance between two binary images. in the proposed algorithm, The fuzzy influence method Center of Gravity(COG) is applied to calculate the best matching candidate in the membership function described above. The proposed algorithm shows the excellent results for the face image recognition when the noise is added to the query image as well as for the character recognition.

  • PDF

Robust Motorbike License Plate Detection and Recognition using Image Warping based on YOLOv2 (YOLOv2 기반의 영상워핑을 이용한 강인한 오토바이 번호판 검출 및 인식)

  • Dang, Xuan-Truong;Kim, Eung-Tae
    • Journal of Broadcast Engineering
    • /
    • v.24 no.5
    • /
    • pp.713-725
    • /
    • 2019
  • Automatic License Plate Recognition (ALPR) is a technology required for many applications such as Intelligent Transportation Systems and Video Surveillance Systems. Most of the studies have studied were about the detection and recognition of license plates on cars, and there is very little about detecting and recognizing license plates on motorbikes. In the case of a car, the license plate is located at the front or rear center of the vehicle and is a straight or slightly sloped license plate. Also, the background of the license plate is mainly monochromatic, and license plate detection and recognition process is less complicated. However since the motorbike is parked by using a kickstand, it is inclined at various angles when parked, so the process of recognizing characters on the motorbike license plate is more complicated. In this paper, we have developed a 2-stage YOLOv2 algorithm to detect the area of a license plate after detection of a motorbike area in order to improve the recognition accuracy of license plate for motorbike data set parked at various angles. In order to increase the detection rate, the size and number of the anchor boxes were adjusted according to the characteristics of the motorbike and license plate. Image warping algorithms were applied after detecting tilted license plates. As a result of simulating the license plate character recognition process, the proposed method had the recognition rate of license plate of 80.23% compared to the recognition rate of the conventional method(YOLOv2 without image warping) of 47.74%. Therefore, the proposed method can increase the recognition of tilted motorbike license plate character by using the adjustment of anchor boxes and the image warping which fit the motorbike license plate.

An Improved License Plate Recognition Technique in Outdoor Image (옥외영상의 개선된 차량번호판 인식기술)

  • Kim, Byeong-jun;Kim, Dong-hoon;Lee, Joonwhoan
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.26 no.5
    • /
    • pp.423-431
    • /
    • 2016
  • In general LPR(License Plate Recognition) in outdoor image is not so simple differently from in the image captured from manmade environment, because of geometric shape distortion and large illumination changes. this paper proposes three techniques for LPR in outdoor images captured from CCTV. At first, a serially connected multi-stage Adaboost LP detector is proposed, in which different complementary features are used. In the proposed detector the performance is increased by the Haar-like Adaboost LP detector consecutively connected to the MB-LBP based one in serial manner. In addition the technique is proposed that makes image processing easy by the prior determination of LP type, after correction of geometric distortion of LP image. The technique is more efficient than the processing the whole LP image without knowledge of LP type in that we can take the appropriate color to gray conversion, accurate location for separation of text/numeric character sub-images, and proper parameter selection for image processing. In the proposed technique we use DBN(Deep Belief Network) to achieve a robust character recognition against stroke loss and geometric distortion like slant due to the incomplete image processing.

Development of Intelligent OCR Technology to Utilize Document Image Data (문서 이미지 데이터 활용을 위한 지능형 OCR 기술 개발)

  • Kim, Sangjun;Yu, Donghui;Hwang, Soyoung;Kim, Minho
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.05a
    • /
    • pp.212-215
    • /
    • 2022
  • In the era of so-called digital transformation today, the need for the construction and utilization of big data in various fields has increased. Today, a lot of data is produced and stored in a digital device and media-friendly manner, but the production and storage of data for a long time in the past has been dominated by print books. Therefore, the need for Optical Character Recognition (OCR) technology to utilize the vast amount of print books accumulated for a long time as big data was also required in line with the need for big data. In this study, a system for digitizing the structure and content of a document object inside a scanned book image is proposed. The proposal system largely consists of the following three steps. 1) Recognition of area information by document objects (table, equation, picture, text body) in scanned book image. 2) OCR processing for each area of the text body-table-formula module according to recognized document object areas. 3) The processed document informations gather up and returned to the JSON format. The model proposed in this study uses an open-source project that additional learning and improvement. Intelligent OCR proposed as a system in this study showed commercial OCR software-level performance in processing four types of document objects(table, equation, image, text body).

  • PDF

The Study on Dynamic Images Processing for Finger Languages (지화 인식을 위한 동영상 처리에 관한 연구)

  • Kang, Min-Ji;Choi, Eun-Sook;Sohn, Young-Sun
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.14 no.2
    • /
    • pp.184-189
    • /
    • 2004
  • In this paper, we realized a system that receives the dynamic images of finger languages, which is the method of intention transmission of the hearing disabled person, using the white and black CCD camera, and that recognizes the images and converts them to the editable text document. We use the afterimage to draw a sharp line between indistinct images and clear images from a series of inputted images, and get the character alphabet from the away of continuous images and output the accomplished character to the word editor by applying the automata theory. After the system removes the varied wrist part from the data of clean image, it gets the controid point of hand by the maximum circular movement method and recognizes the hand that is necessary to analyze the finger languages by applying the circular pattern vector algorithm. The system abstracts the characteristic vectors of the hand using the distance spectrum from the center of the hand and it compares the characteristic vector of inputted pattern from the standard pattern by applying the fuzzy inference and recognizes the movement of finger languages.

Brain-Computer Interface based on Changes of EEG on Broca's Area (Broca 영역에서의 뇌파 변화에 기반한 뇌-컴퓨터 인터페이스)

  • Yeom, Hong-Gi;Jang, In-Hun;Sim, Kwee-Bo
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.19 no.1
    • /
    • pp.122-127
    • /
    • 2009
  • In this paper, we measured EEG signals on frontal and Broca's area when subjects imagine to speak A or B or C or D. These signals were analyzed by Event-Related Spectral Perturbation (ERSP), Inter-Trial Coherence (ITC) and Event Related Potential (ERP) methods. As a result, high coherences were showed at 1$\sim$13Hz during 0$\sim$300ms after the stimuli of each character and P300 was seen clearly and there are several differences between the ERP results. However, unlike the motivation of this study to classify the characters, it is impossible that we can classify each intention or each character cause these differences. Nevertheless, this paper suggest an application system using this results so BCI can provide various services.

Detection of Number and Character Area of License Plate Using Deep Learning and Semantic Image Segmentation (딥러닝과 의미론적 영상분할을 이용한 자동차 번호판의 숫자 및 문자영역 검출)

  • Lee, Jeong-Hwan
    • Journal of the Korea Convergence Society
    • /
    • v.12 no.1
    • /
    • pp.29-35
    • /
    • 2021
  • License plate recognition plays a key role in intelligent transportation systems. Therefore, it is a very important process to efficiently detect the number and character areas. In this paper, we propose a method to effectively detect license plate number area by applying deep learning and semantic image segmentation algorithm. The proposed method is an algorithm that detects number and text areas directly from the license plate without preprocessing such as pixel projection. The license plate image was acquired from a fixed camera installed on the road, and was used in various real situations taking into account both weather and lighting changes. The input images was normalized to reduce the color change, and the deep learning neural networks used in the experiment were Vgg16, Vgg19, ResNet18, and ResNet50. To examine the performance of the proposed method, we experimented with 500 license plate images. 300 sheets were used for learning and 200 sheets were used for testing. As a result of computer simulation, it was the best when using ResNet50, and 95.77% accuracy was obtained.

Construct OCR on mobile mechanic system for android wireless dynamics and structure stabilization

  • Shih, Bih-Yaw;Chen, Chen-Yuan;Su, Wei-Lun
    • Structural Engineering and Mechanics
    • /
    • v.42 no.5
    • /
    • pp.747-760
    • /
    • 2012
  • In today's online social structure, people with electronic devices or network have been closely related to whether any of the activities, work, school, etc., is related to electronic devices, intelligent robot, and network control. The best mobility and the first rich media of these products as smart phones, smart phones rise rapidly in recent years, high speed processing performance and high free way to install software, deeply loved by many business people. However, not only for smart phone business aspects of the use, but also can engage in education of the teachers or the students are learning a great help. This study construct OCR-assisted learning software written by the JAVA made, and the installation is provided by the Android mobile phone users.