통합 검색 | Korea Science

Arabic Words Extraction and Character Recognition from Picturesque Image Macros with Enhanced VGG-16 based Model Functionality Using Neural Networks

Ayed Ahmad Hamdan Al-Radaideh;Mohd Shafry bin Mohd Rahim;Wad Ghaban;Majdi Bsoul;Shahid Kamal;Naveed Abbas
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제17권7호
- /
- pp.1807-1822
- /
- 2023
Innovation and rapid increased functionality in user friendly smartphones has encouraged shutterbugs to have picturesque image macros while in work environment or during travel. Formal signboards are placed with marketing objectives and are enriched with text for attracting people. Extracting and recognition of the text from natural images is an emerging research issue and needs consideration. When compared to conventional optical character recognition (OCR), the complex background, implicit noise, lighting, and orientation of these scenic text photos make this problem more difficult. Arabic language text scene extraction and recognition adds a number of complications and difficulties. The method described in this paper uses a two-phase methodology to extract Arabic text and word boundaries awareness from scenic images with varying text orientations. The first stage uses a convolution autoencoder, and the second uses Arabic Character Segmentation (ACS), which is followed by traditional two-layer neural networks for recognition. This study presents the way that how can an Arabic training and synthetic dataset be created for exemplify the superimposed text in different scene images. For this purpose a dataset of size 10K of cropped images has been created in the detection phase wherein Arabic text was found and 127k Arabic character dataset for the recognition phase. The phase-1 labels were generated from an Arabic corpus of quotes and sentences, which consists of 15kquotes and sentences. This study ensures that Arabic Word Awareness Region Detection (AWARD) approach with high flexibility in identifying complex Arabic text scene images, such as texts that are arbitrarily oriented, curved, or deformed, is used to detect these texts. Our research after experimentations shows that the system has a 91.8% word segmentation accuracy and a 94.2% character recognition accuracy. We believe in the future that the researchers will excel in the field of image processing while treating text images to improve or reduce noise by processing scene images in any language by enhancing the functionality of VGG-16 based model using Neural Networks.
https://doi.org/10.3837/tiis.2023.07.004 인용 PDF HTML

서포트벡터머신과 정칙화판별함수를 이용한 비디오 문자인식의 분류 성능 개선 (Video character recognition improvement by support vector machines and regularized discriminant analysis)

임수열;백장선;김민수
- Journal of the Korean Data and Information Science Society
- /
- 제21권4호
- /
- pp.689-697
- /
- 2010
본 연구에서는 비디오이미지로부터 추출된 텍스트영역으로부터 문자인식을 수행하였다. 비디오영상으로부터 추출된 문자열은 한글, 영어, 숫자, 특수문자 등으로 혼합되어 있거나, 또는 다양한 폰트와 크기, 그래픽 형태의 글자 존재, 영상의 기울어짐, 끊김, 잡영, 접촉, 저해상도의 글자 등으로 인하여 일반적인 문자인식에 비해 많은 어려움이 존재한다. 이와 같은 어려움을 극복하기위해 본 연구에서는 모든 글자에 대해서 인식하지 않고 가장 빈번하게 등장하는 글자만을 인식하고 나머지는 버리는 방법을 사용하였으며 지지도벡터기계와 정칙화판별분석의 2단계 문자인식 방법을 이용하여 인식률을 개선하였다. 또한 인식률이 좋지 못한 4형식과 5형식 글자에 대해 모음별로 중분류를 실시하였다. 실험결과 지지도벡터기계와 정칙화판별분석을 동시에 사용하는 방법이 다른 문자인식의 방법들보다 인식률이 우수하였으며, 부분적인 중분류의 방법을 이용한 경우 향상된 인식 성능을 나타냈다.
PDF KSCI

Mellin 변환 방식과 BPEJTC를 이용한 영상 문자 인식 (Image Character Recognition using the Mellin Transform and BPEJTC)

서춘원;고성원;이병선
- 조명전기설비학회논문지
- /
- 제17권4호
- /
- pp.26-35
- /
- 2003
자연계에서 다양한 형태로 입력되는 물체 영상을 효과적으로 인식하려면, 물체의 위치, 회전, 크기 변화에 관계없이 인식할 수 있는 왜곡 불변 특성의 추출이 반드시 요구된다. 이러한 왜곡 불변 특성은 동일한 영상의 변화에 대하여 인식 특성이 같고, 서로 다른 영상의 변화에 대해서는 분리 식별이 용이해야 한다. 이러한 인식 특성을 얻기위해 다각도로 많은 연구가 진행되고 있으며, 특히 회전 및 크기에 불변 특성을 동시에 얻을 수 있는 Mellin변환을 이용한 방법 등이 영상 인식에 많이 이용되고 있다[1][2][3]. 따라서, 본 논문에서는 Mellin 변환 방법에 의한 크기 및 회전에 대한 불변 특성을 얻을 수 있는 문자 인식 시스템을 위한 문자 특징 추출 방법을 제시하고자 하였으며, 영문자 26 문자의 입력 영상에 대하여 무게 중심법에 의한 문자 이동과 Mellin 변환 방법에 의한 특징 추출 방법에 보간법을 이용하여 특징을 추출하였으며, 추출된 특징에 대하여 특징의 이질도를 검사하여, 각 특징의 이질도가 약 50% 이상의 결과를 얻었다. 또한, Mellin 변환 방법에 의해 추출된 특징을 기준 영상으로 하는 BPEJTC(Binary Phase Extraction Joint Transform Correlator)를 이용하여 크기, 회전 및 이동에 따른 입력 문자의 인식이 가능한 BPEJTC 시스템을 구현하였으며, 이에 따라 본 논문에서는 약 90%의 인식률을 얻을 수 있었다. 따라서 본 논문에서 제시하는 Mellin 변환 방법에 따라 추출된 문자의 특징과 BPEJTC를 이용하여 영상 문자를 인식할 수 있는 영상 문자 인식 시스템의 가능성을 제시하였다.
https://doi.org/10.5207/JIEIE.2003.17.4.026 인용 PDF KSCI

금융 장표 자동 처리를 위한 인식 시스템 개발 (Development of a Recognition System for Automatic Giro Processing)

황재원;이만희;장동식
- 산업공학
- /
- 제13권2호
- /
- pp.188-194
- /
- 2000
A pattern recognition system is proposed to recognize characters in any type of Giro. The system consist of the character segmentation and the character recognition. Positional features from two round markers at the upper-right part and lower-left part of Giro is used for extracting character strings from images and RLE analysis is used if there are no round markers. A multi step combined method, which use a structural method and a statistical method, is used to improve recognition. The structural method apply rules on each characters, whereas a statistical method gives a different weighting vector to each pixel for improving the classification performance in regard to noises and distortions. The experimental results show that the proposed combined method has higher recognition rate, over than 98% even in cases that images are rotated about 10 degrees as well as have noises.
PDF

Online Recognition of Handwritten Korean and English Characters

Ma, Ming;Park, Dong-Won;Kim, Soo Kyun;An, Syungog
- Journal of Information Processing Systems
- /
- 제8권4호
- /
- pp.653-668
- /
- 2012
In this study, an improved HMM based recognition model is proposed for online English and Korean handwritten characters. The pattern elements of the handwriting model are sub character strokes and ligatures. To deal with the problem of handwriting style variations, a modified Hierarchical Clustering approach is introduced to partition different writing styles into several classes. For each of the English letters and each primitive grapheme in Korean characters, one HMM that models the temporal and spatial variability of the handwriting is constructed based on each class. Then the HMMs of Korean graphemes are concatenated to form the Korean character models. The recognition of handwritten characters is implemented by a modified level building algorithm, which incorporates the Korean character combination rules within the efficient network search procedure. Due to the limitation of the HMM based method, a post-processing procedure that takes the global and structural features into account is proposed. Experiments showed that the proposed recognition system achieved a high writer independent recognition rate on unconstrained samples of both English and Korean characters. The comparison with other schemes of HMM-based recognition was also performed to evaluate the system.
https://doi.org/10.3745/JIPS.2012.8.4.653 인용 PDF KSCI

자. 모 해석적 모델에 의한 고정도 한글 인식 알고리즘에 관한 연구 - 패턴정합법에 기초한 후보문자 선정 및 구조해석적인 방법에 의한 유사문자 판별 - (A Study on the Highly Accurate Korean Character Recognition Algorithm, by analyzing Vowel and Consonant Models - Selectiong of candidates using pattern matching method and discriminating similar characters by structural analysis -)

강선미;김봉석;김덕진
- 전자공학회논문지B
- /
- 제30B권7호
- /
- pp.24-30
- /
- 1993
In this paper, a new method is proposed to recognize a character from its similar characters, which are selected by pattern matching method in Korean character recognition. This new method, which couples the merits of already suggested methods, can choose the character to be in the candidate set and discriminate it from the others correctly. To evaluate performance of this algorithm, we used 15 kinds of different laser printer fonts and obtained about 97% of recognition rate.
PDF

신경 진동자를 이용한 한글 문자의 인식 속도의 개선에 관한 연구 (A study for improvement of Recognition velocity of Korean Character using Neural Oscillator)

Kwon, Yong-Bum;Lee, Joon-Tark
- 한국지능시스템학회:학술대회논문집
- /
- 한국퍼지및지능시스템학회 2004년도 춘계학술대회 학술발표 논문집 제14권 제1호
- /
- pp.491-494
- /
- 2004
Neural Oscillator can be applied to oscillatory systems such as the image recognition, the voice recognition, estimate of the weather fluctuation and analysis of geological fluctuation etc in nature and principally, it is used often to pattern recoglition of image information. Conventional BPL(Back-Propagation Learning) and MLNN(Multi Layer Neural Network) are not proper for oscillatory systems because these algorithm complicate Learning structure, have tedious procedures and sluggish convergence problem. However, these problems can be easily solved by using a synchrony characteristic of neural oscillator with PLL(phase-Locked Loop) function and by using a simple Hebbian learning rule. And also, Recognition velocity of Korean Character can be improved by using a Neural Oscillator's learning accelerator factor η$\_$ij/
PDF

Robust Multi-Layer Hierarchical Model for Digit Character Recognition

Yang, Jie;Sun, Yadong;Zhang, Liangjun;Zhang, Qingnian
- Journal of Electrical Engineering and Technology
- /
- 제10권2호
- /
- pp.699-707
- /
- 2015
Although digit character recognition has got a significant improvement in recent years, it is still challenging to achieve satisfied result if the data contains an amount of distracting factors. This paper proposes a novel digit character recognition approach using a multi-layer hierarchical model, Hybrid Restricted Boltzmann Machines (HRBMs), which allows the learning architecture to be robust to background distracting factors. The insight behind the proposed model is that useful high-level features appear more frequently than distracting factors during learning, thus the high-level features can be decompose into hybrid hierarchical structures by using only small label information. In order to extract robust and compact features, a stochastic 0-1 layer is employed, which enables the model's hidden nodes to independently capture the useful character features during training. Experiments on the variations of Mixed National Institute of Standards and Technology (MNIST) dataset show that improvements of the multi-layer hierarchical model can be achieved by the proposed method. Finally, the paper shows the proposed technique which is used in a real-world application, where it is able to identify digit characters under various complex background images.
https://doi.org/10.5370/JEET.2015.10.2.699 인용 PDF KSCI KPUBS HTML

영상처리기술을 이용한 핵 연료봉 문자 자동인식시스템 개발 (Development of Automatic Nuclear Fuel Rod Character Recognition System Based on Image Processing Technique)

Woong Ki Kim;Yong Bum Lee;Jong Min Lee;Sung IL Chien
- Nuclear Engineering and Technology
- /
- 제25권3호
- /
- pp.424-429
- /
- 1993
핵연료 소결체가 장전되는 핵연료봉의 끝부분에는 각각의 핵연료봉을 구분해주는 고유의 문자가 인쇄되어 있다. 핵연료 집합체 제조 과정에서 각각의 핵연료봉은 고유 문자에 의해 구분되어 체계적으로 관리되고 있으며 아울러 핵연료 연소 이상상태 감시 및 사용후 핵연료 검사 분야에서 핵연료봉 제조과정 추적에 이용되고 있다. 핵연료봉 문자 자동인식은 핵연료 집합체 제조과정의 자동화를 위한 핵심 기술이다. 본 연구에서는 핵연료봉 문자인식 시스템을 개발하여, 핵 연료봉단에 기록된 각 문자로 부터 추출한 메쉬 특징값을 데이타베이스에 저장된 특정 문자의 특징값과 비교하여 자동으로 문자인식을 수행하도록 하였다. 실험 결과, 95.83 퍼센트의 양호한 인식률을 기록하였다.
PDF

명암과 색상 정보를 이용한 번호판 인식 (Recognition of License Plate with Brightness and Tone of Color Data)

이승수;이기성
- 대한전기학회:학술대회논문집
- /
- 대한전기학회 2003년도 학술회의 논문집 정보 및 제어부문 B
- /
- pp.528-531
- /
- 2003
Recognition of licence plate becomes a key issue to many traffic related application such as road traffic monitoring or parking lots access control. In this paper, the brightness, YIQ and HSI methods were used to locate a license. After the characters in license plate were extracted, template matching method was applied for character recognitions. To test the performance of the proposed algorithm, images of seventy vehicle were tested. The success rates for license plate and character recognition were approximately 99%, and 96%, respectively
PDF

검색결과 574건 처리시간 0.038초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)