Search | Korea Science

SEL-RefineMask: A Seal Segmentation and Recognition Neural Network with SEL-FPN

Dun, Ze-dong;Chen, Jian-yu;Qu, Mei-xia;Jiang, Bin
Journal of Information Processing Systems / v.18 no.3 / pp.411-427 / 2022
Extracting historical and cultural information from seals in ancient books is of great significance. However, ancient Chinese seal samples are scarce and carving methods are diverse, so traditional digital image processing methods based on greyscale have difficulty achieving good segmentation and recognition performance. Recently, some deep learning algorithms have been proposed to address this problem; however, current neural networks are difficult to train owing to the lack of datasets. To solve the aforementioned problems, we propose SEL-RefineMask, which combines a selector of feature pyramid network (SEL-FPN) with RefineMask to segment and recognize seals. We designed an SEL-FPN that intelligently selects specific layers, which represent different scales in the FPN, and reduces the number of anchor frames. We ran experiments with several instance segmentation networks as baselines, and the top-1 segmentation result of 64.93% is 5.73% higher than that of humans. The top-1 result of the SEL-RefineMask network reached 67.96%, which surpassed the baseline results. After segmentation, a vision transformer was used to recognize the segmentation output, and the accuracy reached 91%. Furthermore, a dataset of seals in ancient Chinese books (SACB) for segmentation and a small seal font (SSF) dataset for recognition were established; both are publicly available on the website.
https://doi.org/10.3745/JIPS.02.0174

Study on video character extraction and recognition (비디오 자막 추출 및 인식 기법에 관한 연구)

김종렬;김성섭;문영식
Proceedings of the IEEK Conference / 2001.06c / pp.141-144 / 2001
In this paper, a new algorithm is proposed for extracting and recognizing characters from video without prior knowledge such as the font, color, or size of the characters. To improve the recognition rate for videos with complex backgrounds at low resolution, continuous frames with an identical text region are automatically detected and combined into an average frame. Using boundary pixels of a text region as seeds, we apply region filling to remove the background from the characters. Then color clustering is applied to remove the remaining background, according to the verification of the region filling process. Features such as white run and zero-one transition from the center are extracted from unknown characters. These features are compared with a pre-composed character feature set to recognize the characters.
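The frame-averaging step described above can be illustrated with a minimal sketch. It assumes grayscale frames stored as nested lists and that the frames sharing a caption have already been detected; the function name `average_frames` and the sample data are hypothetical and are not taken from the paper.

```python
from typing import List

Frame = List[List[float]]  # grayscale frame as rows of pixel intensities


def average_frames(frames: List[Frame]) -> Frame:
    """Pixel-wise mean of equally sized frames (hypothetical helper)."""
    if not frames:
        raise ValueError("need at least one frame")
    height, width = len(frames[0]), len(frames[0][0])
    for frame in frames:
        if len(frame) != height or any(len(row) != width for row in frame):
            raise ValueError("all frames must share the same size")
    n = len(frames)
    return [
        [sum(frame[y][x] for frame in frames) / n for x in range(width)]
        for y in range(height)
    ]


# Column 0 is a static white caption pixel; column 1 is a changing background.
frames = [
    [[255, 10]],
    [[255, 200]],
    [[255, 90]],
]
averaged = average_frames(frames)
```

In this toy input the caption pixel keeps its value while the moving background is pulled toward its mean, which is why averaging makes the later background removal easier.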

Precise Detection of Car License Plates by Locating Main Characters

Lee, Dae-Ho;Choi, Jin-Hyuk
Journal of the Optical Society of Korea / v.14 no.4 / pp.376-382 / 2010
We propose a novel method to precisely detect car license plates by locating main characters, which are printed with large font size. The regions of the main characters are directly detected without detecting the plate region boundaries, so that license regions can be detected more precisely than by other existing methods. To generate a binary image, multiple thresholds are applied, and segmented regions are selected from multiple binarized images by a criterion of size and compactness. We do not employ any character matching methods, so that many candidates for main character groups are detected; thus, we use a neural network to reject non-main character groups from the candidates. The relation of the character regions and the intensity statistics are used as the input to the neural network for classification. The detection performance has been investigated on real images captured under various illumination conditions for 1000 vehicles. 980 plates were correctly detected, and almost all non-detected plates were so stained that their characters could not be isolated for character recognition. In addition, the processing time is fast enough for a commercial automatic license plate recognition system. Therefore, the proposed method can be used for recognition systems with high performance and fast processing.
https://doi.org/10.3807/JOSK.2010.14.4.376
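As a rough illustration of the multi-threshold idea in the abstract above, the sketch below binarizes an image at several thresholds and keeps connected regions that pass a size and compactness test. The abstract does not define compactness, so the ratio of region area to bounding-box area is used here as an assumption, as are dark characters on a lighter plate, 4-connectivity, and the names `components` and `candidate_regions`.

```python
from collections import deque
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (top, left, bottom, right), inclusive


def components(binary: List[List[bool]]) -> List[Tuple[int, Box]]:
    """Return (area, bounding box) for each 4-connected foreground region."""
    h, w = len(binary), len(binary[0])
    seen = [[False] * w for _ in range(h)]
    found = []
    for y0 in range(h):
        for x0 in range(w):
            if not binary[y0][x0] or seen[y0][x0]:
                continue
            seen[y0][x0] = True
            queue = deque([(y0, x0)])
            area, top, left, bottom, right = 0, y0, x0, y0, x0
            while queue:  # breadth-first flood fill of one region
                y, x = queue.popleft()
                area += 1
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y + 1, x), (y - 1, x), (y, x + 1), (y, x - 1)):
                    if 0 <= ny < h and 0 <= nx < w and binary[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            found.append((area, (top, left, bottom, right)))
    return found


def candidate_regions(image: List[List[int]], thresholds: List[int],
                      min_area: int, min_compactness: float) -> List[Box]:
    """Collect boxes of dark regions passing the (assumed) size and compactness tests."""
    boxes = set()
    for t in thresholds:
        binary = [[pixel < t for pixel in row] for row in image]
        for area, (top, left, bottom, right) in components(binary):
            box_area = (bottom - top + 1) * (right - left + 1)
            if area >= min_area and area / box_area >= min_compactness:
                boxes.add((top, left, bottom, right))
    return sorted(boxes)


image = [
    [200, 200, 200, 200, 200],
    [200, 20, 20, 200, 120],
    [200, 20, 20, 200, 200],
    [200, 200, 200, 200, 200],
]
regions = candidate_regions(image, thresholds=[50, 150], min_area=4, min_compactness=0.8)
```

Here the 2x2 dark block survives at both thresholds, while the isolated mid-gray pixel that appears only at the higher threshold is rejected by the size criterion. The neural-network rejection stage mentioned in the abstract is not modeled.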

Development of vision system for the recognition of character image which was included at the slab image (슬라브 영상에 포함된 문자영상의 인식을 위한 비전시스템의 개발)

Park, Sang-Gug
Journal of Korea Society of Industrial Information Systems / v.12 no.1 / pp.95-100 / 2007
In steel and iron processing lines, characters are marked on the surface of the material for material management. This paper describes the development of a vision system for recognizing the material management characters contained in slab images. The vision system comprises a CCD camera system that acquires the slab image, an optical transmission system that transmits the captured image over a long distance, an input and output system for interfacing with the existing system, and a monitoring system for checking recognition results. We installed the vision system at a continuous casting line and tested it. We also evaluated its durability, reliability, and recognition rate. Through testing, we confirmed that the system achieves a high recognition rate of 97.4%.

An implementation of the mixed type character recognition system using combNET (CombNET 신경망을 이용한 혼용 문서 인식 시스템의 구현)

최재혁;손영우;남궁재찬
The Journal of Korean Institute of Communications and Information Sciences / v.21 no.12 / pp.3265-3276 / 1996
Studies of document recognition have focused mainly on Korean documents. However, most documents are composed of Korean and other characters. In this paper, we therefore propose a document recognition system that can recognize multi-size, multi-font, and mixed-type characters. We use a large-scale network model, "CombNET", which consists of a four-layered network with a comb structure, and we propose a recognition method that can recognize characters without discriminating the character type. The first layer constitutes a Kohonen SOFM network, which quantizes the input feature vector space into several sub-spaces, and the following layers 2-4 constitute BP network modules, which classify the input data in each sub-space into specified categories. An experimental result demonstrated the usefulness of this approach, with a recognition rate of 95.6% for the training data. For mixed-type character documents, we obtained a recognition rate of 92.6% and a recognition speed of 10.3 characters per second.

Synthesis of Multiplexed MACE Filter for Optical Korean Character Recognition (인쇄체 한글의 광학적 인식을 위한 다중 MACE 필터의 합성)

김정우;김철수;배장근;도양회;김수중
The Journal of Korean Institute of Communications and Information Sciences / v.19 no.12 / pp.2364-2375 / 1994
For the efficient recognition of printed Korean characters, a multiplexed minimum average correlation energy (MMACE) filter is proposed. The proposed method overcomes the disadvantages of the tree-structure algorithm, whose recognition system is very large and whose recognition method is complicated. Using only one consonant MMACE filter and one vowel MMACE filter, we recognize full Korean characters. Each MMACE filter is multiplexed from four K-tuple MACE filters, which are synthesized from 24 consonants and vowels. Hence, the proposed MMACE filter and the correlation distribution plane are each divided into four subregions. We obtain the binary codes for Korean character recognition from each correlation distribution subplane, and the obtained codes are compared on a computer with the truth table for consonants and vowels. Full Korean characters are recognized by substituting the consonant or vowel font that corresponds to the matched code at the correlation peak location in the output correlation plane. Computer simulation and optical experiment results show that the proposed compact Korean character recognition system using the MMACE filters has high discrimination capability.
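For orientation, the single MACE filter that the multiplexed design builds on is usually written in the frequency domain as below. This is the standard formulation from the general correlation-filter literature, not the paper's multiplexed K-tuple synthesis, and the symbols are assumptions introduced here: $\mathbf{X}$ holds the Fourier transforms of the $N$ training images as columns, $\mathbf{D}$ is the diagonal matrix of their average power spectrum, $\mathbf{u}$ holds the desired correlation peak values, and $^{+}$ denotes the conjugate transpose.

```latex
% Minimize the average correlation energy subject to fixed peak values
\min_{\mathbf{h}} \; \mathbf{h}^{+}\mathbf{D}\,\mathbf{h}
\quad \text{subject to} \quad \mathbf{X}^{+}\mathbf{h} = \mathbf{u}

% Closed-form solution obtained with Lagrange multipliers
\mathbf{h} = \mathbf{D}^{-1}\mathbf{X}\left(\mathbf{X}^{+}\mathbf{D}^{-1}\mathbf{X}\right)^{-1}\mathbf{u}
```

Minimizing the average correlation energy while pinning the peak values is what yields the sharp correlation peaks that the abstract relies on for reading binary codes from each subplane.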

Study on News Video Character Extraction and Recognition (뉴스 비디오 자막 추출 및 인식 기법에 관한 연구)

김종열;김성섭;문영식
Journal of the Institute of Electronics Engineers of Korea SP / v.40 no.1 / pp.10-19 / 2003
Caption information in news videos can be useful for video indexing and retrieval since it usually suggests or implies the contents of the video very well. In this paper, a new algorithm for extracting and recognizing characters from news video is proposed, without a priori knowledge such as the font type, color, or size of the characters. In the process of text region extraction, in order to improve the recognition rate for videos with complex backgrounds at low resolution, continuous frames with identical text regions are automatically detected to compose an average frame. The image of the averaged frame is projected in the horizontal and vertical directions, and we apply region filling to remove backgrounds to produce the characters. Then, K-means color clustering is applied to remove the remaining backgrounds to produce the final text image. In the process of character recognition, simple features such as white run and zero-one transition from the center are extracted from unknown characters. These features are compared with the pre-composed character feature set to recognize the characters. Experimental results on various news videos show that the proposed method is superior in terms of caption extraction ability and character recognition rate.
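The horizontal and vertical projection mentioned above can be sketched as follows. The example assumes the averaged frame has already been binarized into a 0/1 image (1 for text pixels), and it uses the projections only to find the bounding rows and columns of the text. The helper names `projection_profiles` and `active_span` are hypothetical, and the abstract does not say exactly how the projections are used.

```python
from typing import List, Tuple


def projection_profiles(binary: List[List[int]]) -> Tuple[List[int], List[int]]:
    """Return (row sums, column sums) of a 0/1 image."""
    rows = [sum(row) for row in binary]
    cols = [sum(col) for col in zip(*binary)]
    return rows, cols


def active_span(profile: List[int], min_count: int = 1) -> Tuple[int, int]:
    """First and last index whose projection reaches min_count."""
    hits = [i for i, value in enumerate(profile) if value >= min_count]
    if not hits:
        raise ValueError("no text pixels found")
    return hits[0], hits[-1]


binary = [
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 1, 0],
    [0, 1, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
]
rows, cols = projection_profiles(binary)
top, bottom = active_span(rows)
left, right = active_span(cols)
```

The zero entry inside the column profile marks the gap between the two strokes, which is the kind of valley a projection-based method can use to separate characters.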

Learning Module Design for Neural Network Processor(ERNIE) (신경회로망칩(ERNIE)을 위한 학습모듈 설계)

Jung, Je-Kyo;Kim, Yung-Joo;Dong, Sung-Soo;Lee, Chong-Ho
Proceedings of the KIEE Conference / 2003.11b / pp.171-174 / 2003
In this paper, a learning module for a reconfigurable neural network processor (ERNIE) is proposed for on-chip learning. The existing reconfigurable neural network processor (ERNIE) performs much better than a software program, but it does not support on-chip learning. A learning module based on the back-propagation algorithm was designed to address this weakness. A pipeline structure lets the learning module update the weights rapidly and continuously. The learning module was evaluated with five types of alphabet fonts and compared with a neural network model programmed in C on a PC in terms of calculation speed and recognition correctness. The experiment shows that the neural network processor (ERNIE) with the learning module reduces the neural network training time at the same recognition rate compared with the software-based neural network model. The on-chip learning module shows that the reconfigurable neural network processor (ERNIE) could become an evolvable neural network processor that finds the optimal network configuration by itself.

A Study on the Recognition of Numerals for AGV Navigation Control (AGV 주행제어를 위한 숫자인식에 관한 연구)

박영만;박경우;안동순
Journal of the Korea Society of Computer and Information / v.8 no.2 / pp.1-7 / 2003
This study investigates character recognition based on image processing for an AGV that uses only color tape to mark guidelines, instead of the magnetic tape or electric wire used by existing AGVs. An AGV must follow given courses and stop upon recognizing signs, such as marks and numbers, that indicate destinations. In this study, the stop marks for the AGV were blue characters of the same font and size as those on number plates. Yellow driving lines and blue numeric characters were marked in corridors. The AGV ran using the characteristics of colors to detect lines, and stopped temporarily upon recognizing numbers, with a recognition rate of 100%, through DP pattern matching. This study presents the image processing technique and the results of operating the AGV.
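The abstract does not say which features are matched or how, so the following is only a generic sketch of DP (dynamic programming) matching, in the style of dynamic time warping. It assumes each numeral is summarized as a short sequence of numbers, for example a column profile. The function `dp_distance`, the templates, and the probe are all hypothetical.

```python
from typing import Dict, List


def dp_distance(a: List[float], b: List[float]) -> float:
    """Minimum cumulative |a[i] - b[j]| over monotonic alignments of a and b."""
    inf = float("inf")
    cost = [[inf] * (len(b) + 1) for _ in range(len(a) + 1)]
    cost[0][0] = 0.0
    for i in range(1, len(a) + 1):
        for j in range(1, len(b) + 1):
            local = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of: stretch b, stretch a, or advance both.
            cost[i][j] = local + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[len(a)][len(b)]


# Hypothetical templates: one short profile per numeral.
templates: Dict[str, List[float]] = {"1": [0, 5, 0], "7": [5, 1, 1]}
probe = [0, 0, 5, 5, 0]  # a stretched version of the "1" profile
best = min(templates, key=lambda label: dp_distance(probe, templates[label]))
```

Because the alignment may stretch either sequence, the probe matches the "1" template at zero cost even though the two differ in length, which is the property that makes DP matching tolerant of size changes as the AGV approaches a sign.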

Hangul Recognition Using a Hierarchical Neural Network (계층구조 신경망을 이용한 한글 인식)

최동혁;류성원;강현철;박규태
Journal of the Korean Institute of Telematics and Electronics B / v.28B no.11 / pp.852-858 / 1991
An adaptive hierarchical classifier (AHCL) for Korean character recognition using a neural net is designed. This classifier has two neural nets: USACL (Unsupervised Adaptive Classifier) and SACL (Supervised Adaptive Classifier). USACL has an input layer and an output layer, which are fully connected. The nodes in the output layer are generated by the unsupervised and nearest neighbor learning rule during learning. SACL has an input layer, a hidden layer, and an output layer. The input layer and the hidden layer are fully connected, and the hidden layer and the output layer are partially connected. The nodes in the SACL are generated by the supervised and nearest neighbor learning rule during learning. USACL has a pre-attentive effect: it performs a partial search instead of a full search during SACL classification to enhance processing speed. The input of USACL and SACL is a directional edge feature with a directional receptive field. To test the performance of the AHCL, various multi-font printed Hangul characters are used in learning and testing, and its processing speed and classification rate are compared with those of the conventional LVQ (Learning Vector Quantizer), which has the nearest neighbor learning rule.

Search Result 67, Processing Time 0.019 seconds
