• Title/Summary/Keyword: Character Matching

Search Result 155, Processing Time 0.028 seconds

Regular Expression Matching Processor Architecture Supporting Character Class Matching (문자클래스 매칭을 지원하는 정규표현식 매칭 프로세서 구조)

  • Yun, SangKyun
    • Journal of KIISE
    • /
    • v.42 no.10
    • /
    • pp.1280-1285
    • /
    • 2015
  • Many hardware-based regular expression matching architectures are proposed for high performance matching. In particular, regular expression processors such as ReCPU and SMPU perform pattern matching in a similar approach to that used in general purpose processors, which provide the flexibility when updating patterns. However, these processors are inefficient in performing class matching since they do not provide character class matching capabilities. This paper proposes an instruction set and architecture of a regular expression matching processor, which can support character class matching. The proposed processor can efficiently perform character class matching since it includes character class, character range, and negated character class matching capabilities.

Architecture for Efficient Character Class Matching in Regular Expression Processor (정규표현식 프로세서에서의 효율적 문자 클래스 매칭을 위한 구조)

  • Yun, SangKyun
    • Journal of IKEEE
    • /
    • v.22 no.1
    • /
    • pp.87-92
    • /
    • 2018
  • Like CPUs, regular expression processors that perform regular expression pattern matching using instructions have been proposed recently. Of these, only REMPc provides features for character class matching. In this paper, we propose an architecture for efficient character class matching in a regular expression processor, which use character class bitmap format in a instruction operand field and implement the hard-wired character class comparator for several frequently used character classes. Using the proposed method, most of the character classes used in Snort rule can be represented by an operand or an instruction. Thus, character class matching can be performed more efficiently in the proposed archiecture than in REMPc.

String Matching Algorithm on Multi-byte Character Set Texts (다중바이트 문자집합 텍스트에서의 문자열 검색 알고리즘)

  • Kim, Eun-Sang;Kim, Jin-Wook;Park, Kun-Soo
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.16 no.10
    • /
    • pp.1015-1019
    • /
    • 2010
  • An extensive research on exact string matching has been done, but there have been few researches on the matching in multi-byte character set texts such as EUC~KR. This paper shows that false matches may occur in multi-byte character set texts such as EUC-KR when using KMP algorithm, and presents a refined KMP algorithm without false matches applying a character-based prefix function. And also, Experimental results show that our algorithm is faster than string matching algorithms of widely used editors, Vim and Emacs, and the existing automata-based algorithm.

Robust Character Image Retrieval Method Using Bipartite Matching (Bipartite Matching을 이용한 강인한 캐릭터 영상 검색 방법)

  • 이상엽;김회율
    • Journal of Broadcast Engineering
    • /
    • v.7 no.2
    • /
    • pp.136-144
    • /
    • 2002
  • In this paper, a novel approach that makes use of both shape and color information to retrieve character images in terms of similarity distance from a large-capacity image database or from a streaming image database, in particular, character image logo or trademark. In order to combine both features of completely different characteristics bipartite matching has been employed in computing similarity distance, The proposed method turned out to bealso very effective in matching natural object or human-drawn images whose shape varies substantially.

Keyword Spotting on Hangul Document Images Using Character Feature Models (문자 별 특징 모델을 이용한 한글 문서 영상에서 키워드 검색)

  • Park, Sang-Cheol;Kim, Soo-Hyung;Choi, Deok-Jai
    • The KIPS Transactions:PartB
    • /
    • v.12B no.5 s.101
    • /
    • pp.521-526
    • /
    • 2005
  • In this Paper, we propose a keyword spotting system as an alternative to searching system for poor quality Korean document images and compare the Proposed system with an OCR-based document retrieval system. The system is composed of character segmentation, feature extraction for the query keyword, and word-to-word matching. In the character segmentation step, we propose an effective method to remove the connectivity between adjacent characters and a character segmentation method by making the variance of character widths minimum. In the query creation step, feature vector for the query is constructed by a combination of a character model by typeface. In the matching step, word-to-word matching is applied base on a character-to-character matching. We demonstrated that the proposed keyword spotting system is more efficient than the OCR-based one to search a keyword on the Korean document images, especially when the quality of documents is quite poor and point size is small.

An Approach for Efficient Handwritten Word Recognition Using Dynamic Programming Matching (동적 프로그래밍 정합을 이용한 효율적인 필기 단어 인식 방법)

  • 김경환
    • Journal of the Korean Institute of Telematics and Electronics C
    • /
    • v.36C no.4
    • /
    • pp.54-64
    • /
    • 1999
  • This paper proposes an efficient handwritten English word recognition scheme which can be applied practical applications. To effectively use the lexicon which is available in most handwriting related applications, the lexicon entries are introduced in the early stage of the recognition. Dynamic programming is used for matching between over-segmented character segments and letters in the lexicon entries. Character segmentation statistics which can be obtained while the training is being performed are used to adjust the matching window size. Also, the matching results between the character segments and the letters in the lexicon entries are cached to avoid repeat of the same computation. In order to verify the effectiveness of the proposed methods, several experiments were performed using thousands of word images with various writing styles. The results show that the proposed methods significantly improve the matching speed as well as the accuracy.

  • PDF

A Study on the Highly Accurate Korean Character Recognition Algorithm, by analyzing Vowel and Consonant Models - Selectiong of candidates using pattern matching method and discriminating similar characters by structural analysis - (자. 모 해석적 모델에 의한 고정도 한글 인식 알고리즘에 관한 연구 - 패턴정합법에 기초한 후보문자 선정 및 구조해석적인 방법에 의한 유사문자 판별 -)

  • 강선미;김봉석;김덕진
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.30B no.7
    • /
    • pp.24-30
    • /
    • 1993
  • In this paper, a new method is proposed to recognize a character from its similar characters, which are selected by pattern matching method in Korean character recognition. This new method, which couples the merits of already suggested methods, can choose the character to be in the candidate set and discriminate it from the others correctly. To evaluate performance of this algorithm, we used 15 kinds of different laser printer fonts and obtained about 97% of recognition rate.

  • PDF

On-Line Recognition of Cursive Hangeul by Extended DP Matching Method (擴張된 DP 매칭법에 依한 흘림체 한글 온라인 認識)

  • Lee, Hee-Dong;Kim, Tae-Kyun;Agui, Takeshi;Nakajima, Masayuki
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.26 no.1
    • /
    • pp.29-37
    • /
    • 1989
  • This paper presents an application of the extended DP matching method to the on-line recognition of cursive Hangeul (Korean characters). We decrease the number of matching's objects by performing rough classification matching which makes the best use of features in the first and the last segment of Hangeul. By adding the extraction function of the basic character patterns to DP matching method, we try to calculate precisely the difference among Hangeul. The extraction of the basic character patterns is done by examining the features of segments in character. Applying the extended DP matching method to the on-line recognition of cursive Hangeul, absorption of writing motion and stable separation of strokes can be performed with flexibility.

  • PDF

A Study on Type Classification and Recognition Using Structural Information in Character Pattern of HANGEUL Shape (한글 Shape 문자 Pattern에서의 구조적 정보를 이용한 형식분류와 인식 관한 연구)

  • 전종익;조용주;남궁재찬
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.16 no.2
    • /
    • pp.180-195
    • /
    • 1991
  • In this paper, we studied on new method of recognition using structural information to recognize character pattern in orginal shape of Hangeul. First, for the purpose of knowing location of character in input image. it processed Making block. Second, after we investigated. whether vertical vowel exited or not in character image accordingly the center of gravity of Hangeul. each character was classified into Type of Hangeul by searching location and length for horizontal vowel and short pole. Last, we processed it by means of template matching which calculate Uclid's distance on each Jaso in accordance to type classified. This paper made an experiment on 2350 characters and obtained 98.3% classifing rate and 95.2% recognizing rate.

  • PDF

Development Character Recognition Algorithm in Gerber File for the PCB Assembly Machine (PCB 조립 장비를 위한 거버 문자 인식 알고리즘 개발)

  • 김철한;박태형
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2000.10a
    • /
    • pp.297-297
    • /
    • 2000
  • This paper proposed character recognition method by using DB Matching and Artificial Neural Network at the Gerber files. Gerber files are file for make PCB. But we also use the file to a program of extraction PCB position data. If the Gerber file recognized a character, the extraction PCB position data will be faster and also when the recognition rate is high, it can be possible to automatic extraction. We apply to the construction PCB Gerber file program and Simulation results are presented to verify the usefulness of the method.

  • PDF