• Title/Summary/Keyword: pattern matching algorithm

Search Result 309, Processing Time 0.035 seconds

An Efficient DNA Sequence Compression using Small Sequence Pattern Matching

  • Murugan., A;Punitha., K
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.8
    • /
    • pp.281-287
    • /
    • 2021
  • Bioinformatics is formed with a blend of biology and informatics technologies and it employs the statistical methods and approaches for attending the concerning issues in the domains of nutrition, medical research and towards reviewing the living environment. The ceaseless growth of DNA sequencing technologies has resulted in the production of voluminous genomic data especially the DNA sequences thus calling out for increased storage and bandwidth. As of now, the bioinformatics confronts the major hurdle of management, interpretation and accurately preserving of this hefty information. Compression tends to be a beacon of hope towards resolving the aforementioned issues. Keeping the storage efficiently, a methodology has been recommended which for attending the same. In addition, there is introduction of a competent algorithm that aids in exact matching of small pattern. The DNA representation sequence is then implemented subsequently for determining 2 bases to 6 bases matching with the remaining input sequence. This process involves transforming of DNA sequence into an ASCII symbols in the first level and compress by using LZ77 compression method in the second level and after that form the grid variables with size 3 to hold the 100 characters. In the third level of compression, the compressed output is in the grid variables. Hence, the proposed algorithm S_Pattern DNA gives an average better compression ratio of 93% when compared to the existing compression algorithms for the datasets from the UCI repository.

A Multiple Pattern Matching Scheme to Improve Rule Application Performance (규칙 적용 성능을 개선하기 위한 다중 패턴매칭 기법)

  • Lee, Jae-Kook;Kim, Hyong-Shik
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.18 no.3
    • /
    • pp.79-88
    • /
    • 2008
  • On the internet, the NIDS(Network Intrusion Detection System) has been widely deployed to protect the internal network. The NIDS builds a set of rules with analysis results on illegal packets and filters them using the rules, thus protecting the internal system. The number of rules is ever increasing as the attacks are becoming more widespread and well organized these days. As a result, the performance degradation has been found severe in the rule application fer the NIDS. In this paper, we propose a multiple pattern matching scheme to improve rule application performance. Then we compare our algorithm with Wu-Mantel algorithm which is known to do high performance multi-pattern matching.

High Throughput Parallel KMP Algorithm Considering CPU-GPU Memory Hierarchy (CPU-GPU 메모리 계층을 고려한 고처리율 병렬 KMP 알고리즘)

  • Park, Soeun;Kim, Daehee;Lee, Myungho;Park, Neungsoo
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.67 no.5
    • /
    • pp.656-662
    • /
    • 2018
  • Pattern matching algorithm is widely used in many application fields such as bio-informatics, intrusion detection, etc. Among many string matching algorithms, KMP (Knuth-Morris-Pratt) algorithm is commonly used because of its fast execution time when using large texts. However, the processing speed of KMP algorithm is also limited when the text size increases significantly. In this paper, we propose a high throughput parallel KMP algorithm considering CPU-GPU memory hierarchy based on OpenCL in GPGPU (General Purpose computing on Graphic Processing Unit). We focus on the optimization for the allocation of work-times and work-groups, the local memory copy of the pattern data and the failure table, and the overlapping of the data transfer with the string matching operations. The experimental results show that the execution time of the optimized parallel KMP algorithm is about 3.6 times faster than that of the non-optimized parallel KMP algorithm.

Order preserving matching with k mismatches (k개의 오차를 허용하는 순위 패턴 매칭)

  • Lee, Inbok
    • Smart Media Journal
    • /
    • v.9 no.2
    • /
    • pp.33-38
    • /
    • 2020
  • Order preserving matching refers to the problem of reporting substrings of a given text where there exists order isomorphism with the pattern. In this paper, we propose a new algorithm based on filtering and evaluation. The proposed algorithm is simple and easy to implement, and runs in linear time on average. Experimental results show that it works efficiently with real world data.

Development of an Integer Algorithm for Computation of the Matching Probability in the Hidden Markov Model (I) (은닉마르코브 모델의 부합확률연산의 정수화 알고리즘 개발 (I))

  • 김진헌;김민기;박귀태
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.31B no.8
    • /
    • pp.11-19
    • /
    • 1994
  • The matching probability P(ο/$\lambda$), of the signal sequence(ο) observed for a finite time interval with a HMM (Hidden Markov Model $\lambda$) indicates the probability that signal comes from the given model. By utilizing the fact that the probability represents matching score of the observed signal with the model we can recognize an unknown signal pattern by comparing the magnitudes of the matching probabilities with respect to the known models. Because the algorithm however uses floating point variables during the computing process hardware implementation of the algorithm requires floating point units. This paper proposes an integer algorithm which uses positive integer numbers rather than float point ones to compute the matching probability so that we can economically realize the algorithm into hardware. The algorithm makes the model parameters integer numbers by multiplying positive constants and prevents from divergence of data through the normalization of variables at each step. The final equation of matching probability is composed of constant terms and a variable term which contains logarithm operations. A scheme to make the log conversion table smaller is also presented. To analyze the qualitive characteristics of the proposed algorithm we attatch simulation result performed on two groups of 10 hypothetic models respectively and inspect the statistical properties with repect to the model order the magnitude of scaling constants and the effect of the observation length.

  • PDF

A New S/W Architecture for YARA Speed Enhancement (YARA 속도 개선을 위한 새로운 S/W 구조설계)

  • Kim, Chang Hoon
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.41 no.12
    • /
    • pp.1858-1860
    • /
    • 2016
  • In this paper, a modified YARA software architecture that can perform pattern matching for multi-rule files is proposed. Based on a improved scanning thread algorithm, the new design reduces memory loading time of rule files for pattern matching. Therefore, the proposed architecture can reduce operation time for pattern matching while it requires an increased memory in proportion to the number of rule files.

New Matching Scheme for Panorama Image: A Simulation Study

  • Kim, Jeong-Seok;Chung, Sung-Taek;Hong, In-Ki
    • Journal of Biomedical Engineering Research
    • /
    • v.28 no.1
    • /
    • pp.127-131
    • /
    • 2007
  • This paper presents a new matching scheme for creating a single panoramic image from a sequence of partially overlapping images of the same object or scene. This matching scheme is based directly on the searching algorithm, using a multiscale approach to the Hooke-Jeeves algorithm. Matching scheme evaluation was performed using simulated pattern images. The proposed matching scheme reveals good results and could be effectively applied to real ultrasound applications.

Optimal Search Patterns for Fast Block Matching Motion Estimation (고속 블록정합 움직임 추정을 위한 최적의 탐색 패턴)

  • 임동근;호요성
    • Proceedings of the IEEK Conference
    • /
    • 2000.06d
    • /
    • pp.39-42
    • /
    • 2000
  • Motion estimation plays an important role for video coding. In this paper, we derive optimal search patterns for fast block matching motion estimation. By analyzing the block matching algorithm as a function of block shape and size, we can find an optimal search pattern for initial motion estimation. The proposed idea, which has been verified experimentally by computer simulations, can provide an analytical basis for the current MPEG-2 proposals. In order to choose a more compact search pattern for BMA, we exploit the statistical relationship between the motion and the frame difference of each block.

  • PDF

A Study on Algorithm of Phonemes Extraction in Korean Character Pattern Recognition (한글 인식에서 자소 추출에 관한 연구)

  • 정영화;김은진;김정선
    • Proceedings of the Korean Institute of Communication Sciences Conference
    • /
    • 1985.10a
    • /
    • pp.109-112
    • /
    • 1985
  • This paper proposes a algorithm of phonemes extraction in korean character pattern recognition. The phonemes are classified into the patterns which are separable and connected with each other. The former is extracted by means of pattern matching in consideration of topological structure of ponemes and direction of stroke sequentially. The latter is extracted by means of index and window algorithm which are performed by a 3$\times$3 sequential local operation in the thinned character pattern.

  • PDF

Study on Faults Diagnosis of Nuclear Pressure Boundary Components using Pattern Recognition of Nuclear Power Plant Simulator Data (원자력발전소 시뮬레이터 데이터의 패턴인식을 이용한 압력경계기기 고장 진단 연구)

  • Ahn, Hongmin;Choi, Hyunwoo;Kang, Seongki;Chai, Jangbom
    • Transactions of the Korean Society of Pressure Vessels and Piping
    • /
    • v.13 no.1
    • /
    • pp.48-53
    • /
    • 2017
  • We diagnosed the defect using the data obtained from the nuclear power plant simulator. In this paper, we diagnosed faults in the nuclear power plant system for discovery instead of the traditional single-component or device unit. We created the six fault scenarios and used a fault simulator to obtain the fault data. It was extracted pattern from acquired failure data. Neural network model was trained and simple pattern matching algorithm was applied. We presented a simulation result and confirmed that the applied algorithm works correctly.