• Title/Abstract/Keywords: Pattern matching Algorithm

309 search results (processing time: 0.023 s)

An Efficient DNA Sequence Compression using Small Sequence Pattern Matching

  • Murugan, A.; Punitha, K.
    • International Journal of Computer Science & Network Security / Vol. 21, No. 8 / pp.281-287 / 2021
  • Bioinformatics blends biology and information technology, applying statistical methods to problems in nutrition, medical research, and the study of the living environment. The steady growth of DNA sequencing technologies has produced voluminous genomic data, especially DNA sequences, which calls for more storage and bandwidth. Bioinformatics currently faces the major hurdle of managing, interpreting, and accurately preserving this large volume of information, and compression is a promising way to address it. To use storage efficiently, this paper proposes a compression methodology together with an efficient algorithm for exact matching of small patterns. The DNA sequence representation is used to find matches of 2 to 6 bases in the remaining input sequence. The process transforms the DNA sequence into ASCII symbols in the first level, compresses it with the LZ77 method in the second level, and then forms grid variables of size 3 to hold 100 characters. In the third level of compression, the compressed output is held in the grid variables. The proposed algorithm, S_Pattern DNA, achieves a better average compression ratio of 93% compared with existing compression algorithms on datasets from the UCI repository.

A Multiple Pattern Matching Scheme to Improve Rule Application Performance

  • 이재국;김형식
    • 정보보호학회논문지 / Vol. 18, No. 3 / pp.79-88 / 2008
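  • Intrusion detection systems are widely used to protect internal networks in the Internet environment. An intrusion detection system analyzes the characteristics of abnormal packets to generate rules, and protects internal systems by filtering packets with those rules. As attacks have recently become more frequent and more structured, the number of rules needed to detect them has grown steadily. Consequently, the performance degradation that an intrusion detection system suffers while applying rules has also grown. This paper proposes a multiple pattern matching scheme that uses multiple sub-patterns to improve string search, which carries a relatively large overhead in rule application. Its performance is compared with that of the Wu-Manber algorithm, a representative high-performance multiple pattern matching algorithm, and the results are presented.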

High Throughput Parallel KMP Algorithm Considering CPU-GPU Memory Hierarchy

  • 박소은;김대희;이명호;박능수
    • 전기학회논문지 / Vol. 67, No. 5 / pp.656-662 / 2018
  • Pattern matching algorithms are widely used in many application fields such as bioinformatics and intrusion detection. Among the many string matching algorithms, the KMP (Knuth-Morris-Pratt) algorithm is commonly used because of its fast execution time on large texts. However, the processing speed of the KMP algorithm is also limited when the text size increases significantly. In this paper, we propose a high throughput parallel KMP algorithm that considers the CPU-GPU memory hierarchy, based on OpenCL for GPGPU (General-Purpose computing on Graphics Processing Units). We focus on optimizing the allocation of work-items and work-groups, copying the pattern data and the failure table to local memory, and overlapping the data transfer with the string matching operations. The experimental results show that the optimized parallel KMP algorithm runs about 3.6 times faster than the non-optimized parallel KMP algorithm.
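
The failure table mentioned in the abstract is the core of KMP. As a point of reference, here is a minimal sequential sketch in Python. It is not the paper's OpenCL implementation, and the function names `build_failure_table` and `kmp_search` are illustrative assumptions.

```python
def build_failure_table(pattern: str) -> list[int]:
    """failure[i] is the length of the longest proper prefix of
    pattern[:i + 1] that is also a suffix of it."""
    failure = [0] * len(pattern)
    k = 0  # length of the currently matched prefix
    for i in range(1, len(pattern)):
        while k > 0 and pattern[i] != pattern[k]:
            k = failure[k - 1]  # fall back to a shorter border
        if pattern[i] == pattern[k]:
            k += 1
        failure[i] = k
    return failure


def kmp_search(text: str, pattern: str) -> list[int]:
    """Return the start index of every occurrence of pattern in text."""
    if not pattern:
        return []
    failure = build_failure_table(pattern)
    matches = []
    k = 0  # number of pattern characters matched so far
    for i, ch in enumerate(text):
        while k > 0 and ch != pattern[k]:
            k = failure[k - 1]
        if ch == pattern[k]:
            k += 1
        if k == len(pattern):
            matches.append(i - k + 1)
            k = failure[k - 1]  # keep going to find overlapping matches
    return matches
```

A parallel version would typically split the text into chunks that overlap by `len(pattern) - 1` characters and run this scan per chunk; how the paper partitions the work is not stated in the abstract.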

Order preserving matching with k mismatches

  • 이인복
    • 스마트미디어저널 / Vol. 9, No. 2 / pp.33-38 / 2020
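  • The order-preserving pattern matching problem is, given a pattern and a text, to find the substrings of the text that are order-isomorphic to the pattern. This paper proposes an algorithm for the variant of order-preserving matching that allows k mismatches. The proposed algorithm is simpler and easier to implement than existing algorithms, and has linear time complexity in the average case. Experiments also show that the proposed algorithm runs efficiently on realistic data.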
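
To make the problem statement concrete, the following is a naive Python sketch, not the algorithm proposed in the paper. It rests on two assumptions that the abstract does not spell out: a window matches with k mismatches if ignoring at most k positions leaves the rest order-isomorphic to the pattern, and all values within the pattern and within each window are distinct. Under those assumptions, the largest order-isomorphic subset of positions equals a longest increasing subsequence. The function names are illustrative.

```python
from bisect import bisect_left


def lis_length(values: list[int]) -> int:
    """Length of the longest strictly increasing subsequence."""
    tails: list[int] = []  # tails[j] = smallest tail of an increasing run of length j + 1
    for v in values:
        pos = bisect_left(tails, v)
        if pos == len(tails):
            tails.append(v)
        else:
            tails[pos] = v
    return len(tails)


def op_mismatches(window: list[int], pattern: list[int]) -> int:
    """Fewest positions to ignore so that window and pattern become
    order-isomorphic (assumes distinct values in each sequence)."""
    # Visit positions in increasing order of pattern value; the positions
    # that can be kept are those whose window values also increase.
    by_pattern = sorted(range(len(pattern)), key=lambda i: pattern[i])
    return len(pattern) - lis_length([window[i] for i in by_pattern])


def op_match_k(text: list[int], pattern: list[int], k: int) -> list[int]:
    """Start indices of windows matching the pattern with at most k mismatches."""
    m = len(pattern)
    return [
        s
        for s in range(len(text) - m + 1)
        if op_mismatches(text[s:s + m], pattern) <= k
    ]
```

This checks every window independently in O(m log m) time, so it serves only as a correctness baseline for small inputs.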

Development of an Integer Algorithm for Computation of the Matching Probability in the Hidden Markov Model (I)

  • 김진헌;김민기;박귀태
    • 전자공학회논문지B / Vol. 31B, No. 8 / pp.11-19 / 1994
  • The matching probability $P(O \mid \lambda)$ of a signal sequence $O$ observed over a finite time interval with an HMM (hidden Markov model) $\lambda$ indicates the probability that the signal comes from the given model. Because this probability represents the matching score of the observed signal with the model, an unknown signal pattern can be recognized by comparing the magnitudes of its matching probabilities with respect to the known models. However, because the algorithm uses floating-point variables during computation, a hardware implementation requires floating-point units. This paper proposes an integer algorithm that uses positive integers rather than floating-point numbers to compute the matching probability, so that the algorithm can be realized economically in hardware. The algorithm converts the model parameters to integers by multiplying them by positive constants, and prevents the data from diverging by normalizing the variables at each step. The final equation for the matching probability is composed of constant terms and a variable term that contains logarithm operations. A scheme to make the log conversion table smaller is also presented. To analyze the qualitative characteristics of the proposed algorithm, we attach simulation results obtained on two groups of 10 hypothetical models each, and inspect the statistical properties with respect to the model order, the magnitude of the scaling constants, and the effect of the observation length.
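
For orientation, the quantity being computed can be sketched with the standard floating-point forward algorithm, normalized at each step. This is a reference sketch only and not the paper's integer algorithm; the parameter names `pi`, `A`, and `B` (initial, transition, and emission probabilities) and the function name are assumptions.

```python
import math


def log_matching_probability(pi, A, B, observations):
    """Return log P(O | lambda) for a discrete HMM via the scaled forward pass.

    pi[i]   : probability of starting in state i
    A[i][j] : probability of moving from state i to state j
    B[i][o] : probability of emitting symbol o in state i
    """
    if not observations:
        return 0.0  # the empty sequence has probability 1
    n = len(pi)
    alpha = [pi[i] * B[i][observations[0]] for i in range(n)]
    log_prob = 0.0
    for t in range(len(observations)):
        if t > 0:
            obs = observations[t]
            alpha = [
                sum(alpha[i] * A[i][j] for i in range(n)) * B[j][obs]
                for j in range(n)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return -math.inf  # the model cannot produce this sequence
        # Normalizing keeps alpha bounded; the log of each scale factor
        # accumulates into the log-probability.
        alpha = [a / scale for a in alpha]
        log_prob += math.log(scale)
    return log_prob
```

Recognition then amounts to evaluating this score for each known model and choosing the largest. The per-step normalization and the final logarithm correspond to the normalization step and the logarithmic term described in the abstract, which the paper carries out with integers.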

A New S/W Architecture for YARA Speed Enhancement

  • 김창훈
    • 한국통신학회논문지 / Vol. 41, No. 12 / pp.1858-1860 / 2016
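  • This paper proposes a new software architecture that improves YARA's scanning thread algorithm so that pattern matching can be performed against multiple rule files. Compared with the existing YARA, the proposed method reduces the number of times rule files are loaded into memory for matching. With the proposed architecture, memory usage therefore increases in proportion to the number of rule files, but the time spent on pattern matching can be reduced.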

New Matching Scheme for Panorama Image: A Simulation Study

  • Kim, Jeong-Seok;Chung, Sung-Taek;Hong, In-Ki
    • 대한의용생체공학회:의공학회지 / Vol. 28, No. 1 / pp.127-131 / 2007
  • This paper presents a new matching scheme for creating a single panoramic image from a sequence of partially overlapping images of the same object or scene. The matching scheme is based directly on a search algorithm, using a multiscale approach to the Hooke-Jeeves algorithm. The scheme was evaluated using simulated pattern images. It yields good results and could be applied effectively to real ultrasound applications.

Optimal Search Patterns for Fast Block Matching Motion Estimation

  • 임동근;호요성
    • 대한전자공학회:학술대회논문집 / Proceedings of the 대한전자공학회 2000 Summer Conference (4) / pp.39-42 / 2000
  • Motion estimation plays an important role in video coding. In this paper, we derive optimal search patterns for fast block matching motion estimation. By analyzing the block matching algorithm (BMA) as a function of block shape and size, we can find an optimal search pattern for initial motion estimation. The proposed idea, which has been verified experimentally by computer simulations, can provide an analytical basis for the current MPEG-2 proposals. To choose a more compact search pattern for the BMA, we exploit the statistical relationship between the motion and the frame difference of each block.
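
As background for what a search pattern speeds up, here is a minimal Python sketch of exhaustive (full-search) block matching with the sum of absolute differences as the cost. It is the baseline that fast search patterns aim to approximate with fewer candidates, not the method derived in the paper. Frames are assumed to be lists of rows of integer pixel values, and the function names are illustrative.

```python
def sad(cur, ref, bx, by, dx, dy, size):
    """Sum of absolute differences between the block at (bx, by) in cur
    and the block displaced by (dx, dy) in ref."""
    return sum(
        abs(cur[by + y][bx + x] - ref[by + dy + y][bx + dx + x])
        for y in range(size)
        for x in range(size)
    )


def full_search(cur, ref, bx, by, size, radius):
    """Try every displacement within +/- radius and return
    (cost, dx, dy) for the best match, or None if no candidate fits."""
    height, width = len(ref), len(ref[0])
    best = None
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            # Skip candidates whose block would fall outside the reference frame.
            if not (0 <= by + dy and by + dy + size <= height):
                continue
            if not (0 <= bx + dx and bx + dx + size <= width):
                continue
            cost = sad(cur, ref, bx, by, dx, dy, size)
            if best is None or cost < best[0]:
                best = (cost, dx, dy)
    return best
```

Full search evaluates `(2 * radius + 1) ** 2` candidates per block. A fast search pattern evaluates only a chosen subset of them, which is where the choice of pattern shape and size matters.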

A Study on Algorithm of Phonemes Extraction in Korean Character Pattern Recognition

  • 정영화;김은진;김정선
    • 한국통신학회:학술대회논문집 / Proceedings of the 한국통신학회 1985 Fall Conference / pp.109-112 / 1985
  • This paper proposes an algorithm for phoneme extraction in Korean character pattern recognition. The phonemes are classified into patterns that are separable and patterns that are connected with each other. The former are extracted sequentially by pattern matching that takes into account the topological structure of the phonemes and the stroke direction. The latter are extracted by index and window algorithms, which are performed by a $3 \times 3$ sequential local operation on the thinned character pattern.

Study on Faults Diagnosis of Nuclear Pressure Boundary Components using Pattern Recognition of Nuclear Power Plant Simulator Data

  • 안홍민;최현우;강성기;채장범
    • 한국압력기기공학회 논문집 / Vol. 13, No. 1 / pp.48-53 / 2017
  • We diagnosed defects using data obtained from a nuclear power plant simulator. In this paper, faults are diagnosed at the level of the nuclear power plant system rather than the traditional single component or device. We created six fault scenarios and used a fault simulator to obtain the fault data. Patterns were extracted from the acquired fault data, a neural network model was trained, and a simple pattern matching algorithm was applied. We present simulation results and confirm that the applied algorithm works correctly.