Search | Korea Science

Computing Longest Common Substrings by Using Suffix Arrays (써픽스 배열을 이용한 최장 공통 부분 스트링 계산)

전정은;박희진;김동규
- Proceedings of the Korean Information Science Society Conference
- /
- 2004.10a
- /
- pp.739-741
- /
- 2004
최장 공통 부분 스트링이란 주어진 두 개 이상의 스트링에서 가장 길게 일치하는 공통 부분 스트링을 계산하는 문제이다 최장 공통 부분 스트링은 스트링 프로세싱이나 생물정보학 분야에서 널리 사용되고 있는 중요한 문제이지만, 현재까지 연구된 동적 프로그래밍이나 써픽스 트리를 사용한 방법은 저장 공간을 많이 차지하므로 효율적이지 못하다 따라서 적은 저장 공간을 차지하면서도 최장 공통 부분 스트링을 빨리 구할 수 있는 알고리즘이 필요하며, 본 논문에서는 이를 위해 써픽스 배열을 도입하였다. 본 논문에서 제시한 알고리즘은 선형 시간, 공간 복잡도를 가지며, 써픽스 트리의 최하 공통 조상(LCA, Lowest Common Ancestor) 연산이나 써픽스 배열에서 사용하는 그와 비슷한 구간 최소 값 질의(RMQ, Range Minima Query)를 전혀 사용하지 않으므로 매우 효율적이다.
PDF

An Effective Algorithm for Checking Subsumption Relation on String Data Containing Wildcard Characters (와일드카드 문자를 포함하는 스트링 데이터 사이의 포함관계 확인을 위한 효율적인 알고리즘)

Kim, Do-Han;Park, Hee-Jin;Paek, Eun-Ok
- Journal of KIISE:Computer Systems and Theory
- /
- v.32 no.9
- /
- pp.475-482
- /
- 2005
String data containing wildcard characters may represent certain patterns in texts. A subsumption relation between two patterns can be defined by a subset relation between sets of strings that match those patterns. Thus, the subsumption relation check is important to determine whether each pattern represents a set of strings without any overlap with another pattern. In this paper, we propose an effective algorithm that can determine subsumption relation between strings with wildcard characters. First, we consider a simple extension of the suffix tree algorithm so that it nay include wildcard characters and then we propose another method that checks the subsumption relation by dividing a suffix tree structure at each location of string data.
PDF KSCI

The Recognition of Occluded 2-D Objects Using the String Matching and Hash Retrieval Algorithm (스트링 매칭과 해시 검색을 이용한 겹쳐진 이차원 물체의 인식)

Kim, Kwan-Dong;Lee, Ji-Yong;Lee, Byeong-Gon;Ahn, Jae-Hyeong
- The Transactions of the Korea Information Processing Society
- /
- v.5 no.7
- /
- pp.1923-1932
- /
- 1998
This paper deals with a 2-D objects recognition algorithm. And in this paper, we present an algorithm which can reduce the computation time in model retrieval by means of hashing technique instead of using the binary~tree method. In this paper, we treat an object boundary as a string of structural units and use an attributed string matching algorithm to compute similarity measure between two strings. We select from the privileged strings a privileged string wIth mmimal eccentricity. This privileged string is treated as the reference string. And thell we wllstructed hash table using the distance between privileged string and the reference string as a key value. Once the database of all model strings is built, the recognition proceeds by segmenting the scene into a polygonal approximation. The distance between privileged string extracted from the scene and the reference string is used for model hypothesis rerieval from the table. As a result of the computer simulation, the proposed method can recognize objects only computing, the distance 2-3tiems, while previous method should compute the distance 8-10 times for model retrieval.
PDF

A new statistical test for random sequences (난수열에 대한 새로운 통계적 검정)

김혜정;이경현
- Proceedings of the Korea Institutes of Information Security and Cryptology Conference
- /
- 1997.11a
- /
- pp.332-341
- /
- 1997
본 논문에서는 여러 난수열 발생기들의 안전성 평가를 위한 새로운 통계적 검정법을 소개한다. 검정에서 구현된 기본 개념은 다음 비트 검정 이론을 바탕으로 하였으며 전체 스트링과 스트링의 일부분에 관한 확률적 통계치가 주어진다면 이를 이용하여 추측할 수 있는 다음 비트들에 관한 정보를 얻을 수 있게 된다는 점을 이용하였다. 본 검정에서는 난수 발생기의 랜덤성 평가시 입력되는 스트링 크기의 크고 작음에 관계없이 모든 임의 길이의 스트링에 적용될 수 있도록 하였으며 이는 난수 발생기를 이용한 암호 시스템의 안전성 평가에 있어서 매우 유용하게 사용될 수 있을 것이다.
PDF

The Measuring Instrument and Algorithm To Find Degraded Solar String Configuration Modules (태양광 스트링 구성 모듈의 성능 저하 검출용 계측기 및 알고리즘)

Son, Han-Byeol;Park, Seong-Mi;Park, Sung-Jun
- Proceedings of the KIPE Conference
- /
- 2018.07a
- /
- pp.362-363
- /
- 2018
태양광 발전시스템에서 태양광 모듈은 고압의 발전전압 형성을 위해 직렬로 구성한 스트링을 사용하고 있다. 그러나 직렬로 연결된 태양광 모듈 중 한 개의 모듈이라도 노후화가 발생하면 노후화가 발생한 스트링의 발전 효율이 감소하는 문제점이 있다. 따라서 본 논문에서는 태양광 스트링에서 노후화 모듈을 판정을 위해 순시 PV 특성곡선을 계측할 수 있는 토포로지를 제안하고 계측된 PV 특성곡선을 이용한 노후화 판정 알고리즘을 제안한다.
PDF

An Experimental and Theoretical Evaluation of the Axial Vibration Properties of a Typical Drillstring (드릴스트링의 종진동 특성에 대한 실험적 및 이론적 연구)

Lee,
- Journal of KSNVE
- /
- v.5 no.1
- /
- pp.107-115
- /
- 1995
An analytical model for drillstring axial vibration is proposed. The drillstring is modelled as an equivalent stepwise uniform bar, and the bottom boundary is modelled asa spring and a damper which depend on WOB(weight on bit). The effect of tool joints and the effect of surrounding layers, such as mud and formation, are evaluated theoretically. To investigate the bottom boundary condition, a forced axial vibration testing technique was developed and the tests with a typical drillstring were performed at various WOB's. The results show good agreement with theoretical results. An important conclusion is that the flexibility of the bottom rock must be included in order to predict resonant frequencies of the drillstring axial vibration.
PDF

Efficient External Memory Algorithm for Finding the Maximum Suffix of a String (스트링의 최대 서픽스를 계산하는 효율적인 외부 메모리 알고리즘)

Kim, Sung-Kwon;Kim, Soo-Cheol;Cho, Jung-Sik
- The KIPS Transactions:PartA
- /
- v.15A no.4
- /
- pp.239-242
- /
- 2008
We study the problem of finding the maximum suffix of a string on the external memory model of computation with one disk. In this model, we are primarily interested in designing algorithms that reduce the number of I/Os between the disk and the internal memory. A string of length N has N suffixes and among these, the lexicographically largest one is called the maximum suffix of the string. Finding the maximum suffix of a string plays a crucial role in solving some string problems. In this paper, we present an external memory algorithm for computing the maximum suffix of a string of length N. The algorithm uses four blocks in the internal memory and performs at most 4(N/L) disk I/Os, where L is the size of a block.
https://doi.org/10.3745/KIPSTA.2008.15-A.4.239 인용 PDF KSCI

Fast Construction of Suffix Arrays for DNA Strings (DNA 스트링에 대하여 써픽스 배열을 구축하는 빠른 알고리즘)

Jo, Jun-Ha;Kim, Nam-Hee;Kwon, Ki-Ryong;Kim, Dong-Kyue
- Journal of KIISE:Computer Systems and Theory
- /
- v.34 no.8
- /
- pp.319-326
- /
- 2007
To perform fast searching in massive data such as DNA strings, the most efficient method is to construct full-text index data structures of given strings. The widely used full-text index structures are suffix trees and suffix arrays. Since the suffix may uses less space than the suffix tree, the suffix array is proper for DNA strings. Previously developed construction algorithms of suffix arrays are not suitable for DNA strings since those are designed for integer alphabets. We propose a fast algorithm to construct suffix arrays on DNA strings whose alphabet sizes are fixed by 4. We reduce the construction time by improving encoding and merging steps on Kim et al.[1]'s algorithm. Experimental results show that our algorithm constructs suffix arrays on DNA strings 1.3-1.6 times faster than Kim et al.'s algorithm, and also for other algorithms in most cases.
PDF KSCI

Searching Algorithms for Protein Sequences and Weighted Strings (단백질 시퀀스와 가중치 스트링에 대한 탐색 알고리즘)

Kim, Sung-Kwon
- Journal of KIISE:Computer Systems and Theory
- /
- v.29 no.8
- /
- pp.456-462
- /
- 2002
We are developing searching algorithms for weighted strings such as protein sequences. Let${\sum}$ be an alphabet and for each $a{\in}{\sum}$ its weight ${\mu}(a)$ is given. Given a string $A=a_1a_2…a_n\; with each ai{\in}{\sum}$, a substring<$A(i.j)=a_ia_{i+1}…a_j$ has weight ${\in}(A(i.j))={\in}(a_i)+{\in}(a_i+1)+…+{\in}(a_j)$.The problem we are dealing with is to preprocess A to build a searching structure, and later, given a query weight M, the structure is used to answer the question of whether there is a substring A(i,j) such that$M={\in}(A(i,j))$.In this paper an algorithm that improves over the previous result will be presented. The previously best known algorithm answers a query in $0(\frac{nlog\;logn}{log\; n})$time using a searching structure that requires O(n) amount of memory. Our algorithm reduces the memory requirement to $0(\frac{n}{log\; n})$ while achieving the same query answer time.
PDF KSCI

Development of the Pattern Matching Engine using Regular Expression (정규 표현식을 이용한 패턴 매칭 엔진 개발)

Ko, Kwang-Man;Park, Hong-Jin
- The Journal of the Korea Contents Association
- /
- v.8 no.2
- /
- pp.33-40
- /
- 2008
In various manners, string pattern matching algorithm has been proven for prominence in speed of searching particular queries and keywords. Whereas, the existing algorithms are limited in terms of various pattern. In this paper, regular expression has been utilized to improve efficiency of pattern matching through efficient execution towards various pattern of queries including particular keywords. Such as this research would enable to search various harmful string pattern more efficiently, rather than matching simple keywords, which also implies excellent speed of string pattern matching compared to that of those existing algorism. In this research, the proposed string search engine generated from the LEX are more efficient than BM & AC algorithm for a string patterns search speed in cases of 1000 with more than patterns, but we have got similar results for the keywords pattern matching.
https://doi.org/10.5392/JKCA.2008.8.2.033 인용 PDF

Search Result 270, Processing Time 0.024 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)