• 제목/요약/키워드: sequence-based method

검색결과 1,746건 처리시간 0.036초

Global Sequence Homology Detection Using Word Conservation Probability

  • Yang, Jae-Seong;Kim, Dae-Kyum;Kim, Jin-Ho;Kim, Sang-Uk
    • Interdisciplinary Bio Central
    • /
    • 제3권4호
    • /
    • pp.14.1-14.9
    • /
    • 2011
  • Protein homology detection is an important issue in comparative genomics. Because of the exponential growth of sequence databases, fast and efficient homology detection tools are urgently needed. Currently, for homology detection, sequence comparison methods using local alignment such as BLAST are generally used as they give a reasonable measure for sequence similarity. However, these methods have drawbacks in offering overall sequence similarity, especially in dealing with eukaryotic genomes that often contain many insertions and duplications on sequences. Also these methods do not provide the explicit models for speciation, thus it is difficult to interpret their similarity measure into homology detection. Here, we present a novel method based on Word Conservation Score (WCS) to address the current limitations of homology detection. Instead of counting each amino acid, we adopted the concept of 'Word' to compare sequences. WCS measures overall sequence similarity by comparing word contents, which is much faster than BLAST comparisons. Furthermore, evolutionary distance between homologous sequences could be measured by WCS. Therefore, we expect that sequence comparison with WCS is useful for the multiple-species-comparisons of large genomes. In the performance comparisons on protein structural classifications, our method showed a considerable improvement over BLAST. Our method found bigger micro-syntenic blocks which consist of orthologs with conserved gene order. By testing on various datasets, we showed that WCS gives faster and better overall similarity measure compared to BLAST.

Heuristics for Job Shop Scheduling Problems with Progressive Weighted Tardiness Penalties and Inter-machine Overlapping Sequence-dependent Setup Times

  • Mongkalig, Chatpon;Tabucanon, Mario T.;Hop, Nguyen Van
    • Industrial Engineering and Management Systems
    • /
    • 제4권1호
    • /
    • pp.1-22
    • /
    • 2005
  • This paper presents new scheduling heuristics, namely Mean Progressive Weighted Tardiness Estimator (MPWT) Heuristic Method and modified priority rules with sequence-dependent setup times consideration. These are designed to solve job shop scheduling problems with new performance measures - progressive weighted tardiness penalties. More realistic constraints, which are inter-machine overlapping sequence-dependent setup times, are considered. In real production environments, inter-machine overlapping sequence-dependent setups are significant. Therefore, modified scheduling generation algorithms of active and nondelay schedules for job shop problems with inter-machine overlapping sequence-dependent setup times are proposed in this paper. In addition, new customer-based measures of performance, which are total earliness and progressive weighted tardiness, and total progressive weighted tardiness, are proposed. The objective of the first experiment is to compare the proposed priority rules with the consideration of sequence-dependent setup times and the standard priority rules without setup times consideration. The results indicate that the proposed priority rules with setup times consideration are superior to the standard priority rules without the consideration of setup times. From the second experiment and the third experiment to compare the proposed MPWT heuristic approach with the efficient priority rules with setup times consideration, the MPWT heuristic method is significantly superior to the Batched Apparent Tardiness Cost with Sequence-dependent Setups (BATCS) rule, and other priority rules based on total earliness and progressive weighted tardiness, and total earliness and tardiness.

Synthetic Self-Similar 네트워크 Traffic의 세 가지 고정길이 Sequence 생성기에 대한 비교 (A Comparison of Three Fixed-Length Sequence Generators of Synthetic Self-Similar Network Traffic)

  • 정해덕;이종숙
    • 정보처리학회논문지C
    • /
    • 제10C권7호
    • /
    • pp.899-914
    • /
    • 2003
  • 최근의 통신 네트워크에서 teletraffic의 양상은 Poisson 프로세스보다 self-similar프로세스에 의해서 더 잘 반영된다. 이는 통신 네트워크의 teletraffic에 관련하여 self-similar한 성질을 고려하지 않는다면, 통신 네트워크의 성능에 관한 결과는 부정확 할 수밖에 없다는 의미가 된다. 따라서, 통신 네트워크에 관한 시뮬레이션을 수행하기 위한 매우 중요한 요소 중에 하나는 충분히 긴 self-similar한 sequence를 얼마나 잘 생성하느냐의 문제이다. 본 논문에서는 FFT〔20〕, RMD〔12〕 그리고 SRA〔5, 10〕 방법을 이용한 세 개의 pseudo-random self-similar sequence 생성기를 비교 분석하였다. 본 Pseudo-random self-similar sequence 생성기의 성질을 매우 긴 sequence를 생성하는데 요구되는 통계적인 정확도와 생성시간에 대해서 분석하였다. 세 개의 pseudo-random self-similar sequence 생성기의 성능은 Hurst 변수의 상대적인 정확도로 보았을 때는 유사했으나, RMD와 SRA 방법을 이용한 pseudo-random self-similar sequence 생성기가 FFT 방법을 이용한 것보다 속도 면에서는 훨씬 빠른 것으로 나타났다. 또한 본 연구를 통해서 pseudo-random self-similar sequence 생성기의 비교분석을 위한 좀더 좋은 방법이 필요하다는 것을 보여주었다.

3D 내시경 영상시컨스의 MPEG-2 코딩 효율에 관한 연구 (A Study on the MPEG-2 Coding Performance of 3D Endoscopic Image Sequence)

  • 송철규;이영묵;이상민;김원기;이제호;이명호
    • 대한의용생체공학회:학술대회논문집
    • /
    • 대한의용생체공학회 1997년도 춘계학술대회
    • /
    • pp.84-87
    • /
    • 1997
  • In this study, for the coding of stereoscopic video sequence, two approaches are presented based on simulcast mathod and sidefield image format. The field sequential method for stereoscopic visualization have been specified. Also, camera parameter and shooting conditions for each test sequence are studied. Coding method based on sidefield format structure revealed better performance over simulcast in PSNR.

  • PDF

교통신호의 페이스순서 및 페이스간격을 고려한 신호최적화 (Optimum signal setting based on phase sequence and interval in an isolated intersection)

  • 김경철;임강원
    • 대한교통학회지
    • /
    • 제14권2호
    • /
    • pp.45-58
    • /
    • 1996
  • In a large signal intersection, it is the most important to set phase sequences and phase intervals of traffic signal in order to improve the efficiency of the capacity as well as safety. These setting allows to select the best sequence of signal phase among several alternatives, and thus to rearrange the starting and ending points of the individual phase using an effective interphase periods (EIP). The EIP is a gap between previous and current traffic movements at a potential collision point in an intersection. Each of traffic movements has an equality for safety and efficiency at the balanced condition of EIP. This paper presents how to set optimally the phase sequences and intervals of traffic signal in an intersection using phase based approach. And in the second part, we applied the theory developed in the first part. In particular, a numerical example of phase base signal setting is presented using a matrix computation method in order to select the best sequence among several alternatives, and thus to rearrange the starting and ending points of the individual phase using the EIP. This method also allows to apply to optimum signal setting even in five-lag or staggered-type intersection.

  • PDF

시계열 데이터베이스에서 DFT-기반 다차원 인덱스를 위한 물리적 데이터베이스 설계 (Physical Database Design for DFT-Based Multidimensional Indexes in Time-Series Databases)

  • 김상욱;김진호;한병일
    • 한국멀티미디어학회논문지
    • /
    • 제7권11호
    • /
    • pp.1505-1514
    • /
    • 2004
  • 시퀀스 매칭은 시계열 데이터베이스로부터 질의 시퀀스와 변화의 추세가 유사한 데이터 시퀀스들을 검색하는 연산이다. 기존의 대부분의 연구에서는 효과적인 시퀀스 매칭을 위하여 다차원 인덱스를 사용하며, 데이터 시퀀스를 이산 푸리에 변환(Discrete Fourier Transform: DFT)한 후, 단순히 앞의 두 개 내지 세 개의 DFT 계수만을 구성 속성 (organizing attributes)으로 사용함으로써 고차원의 경우 발생하는 차원 저주(dimensionality curse) 문제를 해결한다. 본 논문에서는 기존의 단순한 기법이 가지는 성능 상의 문제점들을 지적하고, 이러한 문제점들을 해결하는 최적의 다차원 인덱스 구성 기법을 제안한다. 제안된 기법은 대상이 되는 시계열 데이터베이스의 특성을 사전에 분석함으로써 변별력이 뛰어난 요소들을 다차원 인덱스의 구성 속성으로 선정하며, 비용 모델(cost model)을 기반으로 한 시퀀스 매칭 비용의 추정을 통하여 다차원 인덱스에 참여하는 최적의 구성 속성의 수를 결정한다. 제안된 기법의 우수성을 규명하기 위하여 실험을 통한기존 기법과의 성능 비교를 수행하였다 실험 결과에 의하면, 제안된 기법은 기존의 기법에 비교하여 매우 큰 성능 개선 효과를 가지는 것으로 나타났다.

  • PDF

An Adaptive Speed Estimation Method Based on a Strong Tracking Extended Kalman Filter with a Least-Square Algorithm for Induction Motors

  • Yin, Zhonggang;Li, Guoyin;Du, Chao;Sun, Xiangdong;Liu, Jing;Zhong, Yanru
    • Journal of Power Electronics
    • /
    • 제17권1호
    • /
    • pp.149-160
    • /
    • 2017
  • To improve the performance of sensorless induction motor (IM) drives, an adaptive speed estimation method based on a strong tracking extended Kalman filter with a least-square algorithm (LS-STEKF) for induction motors is proposed in this paper. With this method, a fading factor is introduced into the covariance matrix of the predicted state, which forces the innovation sequence orthogonal to each other and tunes the gain matrix online. In addition, the estimation error is adjusted adaptively and the mutational state is tracked fast. Simultaneously, the fading factor can be continuously self-tuned with the least-square algorithm according to the innovation sequence. The application of the least-square algorithm guarantees that the information in the innovation sequence is extracted as much as possible and as quickly as possible. Therefore, the proposed method improves the model adaptability in terms of actual systems and environmental variations, and reduces the speed estimation error. The correctness and the effectiveness of the proposed method are verified by experimental results.

패턴사전과 비정형성을 통한 이상치 탐지방법 적용 (Anomaly Detection via Pattern Dictionary Method and Atypicality in Application)

  • 오세홍;박종성;윤영삼
    • 센서학회지
    • /
    • 제32권6호
    • /
    • pp.481-486
    • /
    • 2023
  • Anomaly detection holds paramount significance across diverse fields, encompassing fraud detection, risk mitigation, and sensor evaluation tests. Its pertinence extends notably to the military, particularly within the Warrior Platform, a comprehensive combat equipment system with wearable sensors. Hence, we propose a data-compression-based anomaly detection approach tailored to unlabeled time series and sequence data. This method entailed the construction of two distinctive features, typicality and atypicality, to discern anomalies effectively. The typicality of a test sequence was determined by evaluating the compression efficacy achieved through the pattern dictionary. This dictionary was established based on the frequency of all patterns identified in a training sequence generated for each sensor within Warrior Platform. The resulting typicality served as an anomaly score, facilitating the identification of anomalous data using a predetermined threshold. To improve the performance of the pattern dictionary method, we leveraged atypicality to discern sequences that could undergo compression independently without relying on the pattern dictionary. Consequently, our refined approach integrated both typicality and atypicality, augmenting the effectiveness of the pattern dictionary method. Our proposed method exhibited heightened capability in detecting a spectrum of unpredictable anomalies, fortifying the stability of wearable sensors prevalent in military equipment, including the Army TIGER 4.0 system.

Discriminative Training of Sequence Taggers via Local Feature Matching

  • Kim, Minyoung
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • 제14권3호
    • /
    • pp.209-215
    • /
    • 2014
  • Sequence tagging is the task of predicting frame-wise labels for a given input sequence and has important applications to diverse domains. Conventional methods such as maximum likelihood (ML) learning matches global features in empirical and model distributions, rather than local features, which directly translates into frame-wise prediction errors. Recent probabilistic sequence models such as conditional random fields (CRFs) have achieved great success in a variety of situations. In this paper, we introduce a novel discriminative CRF learning algorithm to minimize local feature mismatches. Unlike overall data fitting originating from global feature matching in ML learning, our approach reduces the total error over all frames in a sequence. We also provide an efficient gradient-based learning method via gradient forward-backward recursion, which requires the same computational complexity as ML learning. For several real-world sequence tagging problems, we empirically demonstrate that the proposed learning algorithm achieves significantly more accurate prediction performance than standard estimators.

The Stacking Sequence Optimization of Stiffened Laminated Curved Panels with Different Loading and Stiffener Spacing

  • Kim Cheol;Yoon In-Se
    • Journal of Mechanical Science and Technology
    • /
    • 제20권10호
    • /
    • pp.1541-1547
    • /
    • 2006
  • An efficient procedure to obtain the optimal stacking sequence and the minimum weight of stiffened laminated composite curved panels under several loading conditions and stiffener layouts has been developed based on the finite element method and the genetic algorithm that is powerful for the problem with integer variables. Often, designing composite laminates ends up with a stacking sequence optimization that may be formulated as an integer programming problem. This procedure is applied for a problem to find the stacking sequence having a maximum critical buckling load factor and the minimum weight. The object function in this case is the weight of a stiffened laminated composite shell. Three different types of stiffener layouts with different loading conditions are investigated to see how these parameters influence on the stacking sequence optimization of the panel and the stiffeners. It is noticed from the results that the optimal stacking sequence and lay-up angles vary depending on the types. of loading and stiffener spacing.