• Title/Summary/Keyword: pairwise alignment sequence comparison

Search Result 4, Processing Time 0.023 seconds

Multiple Sequence Aligmnent Genetic Algorithm (진화 알고리즘을 사용한 복수 염기서열 정렬)

  • Kim, Jin;Song, Min-Dong;Choi, Hong-Sik;Chang, Yeon-Ah
    • Korean Journal of Microbiology
    • /
    • v.35 no.2
    • /
    • pp.115-120
    • /
    • 1999
  • Multiple Sequence Alignment of DNA and protem sequences is a imnport'mt tool in the study of molecular evolution, gene regulation. and prolein suucture-function relationships. Progressive pairwise alignment method generates multiple sequence alignment fast but not necessarily with optimal costs. Dynamic programming generates multiple sequence alig~~menl with optimal costs in most cases but long execution time. In this paper. we suggest genetlc algorithm lo improve the multiple sequence alignment generated from the cnlent methods, describe the design of the genetic algorithm, and compare the multiple sequence alignments from 0111 method and current methods.

  • PDF

Identification of Viral Taxon-Specific Genes (VTSG): Application to Caliciviridae

  • Kang, Shinduck;Kim, Young-Chang
    • Genomics & Informatics
    • /
    • v.16 no.4
    • /
    • pp.23.1-23.5
    • /
    • 2018
  • Virus taxonomy was initially determined by clinical experiments based on phenotype. However, with the development of sequence analysis methods, genotype-based classification was also applied. With the development of genome sequence analysis technology, there is an increasing demand for virus taxonomy to be extended from in vivo and in vitro to in silico. In this study, we verified the consistency of the current International Committee on Taxonomy of Viruses taxonomy using an in silico approach, aiming to identify the specific sequence for each virus. We applied this approach to norovirus in Caliciviridae, which causes 90% of gastroenteritis cases worldwide. First, based on the dogma "protein structure determines its function," we hypothesized that the specific sequence can be identified by the specific structure. Firstly, we extracted the coding region (CDS). Secondly, the CDS protein sequences of each genus were annotated by the conserved domain database (CDD) search. Finally, the conserved domains of each genus in Caliciviridae are classified by RPS-BLAST with CDD. The analysis result is that Caliciviridae has sequences including RNA helicase in common. In case of Norovirus, Calicivirus coat protein C terminal and viral polyprotein N-terminal appears as a specific domain in Caliciviridae. It does not include in the other genera in Caliciviridae. If this method is utilized to detect specific conserved domains, it can be used as classification keywords based on protein functional structure. After determining the specific protein domains, the specific protein domain sequences would be converted to gene sequences. This sequences would be re-used one of viral bio-marks.

Genetic Relationship of the Ampelopsis brevipedunculata var. heterophylla and Vitis thunbergii var. sinuata with the Other Vitis Plants (개머루와 까마귀머루의 유전적 유연관계 분석)

  • Bae, Young-Min
    • Journal of Life Science
    • /
    • v.27 no.1
    • /
    • pp.89-94
    • /
    • 2017
  • DNA sequences of the intergenic spacer 1 and intergenic spacer 2 of the nineteen plants belonging Vitis genus were collected from the Genbank. DNA sequences of the same regions of Vitis thunbergii var. sinuata and Ampelopsis brevipedunculata var. heterophylla, both common plants in Korea, were not available in Genbank. Those two plants were collected, their genomic DNA encoding 18S rRNA, intergenic spacer 1, 5.8S rRNA, intergenic spacer 2 and part of 28S rRNA amplified and DNA sequence determined. DNA sequences of twenty-one plants including two Korean plants were aligned by the Multiple sequence comparison by log-expectation(MUSCLE) algorithm and the alignment was used to calculate neighbor-joining tree and pairwise distance. The results indicate DNA sequences of the two Korean plants are highly homologous with each other, but they are quite distantly related to the other Vitis plants. Distant relationship of the two Korean plants with the other Vitis plants might be due to independent evolution of those two plants in geographically isolated environment. Those two Korean plants are classified in different genera based on the morphology, one in Vitis genus and the other in Ampelopsis genus, providing another example of discrepancy between morphological and genetic classification.

Correlation Analysis of the Arirangs Based on the Informatics Algorithms (정보 알고리즘 기반 아리랑의 계통도 및 상관관계 분석)

  • Kim, Hak Yong
    • The Journal of the Korea Contents Association
    • /
    • v.14 no.4
    • /
    • pp.407-417
    • /
    • 2014
  • An arirang is the most famous Korean folk song and was registered in UNESCO(Unitied Nations Educational, Scientific and cultural Organization) as an intangible cultural heritage in 2012. Most arirangs are composed of text and refrain parts. Genealogy of the arirang was classified in refrain patterns by using multiple sequence alignment algorithm. There are two different refrain patterns, slow and fast melodies. Of 106 arirangs, 38 and 68 arirangs contain fast and slow melodies, respectively. 73 arirangs and 104 their key words were extracted from bipartate arirang network that composed of arirangs, text works, and their relationships. The correlation among the arirangs was analyzed from the selected arirangs and key words by using pairwise comparison matrix. Also, analysis of correlation among the arirnags was performed by stepwise removal of the single degree nodes from the bipartate arirang network In this study, arirangs were analyzed in genealogy and correlation among arirangs by using informatic algorithm and network technology, in which arirang research will be constructed a stepping stone for the popularization and globalization of the arirangs.