• 제목/요약/키워드: Sequence homology

검색결과 918건 처리시간 0.026초

Cloning of the Adenosine Deaminase Gene from Pseudomonas iodinum IFO 3558

  • Jo, Young-Bae;Baik, Hyung-Suk;Bae, Kyung-Mi;Jun, Hong-Ki
    • Journal of Life Science
    • /
    • 제9권2호
    • /
    • pp.9-14
    • /
    • 1999
  • Pseudomonas iodinum IFO 3558 adenosine deaminase(ADA) gene was cloned by the polymerase chain reaction and deduced the amino acid sequence of the enzyme. DNA sequence homology of Pseudomonas iodinum IFO 3558 ADA gene was compared to those of E. coli, human and mouse ADA genes. Unambiguous sequence from both strands of pM21 was obtained for the region believed to encode ADA. The sequence included a 804-nucleotide open reading frame, bounded on one end by sense primer and on the other end by two antisense primer. This open reading frame encodes a protein of 268 amino acids having a molecular weight of 29,448. The deduced amino acid sequence shows considerable similarity to those of E. coli, mouse and human ADA. Pseudomonas iodinum IFO 3558 nucleotide sequence shows 98.5% homology with that of the E. coli ADA sequence and 51.7% homology with that of the mouse ADA sequence and 52.5% homology with that of the human ADA sequence. The ADA protein sequence of Pseudomonas iodinum IFO 3558 shows 96.9% homology with that of the E. coli and 40.7% homology with that of the mouse and 41.8% homology with that of the human. The distance between two of the conserved elements, TVHAGE and SL(1)NTDDP has veen exactly conserved at 76 amino acids for all four ADAs. Two of the four conserved sequence elements shared among the four ADAs are also present in the yeast, rat, human (M), and Human(L) AMP deaminase. The SLSTDDP sequence differs only in the conservative substitution of a serine for an asparagine. A conserved cysteine with conserved spacing between these two regions is also found. Thus, sequence analysis of four ADAs and four AMP deaminases revealed the presence of a highly conserved sequence motif, SLN(S)TDDP, a conserved dipeptide, HA, and a conserved cysteine residue.

Global Sequence Homology Detection Using Word Conservation Probability

  • Yang, Jae-Seong;Kim, Dae-Kyum;Kim, Jin-Ho;Kim, Sang-Uk
    • Interdisciplinary Bio Central
    • /
    • 제3권4호
    • /
    • pp.14.1-14.9
    • /
    • 2011
  • Protein homology detection is an important issue in comparative genomics. Because of the exponential growth of sequence databases, fast and efficient homology detection tools are urgently needed. Currently, for homology detection, sequence comparison methods using local alignment such as BLAST are generally used as they give a reasonable measure for sequence similarity. However, these methods have drawbacks in offering overall sequence similarity, especially in dealing with eukaryotic genomes that often contain many insertions and duplications on sequences. Also these methods do not provide the explicit models for speciation, thus it is difficult to interpret their similarity measure into homology detection. Here, we present a novel method based on Word Conservation Score (WCS) to address the current limitations of homology detection. Instead of counting each amino acid, we adopted the concept of 'Word' to compare sequences. WCS measures overall sequence similarity by comparing word contents, which is much faster than BLAST comparisons. Furthermore, evolutionary distance between homologous sequences could be measured by WCS. Therefore, we expect that sequence comparison with WCS is useful for the multiple-species-comparisons of large genomes. In the performance comparisons on protein structural classifications, our method showed a considerable improvement over BLAST. Our method found bigger micro-syntenic blocks which consist of orthologs with conserved gene order. By testing on various datasets, we showed that WCS gives faster and better overall similarity measure compared to BLAST.

INSTABILITY OF THE BETTI SEQUENCE FOR PERSISTENT HOMOLOGY AND A STABILIZED VERSION OF THE BETTI SEQUENCE

  • JOHNSON, MEGAN;JUNG, JAE-HUN
    • Journal of the Korean Society for Industrial and Applied Mathematics
    • /
    • 제25권4호
    • /
    • pp.296-311
    • /
    • 2021
  • Topological Data Analysis (TDA), a relatively new field of data analysis, has proved very useful in a variety of applications. The main persistence tool from TDA is persistent homology in which data structure is examined at many scales. Representations of persistent homology include persistence barcodes and persistence diagrams, both of which are not straightforward to reconcile with traditional machine learning algorithms as they are sets of intervals or multisets. The problem of faithfully representing barcodes and persistent diagrams has been pursued along two main avenues: kernel methods and vectorizations. One vectorization is the Betti sequence, or Betti curve, derived from the persistence barcode. While the Betti sequence has been used in classification problems in various applications, to our knowledge, the stability of the sequence has never before been discussed. In this paper we show that the Betti sequence is unstable under the 1-Wasserstein metric with regards to small perturbations in the barcode from which it is calculated. In addition, we propose a novel stabilized version of the Betti sequence based on the Gaussian smoothing seen in the Stable Persistence Bag of Words for persistent homology. We then introduce the normalized cumulative Betti sequence and provide numerical examples that support the main statement of the paper.

Rotaviruses의 염기배열 유사성 측정 (Nucleotide Sequence Homology in Rotaviruses)

  • 양재명
    • 미생물학회지
    • /
    • 제26권3호
    • /
    • pp.155-161
    • /
    • 1988
  • Nucleotide sequence homology between bovine, simian, and porcine rotavirus was determined by the RNA:RNA hybridization technique. Single stranded RNA, prepared in vitro with EDTA activated endogeneous viral RNA polymerase, was hhbridized with tritium labeled bovine rotavirus genomic RNA. The heteroduplex RNA was treated with single stranded RNA specific ribonucleases and the RNase resistant hybrid RNA was precipitated, and collected by filtration on a filter paper. Seventy four percent RNA sequence homology between bovine and simian rotavirus and 8 percent RNA sequence homology between bovine and porcine rotavirus was confirmed by hybridization between tritium labeled single stranded RNA and viral genomic RNA.

  • PDF

북미자생 치마버섯의 Mating Locus의 염기서열 (Nucleotide Sequence of Mating Locus of Schizophyllum commune Indigenous to North America)

  • 박동철;이상선;이인선;김현정;이갑랑
    • 한국균학회지
    • /
    • 제28권1호
    • /
    • pp.22-25
    • /
    • 2000
  • Cosmid clone으로부터 mating activity를 가지는 pSC13의 양 말단은 1-71 $A{\alpha}3$ 교배유전자좌 Z3부위 2,430 bp와 Y3 region 8,160 bp 부근에서 homology를 지니는 것으로 나타나 교배활성에 필요한 모든 부위를 함유하는 것으로 사료되었다. 그리고 pSCE2는 1-71 $A{\alpha}3$의 Y region의 6,080 bp 부근에서 거의 완전한 homology를 가지는 것으로 보아 $A{\alpha}3$의 Y region을 함유하고 있음을 알 수 있으며, 또한 pSCE1도 MEP의 일부 sequence와도 거의 완전한 homology를 가지는 것으로 나타나 1-71 $A{\alpha}3$ 교배유전자좌와 모두 유사한 염기서열를 모두 지니면서 북미자생 치마버섯의 교배유전자좌의 염기배열이 남미자생의 것과 거의 유사한 분자적 구조를 지니고 있음을 나타내었다. 결정된 DNA sequence는 3265 bp로서 남미산 치마버섯 1-71 stain의 mating활성을 나타내는 $A{\alpha}3$ locus 염기서열중에 Z region과 거의 완전한 약 96%의 homology를 나타내었다. 또한 Polypeptide sequence비교에서도 약 82%의 높은 homology를 나타내었으며, 특히 transcription regulator로 알려진 homeodomain 및 acidic region에서는 각각 약 74%. 82%의 상당히 높은 비율의 homology를 지니고 있음이 확인되었다. 이러한 결과로 보아 남미와 북미의 대륙간에 자생하는 같은 allele type간에도 상당히 높은 비율의 교배유전자좌의 보존이 이루어지고 있음을 알 수 있다.

  • PDF

Sequence Analysis of NS4 Region of HCV Isolated from Korean Patient

  • Paik, Sang-Hoon;Lee, Young-Ik;Kim, Won-Bae;Yang, Jai-Myung
    • Journal of Microbiology
    • /
    • 제33권3호
    • /
    • pp.260-266
    • /
    • 1995
  • Hepatitis C virus (HCV) has been considered as a mojor causative agent of post-transfusion related non-A, non-B hepatitis. In this study, the cDNA sequence of NS4 region of HCV (HCV-S) obtained from a Korean patient's plasms was determined. Comparative nucleotide sequence analysis between to type II. 67.2% homology to type III, and 66.4% homology to type IV. The putative amino acid sequence homologies to types I, II, III, and IV were 82.8-84.7%, 92.5-95.1%. 72.5, and 71.1%, respectively. This data strongly suggests that HCV-S should be classified as type II. Significant similarities of hydrophobicity profiles and putative transmembranous domains were found in HCV-S and four major prototypes, indicating that the protein structure is similar in spite of the heterogeneities of intertype homologies at the level of the psrimary nucleotide and amino acid sequences.

  • PDF