• Title/Summary/Keyword: protein sequences

Search Result 1,071, Processing Time 0.026 seconds

Complete Mitochondrial Genome of Echinostoma hortense (Digenea: Echinostomatidae)

  • Liu, Ze-Xuan;Zhang, Yan;Liu, Yu-Ting;Chang, Qiao-Cheng;Su, Xin;Fu, Xue;Yue, Dong-Mei;Gao, Yuan;Wang, Chun-Ren
    • Parasites, Hosts and Diseases
    • /
    • v.54 no.2
    • /
    • pp.173-179
    • /
    • 2016
  • Echinostoma hortense (Digenea: Echinostomatidae) is one of the intestinal flukes with medical importance in humans. However, the mitochondrial (mt) genome of this fluke has not been known yet. The present study has determined the complete mt genome sequences of E. hortense and assessed the phylogenetic relationships with other digenean species for which the complete mt genome sequences are available in GenBank using concatenated amino acid sequences inferred from 12 protein-coding genes. The mt genome of E. hortense contained 12 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes, and 1 non-coding region. The length of the mt genome of E. hortense was 14,994 bp, which was somewhat smaller than those of other trematode species. Phylogenetic analyses based on concatenated nucleotide sequence datasets for all 12 protein-coding genes using maximum parsimony (MP) method showed that E. hortense and Hypoderaeum conoideum gathered together, and they were closer to each other than to Fasciolidae and other echinostomatid trematodes. The availability of the complete mt genome sequences of E. hortense provides important genetic markers for diagnostics, population genetics, and evolutionary studies of digeneans.

Sequence Analysis of the Gene Encoding gp55 Protein of Suri Strain, an Attenuated Classical Swine Fever (Hog Cholera) Virus (Classical Swine Fever (Hog Cholera) Virus 약독순화주 (Suri 주)의 gp55 Gnen 염기서열 분석)

  • Kim, Kui-Hyun;Chang, Kyung-Soo;Kang, Kyong-Im;Lee, Byung-Hyung;Park, Jong-Hyeon;An, Soo-Hwan;Jun, Moo-Hyung
    • The Journal of Korean Society of Virology
    • /
    • v.28 no.4
    • /
    • pp.303-316
    • /
    • 1998
  • An attenuated classical swine fever virus (CSFV), Suri strain, is a variant derived from a vaccine virus, LOM strain. This study was performed to elucidate the molecular biologcal properties of CSFV Suri strain, and to obtain the basic data for molecular epidemiological approaches for the disease. The truncated form of gp55 gene without the C-terminal transmembrane domain, in size of 1,023bp, was amplified by RT-PCR and sequenced by dye terminator cyclic sequencing method, and inserted into BamHI site of pAcGP67B baculovirus vector, establishing a cloned pAcHEG plasmid. By the nucleotide sequences determined, 341 amino acid sequences were predicted. As compared the nucleotide and amino acid sequences of gp55 of Suri with the various CSFV, Suri strain showed the high homology over 99.1% with ALD and LOM strains, but comparably the lower homology with Alfort and Brescia. In comparison of amino acid sequence in variable domain of gp55 protein, the similar tendency of homology was observed. In hydrophobicity analysis, all of four CSFV strains revealed the analogous patterns of hydrophobicity. The numbers and locations of N-glycosylation site and cysteine residues in gp55 were analyzed, those of Suri strain being coincident with ALD and LOM strains. The results suggest that gp55 in Suri strain has the high similarity to those in ALD and LOM strains in terms of the nucleotide and amino acid sequences and the functional properties of gp55 protein.

  • PDF

Prediction of Rice Embryo Proteins using EST-Databases

  • Woo, Sun-Hee;Cho, Seung-Woo;Kim, Tae-Seon;Chung, Keun-Yook;Cho, Yong-Gu;Kim, Hong-Sig;Song, Beom-Heon;Lee, Chul-Won;Jong, Seung-Keun
    • Korean Journal of Breeding Science
    • /
    • v.40 no.1
    • /
    • pp.1-7
    • /
    • 2008
  • An attempt was made to link rice embryo proteins to DNA sequences and to understand their functions. One hundred of the 700 spots detected on the embryo 2-DE gels were microsequenced. Of these, 28% of the embryo proteins were matched to DNA sequences with known functions, but 72% of the proteins were unknown in functions as previously reported (Woo et al. 2002). In addition, twenty-four protein spots with 100% of homology and nine with over 80% were matched to ESTs (expressed sequence tags) after expanding the amino acid sequences of the protein spots by Database searches using the available rice EST databases at the NCBI (http://www/ncbi.nlm.nih.gov/) and DDBJ (http://www.ddbj.nig.ac.jp/). The chromosomal location of some proteins were also obtained from the rice genetic map provided by Japanese Rice Genome Research Program (http://rgp.dna.affrc.go.jp). The DNA sequence databases including EST have been reported for rice (Oryza sativa L.) now provides whole or partial gene sequence, and recent advances in protein characterization allow the linking proteins to DNA sequences in the functional analysis. This work shows that proteome analysis could be a useful tool strategy to link sequence information and to functional genomics.

Nucleotide Sequence of Coat Protein Gene of Kyuri Green Mottle Mosaic Virus Isolated from Zucchini

  • Lee, Su-Heon;Lee, Young-Gyu;Park, Jin-Woo;Park, Hong-Soo;Kim, Yeong-Tae;Cheon, Jeong-Uk;Lee, Key-Woon
    • The Plant Pathology Journal
    • /
    • v.16 no.2
    • /
    • pp.118-124
    • /
    • 2000
  • The coat protein (CP) gene of kyuri green mottle mosaic virus zucchini strain (KGMMV-Z) isolated from zucchini (Cucurbita pepo) in Chonfu, Korea in 1999 was sequenced by the reverse transcription and polymerase chain reaction with degenerate and generate primers originated from tobamoviruses. The degenerate primers were very effective in amplification of KGMMV-Z CP region. The KGMMV-Z CP gene consisted of 486 nucleotides and had the same nucleotide length compared with those of cucurbit-infecting tobamoviruses. KGMMV-Z CP gene shared 43.8, 44.2, and 44.4% nucleotide sequence similarity with the CP gene of cucumber green mottle mosaic virus watermelon strain (CGMMZ-W), CGMMV-KW1, and CGMMV-SH, respectively, whereas three CGMMV strains among themselves showed 98.6-99.6% nucleotide similarity. The deduced amino acids of KGMMV-Z CP gene were 161 amino acid residues with the molecular weight of 17,181 daltons. The first 24 codons of KGMMV-Z CP gene corresponded to the sequences of the N-terminal amino acid of the viral capsid protein. The amino acid sequences of KGMMV-Z CP had 45.3% similarity compared with those of three CGMMV strains. However, the amino acid sequences of CGMMV strains were identical. These results showed that two cucurbit-infecting tobamovirus members, KGMMV-Z and CGMMV were genetically distantly related.

  • PDF

Protein Microarrays and Their Applications

  • Lee, Bum-Hwan;Teruyuki Nagamune
    • Biotechnology and Bioprocess Engineering:BBE
    • /
    • v.9 no.2
    • /
    • pp.69-75
    • /
    • 2004
  • In recent years, the importance of proteomic works, such as protein expression, detection and identification, has grown in the fields of proteomic and diagnostic research. This is because complete genome sequences of humans, and other organisms, progress as cellular processing and controlling are performed by proteins as well as DNA or RNA. However, conventional I protein analyses are time-consuming; therefore, high throughput protein analysis methods, which allow fast, direct and quantitative detection, are needed. These are so-called protein microarrays or protein chips, which have been developed to fulfill the need for high-throughput protein analyses. Although protein arrays are still in their infancy, technical development in immobilizing proteins in their native conformation on arrays, and the development of more sensitive detection methods, will facilitate the rapid deployment of protein arrays as high-throughput protein assay tools in proteomics and diagnostics. This review summarizes the basic technologies that are needed in the fabrication of protein arrays and their recent applications.

Sequencing of cDNA Clones Expressed in Adipose Tissues of Korean Cattle

  • Bong, J.J.;Tong, K.;Cho, K.K.;Baik, M.G.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.18 no.4
    • /
    • pp.483-489
    • /
    • 2005
  • To understand the molecular mechanisms that regulate intramuscular fat deposition and its release, cDNA clones expressed in adipose tissues of Korean cattle were identified by differential screening from adipose tissue cDNA library. By partial nucleotide sequencing of 486 clones and a search for sequence similarity in NCBI nucleotide databases, 245 clones revealed unique clones. By a functional grouping of the clones, 14% of the clones were categorized to metabolism and enzyme-related group (stearoyl CoA desaturase, lactate dehydrogenase, fatty acid synthase, ATP citrate lyase, lipoprotein lipase, acetyl CoA synthetase, etc), and 6% to signal transduction/cell cycle-related group (C/EBP, cAMP-regulated phosphoprotein, calmodulin, cyclin G1, cyclin H, etc), and 4% to cytoskeleton and extracellular matrix components (vimentin, ankyrin 2, gelosin, syntenin, talin, prefoldin 5). The obtained 245 clones will be useful to study lipid metabolism and signal transduction pathway in adipose tissues and to study obesity in human. Some clones were subjected to full-sequencing containing open reading frame. The cDNA clone of bovine homolog of human prefoldin 5 gene had a total length of 959 nucleotides coding for 139 amino acids. Comparison of the deduced amino acid sequences of bovine prefoldin 5 with those of human and mouse showed over 95% identity. The cDNA clone of bovine homolog of human ubiquitin-like/S30 ribosomal fusion protein gene had a total length of 484 nucleotides coding for 133 amino acids. Comparison of the deduced amino acid sequences of bovine ubiquitin-like/S30 ribosomal fusion protein gene with those of human, rat and mouse showed over 97% identity. The cDNA clone of bovine homolog of human proteolipid protein 2 mRNA had a total length of 928 nucleotides coding for 152 amino acids. Comparison of the deduced amino acid sequences of bovine proteolipid protein 2 with those of human and mouse showed 87.5% similarity. The cDNA clone of bovine homolog of rat thymosin beta 4 had a total length of 602 nucleotides coding for 44 amino acids. Comparison of the deduced amino acid sequences of bovine thymosin beta 4 gene with those of human, mouse and rat showed 93.1% similarity. The cDNA clone of bovine homolog of human myotrophin mRNA had a total length of 790 nucleotides coding for 118 amino acids. Comparison of the deduced amino acid sequences of bovine myotrophin gene with those of human, mouse and rat showed 83.9% similarity. The functional role of these clones in adipose tissues needs to be established.

Molecular cloning and nucleotide sequence of schizosaccharomyces pombe Homologue of the receptor for activated protein kinase C gene

  • Park, Seung-Keil;Yoo, Hyang-Sook
    • Journal of Microbiology
    • /
    • v.33 no.2
    • /
    • pp.128-131
    • /
    • 1995
  • Using differential hybridization, we selected the prk gene fortuitously from Schizosaccharomyces pombe homologous to RACK1 of rat which encodes the receptor for activated protein kinase C. The cDNA sequence of prk was determined and its deduced amino acid sequence was 76% homologous to RACK1 and had the feature of trimeric G protein bata subunit. The specific amino acid sequences required for the protein kinase C binding were also present in Prk as in the case of RACK1 protein. From these similarities, we suggest that the Prk is protein kinase C binding protein of S. prombe. The involvement of Prk in signal transduction mediated by protein kinase C remained to be studied.

  • PDF

A Study on Clustering and Identifying Gene Sequences using Suffix Tree Clustering Method and BLAST (서픽스트리 클러스터링 방법과 블라스트를 통합한 유전자 서열의 클러스터링과 기능검색에 관한 연구)

  • Han, Sang-Il;Lee, Sung-Gun;Kim, Kyung-Hoon;Lee, Ju-Yeong;Kim, Young-Han;Hwang, Kyu-Suk
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.11 no.10
    • /
    • pp.851-856
    • /
    • 2005
  • The DNA and protein data of diverse species have been daily discovered and deposited in the public archives according to each established format. Database systems in the public archives provide not only an easy-to-use, flexible interface to the public, but also in silico analysis tools of unidentified sequence data. Of such in silico analysis tools, multiple sequence alignment [1] methods relying on pairwise alignment and Smith-Waterman algorithm [2] enable us to identify unknown DNA, protein sequences or phylogenetic relation among several species. However, in the existing multiple alignment method as the number of sequences increases, the runtime increases exponentially. In order to remedy this problem, we adopted a parallel processing suffix tree algorithm that is able to search for common subsequences at one time without pairwise alignment. Also, the cross-matching subsequences triggering inexact-matching among the searched common subsequences might be produced. So, the cross-matching masking process was suggested in this paper. To identify the function of the clusters generated by suffix tree clustering, BLAST was combined with a clustering tool. Our clustering and annotating tool is summarized as the following steps: (1) construction of suffix tree; (2) masking of cross-matching pairs; (3) clustering of gene sequences and (4) annotating gene clusters by BLAST search. The system was successfully evaluated with 22 gene sequences in the pyrubate pathway of bacteria, clustering 7 clusters and finding out representative common subsequences of each cluster

A K-Nearest Neighbor Algorithm for Categorical Sequence Data (범주형 시퀀스 데이터의 K-Nearest Neighbor알고리즘)

  • Oh Seung-Joon
    • Journal of the Korea Society of Computer and Information
    • /
    • v.10 no.2 s.34
    • /
    • pp.215-221
    • /
    • 2005
  • TRecently, there has been enormous growth in the amount of commercial and scientific data, such as protein sequences, retail transactions, and web-logs. Such datasets consist of sequence data that have an inherent sequential nature. In this Paper, we study how to classify these sequence datasets. There are several kinds techniques for data classification such as decision tree induction, Bayesian classification and K-NN etc. In our approach, we use a K-NN algorithm for classifying sequences. In addition, we propose a new similarity measure to compute the similarity between two sequences and an efficient method for measuring similarity.

  • PDF