Molecular Cloning of the nahC Gene Encoding 1,2-Dihydroxynaphthalene Dioxygenase from Pseudomonas fluorescens

  • KIM, YEO-JUNG;NA-RI LEE;SOON-YOUNG CHOI;KYUNG-HEE MIN
    • Journal of Microbiology and Biotechnology, v.12 no.1, pp.172-175, 2002
  • The complete nucleotide sequence of the nahC gene from Pseudomonas fluorescens, the structural gene for 1,2-dihydroxynaphthalene (1,2-DHN) dioxygenase, was determined. The 1,2-DHN dioxygenase is an extradiol ring-cleavage enzyme that cleaves the first ring of 1,2-dihydroxynaphthalene. The amino acid sequence of the dioxygenase deduced from the nucleotide sequence suggested that the holoenzyme consists of eight identical subunits with a molecular weight of approximately 34,200. The amino acid sequence of 1,2-DHN dioxygenase showed more than $90\%$ homology with those of the dioxygenases of other Pseudomonas strains. However, sequence similarity with those of the Sphingomonas species was less than $60\%$. The nahC gene of P. fluorescens was moderately expressed in E. coli NM522, as determined by enzymatic activity.

Sequencing of the RSDA Gene Encoding Raw Starch-Digesting $\alpha$-Amylase of Bacillus circulans F-2: Identification of Possible Two Domains for Raw Substrate-Adsorption and Substrate-Hydrolysis

  • Kim, Cheorl-Ho
    • Journal of Microbiology and Biotechnology, v.2 no.1, pp.56-65, 1992
  • The complete nucleotide sequence of the Bacillus circulans F-2 RSDA gene, coding for raw starch-digesting $\alpha$-amylase (RSDA), has been determined. The RSDA structural gene consists of an open reading frame of 2508 bp. Six bp upstream of the translational start codon of the RSDA is a typical gram-positive Shine-Dalgarno sequence, and the RSDA encodes a preprotein of 836 amino acids with an Mr of 96,727. The gene was expressed from its own regulatory region in E. coli, and two putative consensus promoter sequences were identified upstream of a ribosome binding site and an ATG start codon. The nucleotide sequence was confirmed and the signal peptide cleavage site was identified by comparing the predicted amino acid sequence with that derived by N-terminal analysis of the purified RSDA. The deduced N-terminal region of the RSDA conforms to the general pattern for the signal peptides of secreted prokaryotic proteins. The complete amino acid sequence was deduced and its homology with other enzymes was compared. The results suggested that the Thr-Ser-rich hinge region and the non-catalytic domain are necessary for efficient adsorption onto raw substrates, and the catalytic domain (60 kDa) is necessary for the hydrolysis of substrates, as suggested in previous studies (8, 9).


Cloning and Sequence Analysis of Glyceraldehyde-3-Phosphate Dehydrogenase Gene in Yak

  • Li, Sheng-Wei;Jiang, Ming-Feng;Liu, Yong-Tao;Yang, Tu-Feng;Wang, Yong;Zhong, Jin-Cheng
    • Asian-Australasian Journal of Animal Sciences, v.21 no.11, pp.1673-1679, 2008
  • To study the biological function of the gapdh gene in yak, and to test whether it is a useful internal reference gene for molecular biology research in yak, the cDNA sequence encoding glyceraldehyde-3-phosphate dehydrogenase from yak was cloned by RT-PCR using gene-specific PCR primers. The sequencing results indicated that the cloned cDNA fragment (1,008 bp) contained a 1,002 bp open reading frame, encoding 333 amino acids (AAs) with a molecular mass of 35.753 kDa. The deduced amino acid sequence showed a high level of sequence identity to Bos taurus (99.70%), Xenopus laevis (94.29%), Homo sapiens (97.01%), Mus musculus (97.90%) and Sus scrofa (98.20%). The expression of the yak gapdh gene in heart, spleen, kidney and brain tissues was also examined; the results showed that the gapdh gene was expressed in all these tissues. Further analysis of the yak GAPDH amino acid sequence implied that it contained a complete glyceraldehyde-3-phosphate dehydrogenase active site (ASCTTNCL) spanning amino acid residues 148 to 155. It also contained two conserved domains, a NAD binding domain in its N-terminal and a complete catalytic domain of sugar transport in its C-terminal. The phylogenetic analysis showed that yak and Bos taurus were the closest species. The prediction of secondary structures indicated that yak GAPDH had a secondary structure similar to other isolated GAPDH proteins. The results of this study suggested that the gapdh gene of yak was similar to that of other species and could be used as the internal reference to analyze the expression of other genes in yak.
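The relationship between the reported open reading frame length and the protein length can be checked with simple arithmetic. The following is a minimal sketch, not taken from the paper; the function name `protein_length` is hypothetical, and it assumes the reported ORF length includes the stop codon.

```python
def protein_length(orf_bp: int, includes_stop: bool = True) -> int:
    """Return the number of amino acids encoded by an ORF of orf_bp nucleotides.

    Assumption (not stated in the abstract): the ORF length counts the
    terminal stop codon, which does not code for an amino acid.
    """
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


# Yak gapdh: a 1,002 bp ORF gives 334 codons, one of which is the stop codon.
print(protein_length(1002))  # 333
```

Under that assumption, the 1,002 bp ORF is consistent with the 333 amino acids reported in the abstract.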

Cloning and Nucleotide Sequence of the recA Gene from Shigella sonnei KNIH104S Isolated in Korea

  • Park, Yong-Chjun;Shin, Hee-Jung;Kim, Young-Chang
    • BMB Reports, v.32 no.5, pp.436-439, 1999
  • Shigella sonnei is an important cause of human enteric infections. S. sonnei KNIH104S was previously reported to be isolated from Korean shigellosis patients. We cloned a 2.8-kb KpnI fragment containing the recA gene, which encodes a recombinase, from the chromosomal DNA of S. sonnei KNIH104S. This recombinant plasmid was named pRAK28. E. coli HB101, a recA mutant, cannot grow on Luria-Bertani medium in the presence of the alkylating agent methyl methanesulfonate; however, E. coli HB101 harboring pRAK28 was found to grow on this medium. As far as we know, we are the first to sequence the recA gene from S. sonnei. This gene is composed of 1062 base pairs with an ATG initiation codon and a TAA termination codon. The nucleotide sequence of the S. sonnei recA gene exhibited 99.7% and 99.5% identity with those of S. flexneri and E. coli, respectively.


Spliced leader sequences detected in EST data of the dinoflagellates Cochlodinium polykrikoides and Prorocentrum minimum

  • Guo, Ruoyu;Ki, Jang-Seu
    • ALGAE, v.26 no.3, pp.229-235, 2011
  • Spliced leader (SL) trans-splicing is an mRNA processing mechanism in dinoflagellate nuclear genes. Although studies have identified a short, conserved dinoflagellate SL (dinoSL) sequence (22-nt) in their nuclear-encoded transcripts, whether the majority of nuclear-coded transcripts in dinoflagellates have the dinoSL sequence remains doubtful. In this study, we investigated dinoSL-containing gene transcripts using 454 pyrosequencing data (Cochlodinium polykrikoides, 93 K sequence reads, 31 Mb; Prorocentrum minimum, 773 K sequence reads, 291 Mb). After making comparisons and performing local BLAST searches, we identified dinoSL for one C. polykrikoides gene transcript and eight P. minimum gene transcripts. This showed that transcripts containing the dinoSL sequence were markedly fewer in number than the total expressed sequence tag (EST) transcripts. In addition, we found no direct evidence to prove that most dinoflagellate nuclear-coded transcripts have this dinoSL sequence.

Heterogeneity of Chloroplast DNA in Rice (벼 엽록체 DNA의 이질성)

  • 남백희;문은표
    • Proceedings of the Botanical Society of Korea Conference, 1987.07a, pp.391-401, 1987
  • Plant chloroplast DNA exists as a unique circular structure in which a large single copy (LSC) region and a small single copy (SSC) region are separated by large inverted repeat sequences (IRS). It has been known that the unique existence of inverted repeat sequences in chloroplast DNA has no relation to the stability of the chloroplast DNA but causes inversion between the inverted repeats; its biological significance has not been understood so far. In rice, several gene clusters that contain the ribulose-1,5-bisphosphate carboxylase large subunit gene (rbcL) have been cloned and sequenced. In particular, in one of the gene clusters, an rbcL gene is linked with the rpl2 gene, which is located in the IRS region. Comparison of nucleotide sequences shows that the two genes are linked through a 151 bp repeat sequence that is homologous to the rpl23 gene in the IRS region. The repeat sequence is located 3' downstream of the rbcL gene and near the psbA gene in the LSC region. The existence of these repeat sequences, which are found dispersed in chloroplast DNA, and the presence of gene clusters caused by gene rearrangement through the repeat sequence provide a possible model system to explain the heterogeneity of chloroplast DNA in rice in terms of gene rearrangement.


Feature Selection with Ensemble Learning for Prostate Cancer Prediction from Gene Expression

  • Abass, Yusuf Aleshinloye;Adeshina, Steve A.
    • International Journal of Computer Science & Network Security, v.21 no.12spc, pp.526-538, 2021
  • Machine and deep learning-based models are emerging techniques that are being used to address prediction problems in biomedical data analysis. DNA sequence prediction is a critical problem that has attracted a great deal of attention in the biomedical domain. Machine and deep learning-based models have been shown to provide more accurate results when compared to conventional regression-based models. The prediction of the gene sequence that leads to cancerous diseases, such as prostate cancer, is crucial. Identifying the most important features in a gene sequence is a challenging task. Extracting the components of the gene sequence that can provide an insight into the types of mutation in the gene is of great importance as it will lead to effective drug design and the promotion of the new concept of personalised medicine. In this work, we extracted the exons in the prostate gene sequences that were used in the experiment. We built a Deep Neural Network (DNN) and Bi-directional Long-Short Term Memory (Bi-LSTM) model using a k-mer encoding for the DNA sequence and one-hot encoding for the class label. The models were evaluated using different classification metrics. Our experimental results show that DNN model prediction offers a training accuracy of 99 percent and validation accuracy of 96 percent. The bi-LSTM model also has a training accuracy of 95 percent and validation accuracy of 91 percent.
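The abstract mentions a k-mer encoding of the DNA sequence without giving details. The following is a minimal sketch of one common way to do this: overlapping k-mers mapped to integer indices. The value k = 3, the function names `kmers` and `encode`, and the growing-vocabulary scheme are all assumptions made for illustration, not the authors' implementation.

```python
def kmers(seq: str, k: int = 3) -> list[str]:
    """Split a DNA sequence into overlapping k-mers (stride 1)."""
    seq = seq.upper()
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def encode(seq: str, k: int, vocab: dict[str, int]) -> list[int]:
    """Map each k-mer to an integer index, extending vocab for unseen k-mers."""
    return [vocab.setdefault(km, len(vocab)) for km in kmers(seq, k)]


vocab: dict[str, int] = {}
print(kmers("ATGCGA", 3))          # ['ATG', 'TGC', 'GCG', 'CGA']
print(encode("ATGATG", 3, vocab))  # [0, 1, 2, 0]
```

Integer sequences of this kind could then be padded to a common length and fed to a DNN or Bi-LSTM, although the abstract does not say how the model input was prepared.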

Nucleotide Sequence of a Truncated Proteinase Inhibitor I Gene of Potato (감자에서 분리된 절단형 단백질분해효소 억제제 I 유전자의 염기서열)

  • 이종섭
    • Journal of Plant Biology, v.33 no.4, pp.303-307, 1990
  • A genomic clone carrying a proteinase inhibitor I sequence was isolated and characterized. The clone contained a 0.7 kb EcoRI fragment that hybridized with tomato inhibitor I cDNA. The nucleotide sequence of the EcoRI fragment revealed the presence of a truncated form of a proteinase inhibitor I gene of potato. The truncated gene contained the 5' flanking region and the first exon of a functional proteinase inhibitor I gene. Although the 5' flanking region contained the regulatory sequences TATAAA and CCACT, a deletion of 40 bp had occurred between them.


Genetic Stock Identification of Common Carp (Cyprinus carpio) by Detection of Intraspecific DNA Sequence Variation in the Mitochondrial 12S rRNA Gene (미토콘드리아 12S rRNA 유전자 변이 조사를 통한 잉어(Cyprinus carpio)의 유전학적 동정)

  • 남윤권;주수동;정창화;노충환;조재윤;김동수
    • Journal of Aquaculture, v.10 no.4, pp.403-407, 1997
  • Intraspecific sequence variation was detected by polymerase chain reaction (PCR) and direct sequencing of a 350-nucleotide region of the mitochondrial 12S rRNA gene in two natural populations (Han River and Nakdong River) and one hatchery stock (Jinhae Inland Fisheries Institute) of local strain common carp, one Israeli strain common carp stock from Pukyong National University (PKU), and one hybrid between an Israeli strain common carp female and a local strain common carp male from the PKU stock. There was little variation in the 350 bases of the mitochondrial 12S rRNA gene sequences among the two natural and one hatchery local strain common carp populations, representing about 7 to 20 nucleotide differences (less than 6%). The sequence of specimens from the Han River was more similar to that from the Nakdong River (identity=98.0%) than to that from the Jinhae Inland Fisheries Institute (identity=96.3%). Sequence variation between the Israeli strain and wild local strain common carp was higher than the variation within natural stocks, ranging from 15.7 to 17.7%. The hybrid showed a 12S rRNA gene nucleotide sequence very similar to that of the Israeli strain, with an identity of 98.9%.
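The link between the reported nucleotide differences and percent identity over the 350-base region can be sketched as follows. This is an illustration only; the function name `percent_identity` is hypothetical, and it assumes an ungapped comparison in which every one of the 350 positions is scored as either a match or a difference.

```python
def percent_identity(differences: int, aligned_length: int) -> float:
    """Percent identity for an ungapped comparison of aligned_length positions."""
    if not 0 <= differences <= aligned_length:
        raise ValueError("differences must be between 0 and aligned_length")
    return 100.0 * (aligned_length - differences) / aligned_length


REGION = 350  # bases of the 12S rRNA gene compared in the abstract

for diffs in (7, 20):
    identity = percent_identity(diffs, REGION)
    # Report identity and the complementary divergence, rounded to one decimal.
    print(diffs, round(identity, 1), round(100.0 - identity, 1))
```

Under this assumption, 7 differences correspond to 98.0% identity, and 20 differences correspond to roughly 5.7% divergence, which agrees with the "less than 6%" stated in the abstract.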


Partial Sequence of the Bovine (Bos taurus coreanae) Myogenic Factor Encoding Gene MyoD

  • Kim, H.S.;Park, E.W.;Yoon, D.H.;Kim, H.B.;Cheong, I.C.;Cho, B.W.;Im, K.S.
    • Asian-Australasian Journal of Animal Sciences, v.12 no.5, pp.689-694, 1999
  • This experiment was carried out to isolate the partial bovine (Bos taurus coreanae) myogenic factor encoding gene, MyoD, using the rat myogenic factor (MyoD) gene sequence, and to compare the gene sequences of another myogenic factor (Myf5) and the MyoD gene of the bovine. To make the probe and isolate the MyoD gene, PCR was performed to amplify the rat and bovine MyoD genes, including exons I and II and intron I. The homology between mouse and bovine MyoD is high; the bovine MyoD gene shows 17 regions of sequence difference compared to rat MyoD. Among those, two regions have significant differences: one is the exon I part between 2834 and 2850 bp, and the other is the intron part between 3274 and 3303 bp of the mouse sequence. In these regions, homology was 40% in the former and 50% in the latter. Homology between bovine MyoD and Myf5 was 83% in exon I. In particular, the exon I regions of Myf5 at 602-617 bp and 651-683 bp show significant differences. These results suggest that the MyoD gene has a similar structure in mouse and bovine, and that bovine MyoD and Myf5, at least in part, have similar expression and activity.