• Title/Summary/Keyword: Codon preference

Search Result 11, Processing Time 0.025 seconds

Multivariate Procedure for Variable Selection and Classification of High Dimensional Heterogeneous Data

  • Mehmood, Tahir;Rasheed, Zahid
    • Communications for Statistical Applications and Methods
    • /
    • v.22 no.6
    • /
    • pp.575-587
    • /
    • 2015
  • The development in data collection techniques results in high dimensional data sets, where discrimination is an important and commonly encountered problem that are crucial to resolve when high dimensional data is heterogeneous (non-common variance covariance structure for classes). An example of this is to classify microbial habitat preferences based on codon/bi-codon usage. Habitat preference is important to study for evolutionary genetic relationships and may help industry produce specific enzymes. Most classification procedures assume homogeneity (common variance covariance structure for all classes), which is not guaranteed in most high dimensional data sets. We have introduced regularized elimination in partial least square coupled with QDA (rePLS-QDA) for the parsimonious variable selection and classification of high dimensional heterogeneous data sets based on recently introduced regularized elimination for variable selection in partial least square (rePLS) and heterogeneous classification procedure quadratic discriminant analysis (QDA). A comparison of proposed and existing methods is conducted over the simulated data set; in addition, the proposed procedure is implemented to classify microbial habitat preferences by their codon/bi-codon usage. Five bacterial habitats (Aquatic, Host Associated, Multiple, Specialized and Terrestrial) are modeled. The classification accuracy of each habitat is satisfactory and ranges from 89.1% to 100% on test data. Interesting codon/bi-codons usage, their mutual interactions influential for respective habitat preference are identified. The proposed method also produced results that concurred with known biological characteristics that will help researchers better understand divergence of species.

Synonymous Codon Usage Controls Various Molecular Aspects

  • Im, Eu-Hyun;Choi, Sun Shim
    • Genomics & Informatics
    • /
    • v.15 no.4
    • /
    • pp.123-127
    • /
    • 2017
  • Synonymous sites are generally considered to be functionally neutral. However, there are recent contradictory findings suggesting that synonymous alleles might have functional roles in various molecular aspects. For instance, a recent study demonstrated that synonymous single nucleotide polymorphisms have a similar effect size as nonsynonymous single nucleotide polymorphisms in human disease association studies. Researchers have recognized synonymous codon usage bias (SCUB) in the genomes of almost all species and have investigated whether SCUB is due to random nucleotide compositional bias or to natural selection of any functional exposure generated by synonymous mutations. One of the most prominent observations on the non-neutrality of synonymous codons is the correlation between SCUB and levels of gene expression, such that highly expressed genes tend to have a higher preference toward so-called optimal codons than lowly expressed genes. In relation, it is known that amounts of cognate tRNAs that bind to optimal codons are significantly higher than the amounts of cognate tRNAs that bind to non-optimal codons in genomes. In the present paper, we review various functions that synonymous codons might have other than regulating expression levels.

Development of Recombinant Escherichia coli Expressing Rhodotorula glutinis Epoxide Hydrolase (Rhodotorula glutinis의 epoxide hydrolase 고효율 발현 유전자 재조합 Escherichia coli 생촉매 개발)

  • Lee Soo-Jung;Kim Hee-Sook
    • Journal of Life Science
    • /
    • v.16 no.3 s.76
    • /
    • pp.415-419
    • /
    • 2006
  • The epoxide hydrolase (EH) of Rhodotorula glutinis which has a high enantioselectivity against aromatic epoxide substrates was expressed to high levels in Escherichia coli based on codon usage. We analysed the Preference of codon usage between the yeast, R. glutinis, and bacteria, E. coli. E. coli, Rosetta(DE3)pLysS, harbors pRARE plasmid with tRNA genes for rare-codons was employed as a host strain. The recombinant E. coli expressing R. glutinis EH showed an enhanced enantioselective hydrolysis activity toward racemic styrene oxide. Enantiopure (S)-styrene oxide with a high enantiopurity of 99% ee (enantiomeric excess) was obtained from racemic substrates.

Arg243, Invariably Critical for the Transcriptional Activation of Yeast Gcn4p

  • Cho, Gyu-Chull;Lee, Jae-Yung;Kim, Joon
    • Journal of Microbiology
    • /
    • v.37 no.3
    • /
    • pp.154-158
    • /
    • 1999
  • The arginine residue at position 243 (Arg 243) of the yeast transcription factor, Gcn4p, is invariably conserved among bZIP transcription factors. Using site-directed oligonucleotide saturation mutagenesis involving two-step polymerase chain reaction (PCR) amplification, random mutations were successfully introduced at the codon of 243 in the basic domain of Gcn4p. This mutant library was transformed ito Gcn4p defective yeast strain and selected for the transcriptionally active colonies. All colonies which were transcriptionally active had arginines in the codon 243. In this study, the strand preference by Taq polymerase during mutagenesis was also tested. Oligonucleotides were specially designed to test whether or not the polymerase was preferred using the strand as a template. A population of randomly mutated products were cloned into an appropriate vector and characterized by DNA sequencing analysis. Saturation mutagenesis which was performed efficiently by this method revealed a strong bias in terms of strand preference of Taq polymerase by an approximate ratio of 3 to 1 in this study.

  • PDF

Biosynthesis of Chondroitin in Engineered Corynebacterium glutamicum

  • Cheng, Fangyu;Luozhong, Sijin;Yu, Huimin;Guo, Zhigang
    • Journal of Microbiology and Biotechnology
    • /
    • v.29 no.3
    • /
    • pp.392-400
    • /
    • 2019
  • Chondroitin, the precursor of chondroitin sulfate, which is an important polysaccharide, has drawn significant attention due to its applications in many fields. In the present study, a heterologous biosynthesis pathway of chondroitin was designed in a GRAS (generally recognized as safe) strain C. glutamicum. CgkfoC and CgkfoA genes with host codon preference were synthesized and driven by promoter Ptac, which was confirmed as a strong promoter via GFPuv reporter assessment. In a lactate dehydrogenase (ldh) deficient host, intracellular chondroitin titer increased from 0.25 to 0.88 g/l compared with that in a wild-type host. Moreover, precursor enhancement via overexpressing precursor synthesizing gene ugdA further improved chondroitin titers to 1.09 g/l. Chondroitin production reached 1.91 g/l with the engineered strain C. glutamicum ${\Delta}L-CgCAU$ in a 5-L fed-batch fermentation with a single distribution $M_w$ of 186 kDa. This work provides an alternative, safe and novel means of producing chondroitin for industrial applications.

p53 Polymorphisms and Haplotypes as a Possible Predictor of a High-risk Group for Hepatocellular Carcinoma

  • Sato Shigeaki;Shiraki Takashi;Inoue Yoshiki;Takeshita Tatsuya;Morimoto Kanehisa
    • 대한예방의학회:학술대회논문집
    • /
    • 1999.10a
    • /
    • pp.1-15
    • /
    • 1999
  • In a case-control study to evaluate the factors involved in the development of hepatocellular carcinoma, polymorphisms of the p53 gene were compared in 68 cases mostly infected with hepatitis C virus (HCV) and 68 controls matched for sex and age: DNA from peripheral blood leukocytes was analyzed by the polymerase chain reaction-single strand conformation polymorphism method and direct sequencing. Polymorphisms analyzed were those in exon 4 (CCC vs. CGC, Pro vs. Arg at codon 72, Al allele vs. A2 allele), intron 2 (C vs. G at nucleotide 38, Al vs. A2), intron 3 (C vs. A at nucleotide 65, Al vs. A2; absence and presence of 16 base pair repeat at nucleotides 24 to 39, Al vs. A2), intron 6 (A vs. G at nucleotide 62, Al vs. A2) and intron 7 (C and T vs. T and G at nucleotides 72 and 92, Al vs. A2). A significantly higher frequency of the allele for CCC (Pro, Al) at codon 72 of exon 4 was found in cases (39%) than in controls (26%) (p<0.05). Highly significant linkage of the polymorphisms in exon 4, intron 2, intron 3 and intron 7, and between the intron 3-16 bp duplication and polymorphism in intron 6 also was found. Matched Fair analysis showed significantly higher frequencies of certain haplotypes (1-1-1-1-2-2 or 1-1-2-1-2-1 for exon 4, intron 2, intron 3, the intron 3-16 bp duplication, intron 6 and intron 7) in cases than in controls (p=0.014, OR=2.27, 95% CI= 1.08-5.12). No preference of specific p53 polymorphisms for specific HCV genotype was detected. These findings suggest that in hepatocarcinogenesis mainly due to HCV infection, genetic factors may be involved and that genetic markers can serve as predictors of a high-risk group for hepatocarcinogenesis.

  • PDF

Use of the Synthetic Gene Encoding the Truncated Human Rotavirus VP8* Protein in Escherichia coli for Production of Vaccine Candidates or Development of Diagnostic Antibodies (합성 유전자를 이용하여 Escherichia coli에서 백신 후보의 생산 혹은 진단용 항체의 개발을 위한 인간 rotavirus VP8* 부분 단백질의 발현)

  • Kim, Sang-Rae;Lee, Bheong-Uk
    • Journal of Life Science
    • /
    • v.28 no.4
    • /
    • pp.478-482
    • /
    • 2018
  • Human rotavirus is a causative agent of acute diarrhea among children. The artificial gene encoding the truncated $VP8^*$ protein of human rotavirus A (serotype 1 strain WA) was synthesized according to the Escherichia coli codon preference. The synthetic $VP8^*$ gene also possessed the NdeI and HindIII restriction sites for the convenient in-frame cloning for translation and a 6-histidine tag at C-terminus for Ni+ affinity purification. Molecular weight of the truncated $VP8^*$ protein deduced from the nucleotide sequences of the artificial gene was a 19.7-kDa. This synthetic $VP8^*$ DNA fragment was inserted into the pT7-7 expression vector and transformed into E. coli BL21 (DE3). Transformants harboring the synthetic gene encoding the $VP8^*$ protein was induced by supplement of a final concentration of 0.05 mM ITPG at $20^{\circ}C$. Protein crude extract from the E. coli transformants was subjected to Western blotting with the mouse anti-rotavirus capsid antibody, showing ~20-kDa $VP8^*$ protein band. The truncated $VP8^*$ protein band was also observed by Western blotting using the rabbit polyclonal antibody serum made against the truncated $VP8^*$ protein. This study suggested that the synthetic gene could be used as an easy way to produce the antigenic vaccine candidate for control of virus-associated diseases or to develop antibodies for diagnostic purpose.

Construction of fat1 Gene Expression Vector and Its Catalysis Efficiency in Bovine Fetal Fibroblast Cells

  • Liu, Boyang;Yang, Runjun;Li, Junya;Zhang, Lupei;Liu, Jing;Lu, Chunyan;Lian, Chuanjiang;Li, Zezhong;Zhang, Yong-Hong;Zhang, Liying;Zhao, Zhihui
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.25 no.5
    • /
    • pp.621-628
    • /
    • 2012
  • The FAT-1 protein is an n-3 fatty acid desaturase, which can recognize a range of 18- and 20-carbon n-6 substrates and transform n-6 polyunsaturated fatty acids (PUFAs) into n-3 PUFAs while n-3 PUFAs have beneficial effect on human health. Fat1 gene is the coding sequence from Caenorhabditis elegans which might play an important role on lipometabolism. To reveal the function of fat1 gene in bovine fetal fibroblast cells and gain the best cell nuclear donor for transgenic bovines, the codon of fat1 sequence was optimized based on the codon usage frequency preference of bovine muscle protein, and directionally cloned into the eukaryotic expression vector pEF-GFP. After identifying by restrictive enzyme digests with AatII/XbaI and sequencing, the fusion plasmid pEF-GFP-fat1 was identified successfully. The pEF-GFP-fat1 vector was transfected into bovine fetal fibroblast cells mediated by Lipofectamine2000$^{TM}$. The positive bovine fetal fibroblast cells were selected by G418 and detected by RT-PCR. The results showed that a 1,234 bp transcription was amplified by reverse transcription PCR and the positive transgenic fat1 cell line was successfully established. Then the expression level of fat1 gene in positive cells was detected using quantitative PCR, and the catalysis efficiency was detected by gas chromatography. The results demonstrated that the catalysis efficiency of fat1 was significantly high, which can improve the total PUFAs rich in EPA, DHA and DPA. Construction and expression of pEF-GFP-fat1 vector should be helpful for further understanding the mechanism of regulation of fat1 in vitro. It could also be the first step in the production of fat1 transgenic cattle.

The Stability, and Efficacy Against Penicillin-Resistant Enterococcus faecium, of the Plectasin Peptide Efficiently Produced by Escherichia coli

  • Chen, Xin;Wen, Yaoan;Li, Ling;Shi, Jiawei;Zhu, Zhe;Luo, Yuwen;Li, Yun;Chen, Rui
    • Journal of Microbiology and Biotechnology
    • /
    • v.25 no.7
    • /
    • pp.1007-1014
    • /
    • 2015
  • Plectasin, the first defensin extracted from a fungus (the saprophytic ascomycete Pseudoplectania nigrella), is attractive as a prospective antimicrobial agent. The purpose of this study was to establish a bacterium-based production system and evaluate the antimicrobial activity of the resulting plectasin. A gene encoding plectasin, with the codon preference of Escherichia coli, was optimized based on its amino acid sequence, synthesized using genesplicing with overlap extension PCR, and inserted into the expression vector pGEX-4T-1. The fusion protein was expressed in the soluble fraction of E. coli and purified using glutathione Stransferase affinity chromatography. Plectasin was cleaved from the fusion protein with thrombin and purified by ultrafiltration. The purified plectasin showed strong, concentrationdependent antimicrobial activity against gram-positive bacteria, including antibiotic-resistant bacteria, especially penicillin-resistant Enterococcus faecium. This antimicrobial activity was equal to chemically synthesized plectasin and was maintained over a wide range of pH and temperatures. This soluble recombinant expression system in E. coli is effective for producing plectasin at a relatively lower cost, and higher purity and efficiency than prior systems, and might provide a foundation for developing a large-scale production system. Overall, plectasin shows potential as a novel, high-performance, and safe antibiotic for the treatment of refractory diseases caused by drug-resistant bacterial strains.

Primary Structure of the Human VkII Regions Elicited by Haemophilus influenzae Type b Polysaccharide Vaccines; The J Gene Usage Is Restricted in Child Antibodies Using the A2 Gene

  • Yu, Kang-Yeol;Kim, Jin-Ho;Chung, Gook-Hyun
    • BMB Reports
    • /
    • v.33 no.3
    • /
    • pp.249-255
    • /
    • 2000
  • The Haemophilus influenzae type b (Hib) has been a major cause of bacterial meningitis in children who are less than two years old. The variable (V) region repertoire of adult Caucasian antibodies (Abs) to Hib polysaccharide (PS) has been characterized well. The majority of adult antibodies against Hib uses VL that is derived from the Vk gene A2 and have arginine at the N region. In order to explore the possibility those antibody responses to Hib-PS is variable in various age groups, we examined the VL regions of the antibodies to Hib-PS in Korean adults and children. We immunized Korean adults (n = 8) and children (n = 39) with Hib tetanus conjugated vaccines, isolated RNAs from the peripheral lymphocytes, and amplified the A2-derived VL regions by RT-PCR. The PCR products were subcloned and sequenced. Forty-seven out of 54 independent clones from children used the $J{\kappa}2$, or $J{\kappa}3$ gene in preference. The adults, however, used all of the $J{\kappa}$ genes evenly. With respect to the amino acid sequences of variable regions, adult $A2-J{\kappa}$ recombinants have a germline sequence. But, the 76th codon (AGC) of child $A2-J{\kappa}2$ recombinants was substituted with CGC (arginine) in most cases (88 %) and 77 percent of child clones using the $A2-J{\kappa}3$ genes have isoleucine-109 at the junction of $J{\kappa}-C{\kappa}$ instead of threonine that is found in a germline sequence. These results suggest that the mechanism of antibody production in young children is different from that of adults.

  • PDF