• 제목/요약/키워드: Codon usage

검색결과 55건 처리시간 0.03초

The pattern of coding sequences in the chloroplast genome of Atropa belladonna and a comparative analysis with other related genomes in the nightshade family

  • Satyabrata Sahoo;Ria Rakshit
    • Genomics & Informatics
    • /
    • 제20권4호
    • /
    • pp.43.1-43.18
    • /
    • 2022
  • Atropa belladonna is a valuable medicinal plant and a commercial source of tropane alkaloids, which are frequently utilized in therapeutic practice. In this study, bioinformatic methodologies were used to examine the pattern of coding sequences and the factors that might influence codon usage bias in the chloroplast genome of Atropa belladonna and other nightshade genomes. The chloroplast engineering being a promising field in modern biotechnology, the characterization of chloroplast genome is very important. The results revealed that the chloroplast genomes of Nicotiana tabacum, Solanum lycopersicum, Capsicum frutescens, Datura stramonium, Lyciumbarbarum, Solanum melongena, and Solanum tuberosum exhibited comparable codon usage patterns. In these chloroplast genomes, we observed a weak codon usage bias. According to the correspondence analysis, the genesis of the codon use bias in these chloroplast genes might be explained by natural selection, directed mutational pressure, and other factors. GC12 and GC3S were shown to have no meaningful relationship. Further research revealed that natural selection primarily shaped the codon usage in A. belladonna and other nightshade genomes for translational efficiency. The sequencing properties of these chloroplast genomes were also investigated by investing the occurrences of palindromes and inverted repeats, which would be useful for future research on medicinal plants.

Insights into factors affecting synonymous codon usage in apple mosaic virus and its host adaptability

  • Pourrahim, R.;Farzadfar, Sh.
    • Journal of Plant Biotechnology
    • /
    • 제49권1호
    • /
    • pp.46-60
    • /
    • 2022
  • The genetic variability and population structure of apple mosaic virus (ApMV) have been studied; however, synonymous codon usage patterns influencing the survival rates and fitness of ApMV have not been reported. Based on phylogenetic analyses of 52 ApMV coat protein (CP) sequences obtained from apple, pear, and hazelnut, ApMV isolates were clustered into two groups. High molecular diversity in GII may indicate their recent expansion. A constant and conserved genomic composition of the CP sequences was inferred from the low codon usage bias. Nucleotide composition and relative synonymous codon usage (RSCU) analysis indicated that the ApMV CP gene is AU-rich, but G- and U-ending codons are favored while coding amino acids. This unequal use of nucleotides together with parity rule 2 and the effective number of codon (ENC) plots indicate that mutation pressure together with natural selection drives codon usage patterns in the CP gene. However, in this combination, selection pressure plays a more crucial role. Based on principal component analysis plots, ApMV seems to have originated from apple trees in Europe. However, according to the relative codon deoptimization index and codon adaptation index (CAI) analyses, ApMV exhibited the greatest fitness to hazelnut. As inferred from the results of the similarity index analysis, hazelnut has a major role in shaping ApMV RSCU patterns, which is consistent with the CAI analysis results. This study contributes to the understanding of plant virus evolution, reveals novel information about ApMV evolutionary fitness, and helps find better ApMV management strategies.

Codon Usage Patterns of Tyrosinase Genes in Clonorchis sinensis

  • Bae, Young-An
    • Parasites, Hosts and Diseases
    • /
    • 제55권2호
    • /
    • pp.175-183
    • /
    • 2017
  • Codon usage bias (CUB) is a unique property of genomes and has contributed to the better understanding of the molecular features and the evolution processes of particular gene. In this study, genetic indices associated with CUB, including relative synonymous codon usage and effective numbers of codons, as well as the nucleotide composition, were investigated in the Clonorchis sinensis tyrosinase genes and their platyhelminth orthologs, which play an important role in the eggshell formation. The relative synonymous codon usage patterns substantially differed among tyrosinase genes examined. In a neutrality analysis, the correlation between $GC_{12}$ and $GC_3$ was statistically significant, and the regression line had a relatively gradual slope (0.218). NC-plot, i.e., $GC_3$ vs effective number of codons (ENC), showed that most of the tyrosinase genes were below the expected curve. The codon adaptation index (CAI) values of the platyhelminth tyrosinases had a narrow distribution between 0.685/0.714 and 0.797/0.837, and were negatively correlated with their ENC. Taken together, these results suggested that CUB in the tyrosinase genes seemed to be basically governed by selection pressures rather than mutational bias, although the latter factor provided an additional force in shaping CUB of the C. sinensis and Opisthorchis viverrini genes. It was also apparent that the equilibrium point between selection pressure and mutational bias is much more inclined to selection pressure in highly expressed C. sinensis genes, than in poorly expressed genes.

쌀 저장 단백질 프롤라민 유전자 암호 분석 (Codon usage analysis of rice prolamine genes)

  • 이태호;김주곤;남백희
    • Applied Biological Chemistry
    • /
    • 제36권6호
    • /
    • pp.525-532
    • /
    • 1993
  • 쌀의 주요 저장 단백질중의 하나인 알콜용해성 프롤라민의 성질을 분석하고 이들을 동정하기 위하여 유전자 database로부터 얻은 17개의 프롤라민 유전자의 염기서열을 상호비교 분석하였다. 유전자로부터 유추된 단백질 서열의 다중분석결과 프롤라민들은 계통발생적으로 type에서 type IV의 4개의 군으로 크게 분류할 수 있었다. 이러한 분류는 아미노산 서열 중간과 카복실 말단쪽의 짧은 결손에 의해 여러 형태의 아미노산 서열에 의한 것임을 알 수 있었다. 1군에서 4군까지의 군들은 meththionine과 cysteine과 같은 황을 포함하는 아미노산이 각각 1, 4, 10, 30%로 구성된 특징을 보여 주었다. 또한 각 군들은 등전점이 9.2, 8.2, 6.7, 7.4인 각군별로 매우 상이한 등전점을 나타내었다. 아울러 GC3s에 대한 유효 암호수(effective codon number, Nc)와 우선 암호수(preferred codon number)의 분석과 상관 그래프를 통해서 프롤라민 유전자들의 군별 전이 효율의 차이가 현저하여 프롤라민 생산 수준의 군별차이의 가능성을 제시하였다.

  • PDF

Comparative Analysis of Predicted Gene Expression among Crenarchaeal Genomes

  • Das, Shibsankar;Chottopadhyay, Brajadulal;Sahoo, Satyabrata
    • Genomics & Informatics
    • /
    • 제15권1호
    • /
    • pp.38-47
    • /
    • 2017
  • Research into new methods for identifying highly expressed genes in anonymous genome sequences has been going on for more than 15 years. We presented here an alternative approach based on modified score of relative codon usage bias to identify highly expressed genes in crenarchaeal genomes. The proposed algorithm relies exclusively on sequence features for identifying the highly expressed genes. In this study, a comparative analysis of predicted highly expressed genes in five crenarchaeal genomes was performed using the score of Modified Relative Codon Bias Strength (MRCBS) as a numerical estimator of gene expression level. We found a systematic strong correlation between Codon Adaptation Index and MRCBS. Additionally, MRCBS correlated well with other expression measures. Our study indicates that MRCBS can consistently capture the highly expressed genes.

Insights into the Usage of Nucleobase Triplets and Codon Context Pattern in Five Influenza A Virus Subtypes

  • Deka, Himangshu;Chakraborty, Supriyo
    • Journal of Microbiology and Biotechnology
    • /
    • 제26권11호
    • /
    • pp.1972-1982
    • /
    • 2016
  • Influenza A virus is a single-stranded RNA virus with a genome of negative polarity. Owing to the antigenic diversity and cross concrete shift, an immense number of novel strains have developed astronomically over the years. The present work deals with the codon utilization partialness among five different influenza A viruses isolated from human hosts. All the subtypes showed the homogeneous pattern of nucleotide utilization with a little variation in their utilization frequencies. A lower bias in codon utilization was observed in all the subtypes as reflected by higher magnitudes of an efficacious number of codons. Dinucleotide analysis showed very low CpG utilization and a high predilection of A/T-ending codons. The H5N1 subtype showed noticeable deviation from the rest. Codon pair context analysis showed remarkable depletion of NNC-GNN and NNT-ANN contexts. The findings alluded towards GC-compositional partialness playing a vital role, which is reflected in the consequential positive correlation between the GC contents at different codon positions. Untangling the codon utilization profile would significantly contribute to identifying novel drug targets that will pacify the search for antivirals against this virus.

Rhodotorula glutinis의 epoxide hydrolase 고효율 발현 유전자 재조합 Escherichia coli 생촉매 개발 (Development of Recombinant Escherichia coli Expressing Rhodotorula glutinis Epoxide Hydrolase)

  • 이수정;김희숙
    • 생명과학회지
    • /
    • 제16권3호
    • /
    • pp.415-419
    • /
    • 2006
  • 방향족 에폭사이드 기질에 대한 입체선택적 가수분해능이 우수한 Rhodotorula glutinis의 epoxide hydrolase (EH)를 codon usage를 고려한 Escherichia coli 균주에서 고효율로 발현할 수 있었다. 효모인 R. glutinis와 박테리아인 E. coli에서의 codon usage 선호도를 분석하고 그 차이를 고려하여 E. coli 에서 잘 사용되지 않는 rare codon에 대한 tRNA유전자정보가 들어 있는 pRARE plasmid를 함유한 E. coli 균주인 Rosetta(DE3)pLysS를 숙주세포로 사용하였다. R. glutinis EH를 발현시킨 재조합 E. coli를 생촉매로 사용하여 라세믹 styrene oxide 혼합물과 반응시켰을 때, (R)-styrene oxide에 대한 입체선택적 가수분해활성이 wild type R. glutinis 대비 매우 향상됨을 관찰할 수 있었다. 또한 라세믹 기질로부터 입체적으로 고순도인 99% ee 값을 갖는 광학적으로 순수한 (S)-styrene oxide를 얻을 수 있었다.

Multivariate Procedure for Variable Selection and Classification of High Dimensional Heterogeneous Data

  • Mehmood, Tahir;Rasheed, Zahid
    • Communications for Statistical Applications and Methods
    • /
    • 제22권6호
    • /
    • pp.575-587
    • /
    • 2015
  • The development in data collection techniques results in high dimensional data sets, where discrimination is an important and commonly encountered problem that are crucial to resolve when high dimensional data is heterogeneous (non-common variance covariance structure for classes). An example of this is to classify microbial habitat preferences based on codon/bi-codon usage. Habitat preference is important to study for evolutionary genetic relationships and may help industry produce specific enzymes. Most classification procedures assume homogeneity (common variance covariance structure for all classes), which is not guaranteed in most high dimensional data sets. We have introduced regularized elimination in partial least square coupled with QDA (rePLS-QDA) for the parsimonious variable selection and classification of high dimensional heterogeneous data sets based on recently introduced regularized elimination for variable selection in partial least square (rePLS) and heterogeneous classification procedure quadratic discriminant analysis (QDA). A comparison of proposed and existing methods is conducted over the simulated data set; in addition, the proposed procedure is implemented to classify microbial habitat preferences by their codon/bi-codon usage. Five bacterial habitats (Aquatic, Host Associated, Multiple, Specialized and Terrestrial) are modeled. The classification accuracy of each habitat is satisfactory and ranges from 89.1% to 100% on test data. Interesting codon/bi-codons usage, their mutual interactions influential for respective habitat preference are identified. The proposed method also produced results that concurred with known biological characteristics that will help researchers better understand divergence of species.

Studies on Synonymous Codon and Amino Acid Usage Biases in the Broad-Host Range Bacteriophage KVP40

  • Sau Keya;Gupta Sanjib Kumar;Sau Subrata;Mandal Subhas Chandra;Ghosh Tapash Chandra
    • Journal of Microbiology
    • /
    • 제45권1호
    • /
    • pp.58-63
    • /
    • 2007
  • In this study, the relative synonymous codon and amino acid usage biases of the broad-host range phage, KVP40, were investigated in an attempt to understand the structure and function of its proteins/protein-coding genes, as well as the role of its tRNAs. Synonymous codons in KVP40 were determined to be AT-rich at the third codon positions, and their variations are dictated principally by both mutational bias and translational selection. Further analysis revealed that the RSCU of KVP40 is distinct from that of its Vibrio hosts, V. cholerae and V. parahaemolyticus. Interestingly, the expression of the putative highly expressed genes of KVP40 appear to be preferentially influenced by the abundant host tRNA species, whereas the tRNAs expressed by KVP40 may be required for the efficient synthesis of all its proteins in a diverse array of hosts. The data generated in this study also revealed that KVP40 proteins are rich in low molecular weight amino acid residues, and that these variations are influenced primarily by hydropathy, mean molecular weight, aromaticity, and cysteine content.

Synonymous Codon Usage Analysis of the Mycobacteriophage Bxz1 and Its Plating Bacteria M. smegmatis: Identification of Highly and Lowly Expressed Genes of Bxz1 and the Possible Function of Its tRNA Species

  • Sahu, Keya;Gupta, Sanjib Kumar;Ghosh, Tapash Chandra;Sau, Subrata
    • BMB Reports
    • /
    • 제37권4호
    • /
    • pp.487-492
    • /
    • 2004
  • The extent of codon usage in the protein coding genes of the mycobacteriophage, Bxz1, and its plating bacteria, M. smegmatis, were determined, and it was observed that the codons ending with either G and / or C were predominant in both the organisms. Multivariate statistical analysis showed that in both organisms, the genes were separated along the first major explanatory axis according to their expression levels and their genomic GC content at the synonymous third positions of the codons. The second major explanatory axis differentiates the genes according to their genome type. A comparison of the relative synonymous codon usage between 20 highly- and 20 lowly expressed genes from Bxz1 identified 21 codons, which are statistically over represented in the former group of genes. Further analysis found that the Bxz1- specific tRNA species could recognize 13 out of the 21 over represented synonymous codons, which incorporated 13 amino acid residues preferentially into the highly expressed proteins of Bxz1. In contrast, seven amino acid residues were preferentially incorporated into the lowly expressed proteins by 10 other tRNA species of Bxz1. This analysis predicts for the first time that the Bxz1-specific tRNA species modulates the optimal expression of its proteins during development.