• Title/Summary/Keyword: Genomic Sequence

Search Result 887, Processing Time 0.028 seconds

Sequential Polyadenylation to Enable Alternative mRNA 3' End Formation

  • Yajing Hao;Ting Cai;Chang Liu;Xuan Zhang;Xiang-Dong Fu
    • Molecules and Cells
    • /
    • v.46 no.1
    • /
    • pp.57-64
    • /
    • 2023
  • In eukaryotic cells, a key RNA processing step to generate mature mRNA is the coupled reaction for cleavage and polyadenylation (CPA) at the 3' end of individual transcripts. Many transcripts are alternatively polyadenylated (APA) to produce mRNAs with different 3' ends that may either alter protein coding sequence (CDS-APA) or create different lengths of 3'UTR (tandem-APA). As the CPA reaction is intimately associated with transcriptional termination, it has been widely assumed that APA is regulated cotranscriptionally. Isoforms terminated at different regions may have distinct RNA stability under different conditions, thus altering the ratio of APA isoforms. Such differential impacts on different isoforms have been considered as post-transcriptional APA, but strictly speaking, this can only be considered "apparent" APA, as the choice is not made during the CPA reaction. Interestingly, a recent study reveals sequential APA as a new mechanism for post-transcriptional APA. This minireview will focus on this new mechanism to provide insights into various documented regulatory paradigms.

An Integrated Genomic Resource Based on Korean Cattle (Hanwoo) Transcripts

  • Lim, Da-Jeong;Cho, Yong-Min;Lee, Seung-Hwan;Sung, Sam-Sun;Nam, Jung-Rye;Yoon, Du-Hak;Shin, Youn-Hee;Park, Hye-Sun;Kim, Hee-Bal
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.23 no.11
    • /
    • pp.1399-1404
    • /
    • 2010
  • We have created a Bovine Genome Database, an integrated genomic resource for Bos taurus, by merging bovine data from various databases and our own data. We produced 55,213 Korean cattle (Hanwoo) ESTs from cDNA libraries from three tissues. We concentrated on genomic information based on Hanwoo transcripts and provided user-friendly search interfaces within the Bovine Genome Database. The genome browser supported alignment results for the various types of data: Hanwoo EST, consensus sequence, human gene, and predicted bovine genes. The database also provides transcript data information, gene annotation, genomic location, sequence and tissue distribution. Users can also explore bovine disease genes based on comparative mapping of homologous genes and can conduct searches centered on genes within user-selected quantitative trait loci (QTL) regions. The Bovine Genome Database can be accessed at http://bgd.nabc.go.kr.

Use of 16S-23S rRNA Intergenic Spacer Region for Species-specific Primer Developed of Vibrio Ichthyoenteri (16S-23S rRNA Intergenic Spacer Region을 이용한 Vibrio ichthyoenteri Species-specific Primer 개발)

  • Moon Young-Gun;Heo Moon-Soo
    • Korean Journal of Microbiology
    • /
    • v.41 no.2
    • /
    • pp.117-124
    • /
    • 2005
  • Two bacterial isolates obtained from rotifer and diseased olive flounder larvae, Paralichthys olivaceus, were identified as Vibrio ichthyoenteri based on the results of phenotypic characterization. In an attempt to develop rapid PCR method for the detection of V. ichthyoenteri, we examined the 16S-23S rRNA intergenic spacer region(ISR) of V. ichthyoenteri and developed species-specific primer for V. ichthyoenteri. Analysis of the ISR sequences showed that V. ichthyoenteri contains one type of polymorphic ISRs. The size of ISRs was 348 bp length and did not contain tRNA genes. Mutiple alignment of representative sequences from different V. species revealed several domains of high sequence variability, and allowed to design species-specific primer for detection of V. ichthyoenteri. The specificity of the primer was examined using genomic DNA prepared from 19 different V. species, isolated 18group Vibrio species and most similar sequence of other known Vibrio species. The results showed that the PCR reaction using species-specific primer designed in this study can be used to detect V. ichthyoenteri.

Comparative Genomic Analysis of Lactobacillus rhamnosus BFE5264, a Probiotic Strain Isolated from Traditional Maasai Fermented Milk

  • Jeong, Haeyoung;Choi, Sanghaeng;Park, Gun-Seok;Ji, Yosep;Park, Soyoung;Holzapfel, Wilhelm Heinrich;Mathara, Julius Maina;Kang, Jihee
    • Microbiology and Biotechnology Letters
    • /
    • v.47 no.1
    • /
    • pp.25-33
    • /
    • 2019
  • Lactobacillus rhamnosus BFE5264, isolated from a Maasai fermented milk product ("kule naoto"), was previously shown to exhibit bile acid resistance, cholesterol assimilation, and adhesion to HT29-MTX cells in vitro. In this study, we re-annotated and analyzed the previously reported complete genome sequence of strain BFE5264. The genome consists of a circular chromosome of 3,086,152 bp and a putative plasmid, which is the largest one identified among L. rhamnosus strains. Among the 2,883 predicted protein-coding genes, those with carbohydrate-related functions were the most abundant. Genome analysis of strain BFE5264 revealed two consecutive CRISPR regions and no known virulence factors or antimicrobial resistance genes. In addition, previously known highly variable regions in the genomes of L. rhamnosus strains were also evident in strain BFE5264. Pairwise comparison with the most studied probiotic strain L. rhamnosus GG revealed strain BFE5264-specific deletions, probably due to insertion sequence-mediated recombination. The latter was associated with loss of the spaCBA pilin gene cluster and exopolysaccharide biosynthetic genes. Comparative genomic analysis of the sequences from all available L. rhamnosus strains revealed that they were clustered into two groups, being within the same species boundary based on the average nucleotide identities. Strain BFE5264 had a sister group relationship with the group that contained strain GG, but neither ANI-based hierarchical clustering nor core-gene-based phylogenetic tree construction showed a clear distinctive pattern associated with the isolation source, implying that the genotype alone cannot account for their ecological niches. These results provide insights into the probiotic mechanisms of strain BFE5264 at the genomic level.

Use of Microsatellite Markers Derived from Genomic and Expressed Sequence Tag (EST) Data to Identify Commercial Watermelon Cultivars (수박 시판 품종의 식별을 위한 Genomic과 Expressed Sequence Tag (EST)에서 유래된 Microsatellite Marker의 이용)

  • Kwon, Yong-Sham;Hong, Jee-Hwa;Kim, Du-Hyun;Kim, Do-Hoon
    • Horticultural Science & Technology
    • /
    • v.33 no.5
    • /
    • pp.737-750
    • /
    • 2015
  • This study was carried out to construct a DNA profile database for 102 watermelon cultivars through the comparison of polymorphism level and genetic relatedness using genomic microsatellite (gMS) and expressed sequence tag (EST)-microsatellite (eMS) markers. Sixteen gMS and 10 eMS primers showed hyper-variability and were able to represent the genetic variation within 102 watermelon cultivars. With gMS markers, an average of 3.63 alleles per marker were detected with a polymorphism information content (PIC) value of 0.479, whereas with eMS markers, the average number of alleles per marker was 2.50 and the PIC value was 0.425, indicating that eMS detects a lower polymorphism level compared to gMS. Cluster analysis and Jaccard's genetic distance coefficients using the unweighted pair group method with arithmetic average (UPGMA) based on the gMS, eMS, and combined data sets showed that 102 commercial watermelon cultivars could be categorized into 6 to 8 major groups corresponding to phenotypic traits. Moreover, this method was sufficient to identify 78 out of 102 cultivars. Correlation analysis with Mantel tests for those clusters using 3 data sets showed high correlation ($r{\geq}0.80$). Therefore, the microsatellite markers used in this study may serve as a useful tool for germplasm evaluation, genetic purity assessment, and fingerprinting of watermelon cultivars.

Isolation of a Rice Genomic Clone Encoding Ribulose-1,5-bisphosphate Carboxylase (리블로스 1,5- 이인산 탄산화효소 유전자의 분리 및 특성규명)

  • Park, Sung-Soon;Kim, Hee-Jin;Kim, Chung-Ho;Kim, Han-Jip;Lee, Jong-Seob;Lee, Kwang-Woong;Choi, Yang-Do
    • Applied Biological Chemistry
    • /
    • v.37 no.5
    • /
    • pp.361-369
    • /
    • 1994
  • To study the light-induced expression mechanism and protein transport into the chloroplast, a rice genomic clone (GrbcS) for the small subunit of ribulose 1,5-bisphosphate carboxylase (rbcS) was isolated and its nucleotide sequence was determined. Nucleotide sequence analysis of GrbcS revealed that the gene consists of two exons interrupted by an intron, encoding a protein of 175 amino acids including a transit peptide of 47 amino acids. These structural features of GrbcS are consistent with those of other rbcS genes from monocot species. Genomic Southern blot analysis suggested that the rbcS genes are present as a relatively small multigene family in the rice genome. Comparison of the nucleotide and deduced amino acid sequences to other rice rbcSs shows close sequence similaritiy. Conserved DNA sequences present in other light-responsive genes are also found in the 5’ upstream region of GrbcS such as G-box, 3AF1-binding site and GATA site. The possible function of these putative regulatory elements are discussed.

  • PDF

Genomic Organization and Characterization of the Promoter Region of Bovine ADRP (Adipocyte Different Related Protein) Gene (소 Adipocyte Differentiation Related Protein (ADRP) 유전자의 Genomic Organization 및 Promoter Region의 특성 규명)

  • Jang, Y. S.;Yoon, D. H.;Kim, T. H.;Cheong, I. C.;Jo, J. K.
    • Journal of Animal Science and Technology
    • /
    • v.45 no.2
    • /
    • pp.169-182
    • /
    • 2003
  • To understand the structure and regulation of bovine ADRP (Adipocyte Differentiation Related Protein) gene, we have isolated the genomic clone of bovine ADRP and determined its sequence. A genomic Southern blot analysis confirmed that ADRP gene is present as a single copy in bovine genome and the ADRP gene spans 12 kb. Bovine ADRP genomic clone, HwADRPg-1, had 8 exons and 7 introns, and all splicing sites conformed to the GT/AG rule with the exon-intron boundaries located exactly. Analysis of the upstream 649 bp of the sequence of HwADRPg-1 showed that it does not contain any canonical TATAA boxes; however Sp1 binding sites and CAAT boxes are found. The promoter contained potential binding sites for AP-1, AP-2 and several putative transcription factor binding sites. The 5'-flanking region of HwADRPg-1 contained muscle specific transcription activator Myo G and C/EBP (CCAAT/ enhancer binding protein) recognizing site. These results suppose that the Myo G transcription activator regulate the transcription of bovine ADRP gene in muscular tissue and its transcriptional activity was triggered by degree of muscular development. Our results provide the necessary analysis for other flanking sequences are needed in addition to the proximal cis elements of this promoter to confer adipocyte differentiation-dependent or growth-dependent transcriptional control.

Protein Sequence Search based on N-gram Indexing

  • Hwang, Mi-Nyeong;Kim, Jin-Suk
    • Bioinformatics and Biosystems
    • /
    • v.1 no.1
    • /
    • pp.46-50
    • /
    • 2006
  • According to the advancement of experimental techniques in molecular biology, genomic and protein sequence databases are increasing in size exponentially, and mean sequence lengths are also increasing. Because the sizes of these databases become larger, it is difficult to search similar sequences in biological databases with significant homologies to a query sequence. In this paper, we present the N-gram indexing method to retrieve similar sequences fast, precisely and comparably. This method regards a protein sequence as a text written in language of 20 amino acid codes, adapts N-gram tokens of fixed-length as its indexing scheme for sequence strings. After such tokens are indexed for all the sequences in the database, sequences can be searched with information retrieval algorithms. Using this new method, we have developed a protein sequence search system named as ProSeS (PROtein Sequence Search). ProSeS is a protein sequence analysis system which provides overall analysis results such as similar sequences with significant homologies, predicted subcellular locations of the query sequence, and major keywords extracted from annotations of similar sequences. We show experimentally that the N-gram indexing approach saves the retrieval time significantly, and that it is as accurate as current popular search tool BLAST.

  • PDF

Simple Sequence Repeat (SSR) and GC Distribution in the Arabidopsis thaliana Genome

  • Mortimer Jennifer C;Batley Jacqueline;Love Christopher G;Logan Erica;Edwards David
    • Journal of Plant Biotechnology
    • /
    • v.7 no.1
    • /
    • pp.17-25
    • /
    • 2005
  • We have mined each of the five A. thaliana chromosomes for the presence of simple sequence repeats (SSRs) and developed custom perl scripts to examine their distribution and abundance in relation to genomic position, local G/C content and location within and around transcribed sequences. The distribution of repeats and G/C content with respect to genomic regions (exons, UTRs, introns, intergenic regions and proximity to expressed genes) are shown. SSRs show a non-random distribution across the genome and a strong association within and around transcribed sequences, while G/C density is associated specifically with the coding portions of transcribed sequences. SSR motif repeat number shows a high degree of variation for each SSR type and a high degree of motif sequence bias reflecting local genome sequence composition. PCR primers suitable for the amplification of identified SSRs have been designed where possible, and are available for further studies.

Comparative Genomics Study of Interferon-$\alpha$ Receptor-1 in Humans and Chimpanzees

  • Kim, Il-Chul;Chi, Seung-Wook;Kim, Dae-Won;Choi, Sang-Haeng;Chae, Sung-Hwa;Park, Hong-Seog
    • Genomics & Informatics
    • /
    • v.3 no.4
    • /
    • pp.142-148
    • /
    • 2005
  • The immune response-related genes have been suggested to be the most favorable genes for positive selection during evolution. Comparing the entire DNA sequence of chimpanzee chromosome 22 (PTR22) with human chromosome 21 (HSA21), we have identified 15 orthologs having indel in their coding sequences. Among them, interferon-${\alpha}$ receptor-1 gene (IFNAR1), an immuneresponse-related gene, is subjected to comparative genomic analysis. Chimpanzee IFNAR1 showed the same genomic structure as human IFNAR1 (11 exons and 10 introns) except the 3 bp insertion in exon 4. The sequence alignment of IFNAR1 coding sequence indicated that 'ISPP' amino acid sequence motif is highly conserved in chimpanzee and other animals including mouse and chicken. However, the human IFNAR1 shows that one proline residue is missing in the sequence motif. The homology modeling of the IFNAR1 structures suggests that the proline deletion in human IFNAR1 leads to the formation of the following ${\alpha}$-helix, whereas two sequential prolines in chimpanzee IFNAR1 inhibit it. As a result, human IFNAR1 may adopt a characteristic structure distinct from chimpanzee IFNAR1. This human specific trait could contribute to specific immune response in the most optimized manner for humans. Further molecular biological studies on the IFNAR1 will help us to gain insights into the molecular implication of species-specific host-pathogen interaction in primate evolution.