• Title/Summary/Keyword: Exon structure

Analyzing Exon Structure with PCA and ICA of Short-Time Fourier Transform

  • Hwang Changha;Sohn Insuk
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2004.11a
    • /
    • pp.79-84
    • /
    • 2004
  • We use principal component analysis (PCA) to identify the exons of a gene and to analyze their internal structure. The PCA is performed on the short-time Fourier transform (STFT) of the 64 codon sequences and of the 4 nucleotide sequences. By comparison with independent component analysis (ICA), we can differentiate between the exon and intron regions and see how they are correlated in terms of the squared magnitudes of the STFTs. The experiment uses the gene F56F11.4 on chromosome III of C. elegans. For these data, the nucleotide-based PCA clearly identifies the exon and intron regions. The codon-based PCA reveals a weak internal structure in some exon regions but not in others. The ICA result shows that the nucleotides thymine (T) and guanine (G) carry almost all the information about the exon and intron regions for these data. We hypothesize the existence of complex exon structures that deserve more detailed analysis.
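The squared STFT magnitudes that the abstract feeds into PCA can be illustrated with a minimal sketch. It assumes binary indicator sequences for the four nucleotides, a rectangular window, and evaluation only at the period-3 frequency bin. The window length, hop size, and function names are illustrative choices, not taken from the paper, and the PCA/ICA step itself is not shown.

```python
import cmath

NUCLEOTIDES = "ACGT"

def indicator(seq, base):
    """Binary indicator sequence: 1.0 where seq has `base`, else 0.0."""
    return [1.0 if ch == base else 0.0 for ch in seq.upper()]

def stft_power(signal, window, hop, k):
    """Squared magnitude of DFT bin k in each rectangular window."""
    powers = []
    for start in range(0, len(signal) - window + 1, hop):
        coeff = sum(
            signal[start + n] * cmath.exp(-2j * cmath.pi * k * n / window)
            for n in range(window)
        )
        powers.append(abs(coeff) ** 2)
    return powers

def period3_profile(seq, window=30, hop=3):
    """Per-nucleotide power at the period-3 bin (assumed window and hop)."""
    k = window // 3  # bin whose period is exactly 3 when window % 3 == 0
    return {b: stft_power(indicator(seq, b), window, hop, k)
            for b in NUCLEOTIDES}

if __name__ == "__main__":
    toy = "ATG" * 20  # hypothetical, perfectly periodic toy sequence
    for base, powers in period3_profile(toy).items():
        print(base, [round(p, 3) for p in powers[:3]])
```

On a perfectly periodic toy sequence such as repeated "ATG", the A, T, and G indicators peak at this bin while C stays at zero. Stacking such per-nucleotide (or per-codon) power tracks as columns would give the kind of feature matrix on which a PCA could then be run.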

Genomic Structure Analyses of Five Kinds of Human Sialyltransferase Gene (5종류의 인간유래 시알산전이효소 유전자들의 게놈구조 분석)

  • Kang Nam-Young;Kim Sang-Wan;Kim Cheorl-Ho;Lee Young-Choon
    • Journal of Life Science
    • /
    • v.14 no.6 s.67
    • /
    • pp.1009-1017
    • /
    • 2004
  • The sialyltransferases cloned so far show remarkable tissue-specific expression, which correlates with the existence of cell type-specific sialylated sugar structures in glycoconjugates. In previous studies, we found various mRNA isoforms of human sialyltransferases generated by alternative splicing and alternative promoter utilization. To understand the regulatory mechanisms behind the specific expression of human sialyltransferase genes and the production of their mRNA isoforms, we isolated and characterized five human sialyltransferase genes: hST3Gal II, hST8Sia II, hST8Sia III, hST8Sia IV, and hST8Sia V. The hST3Gal II gene is composed of six exons, which span over 17 kb, with exons ranging in size from 46 to over 1017 bp. The hST8Sia III gene spans over 10 kb and consists of only four exons, making it much smaller and simpler than other human sialyltransferase genes. In contrast, three genes (hST8Sia II, hST8Sia IV, and hST8Sia V) span more than 70 kb and comprise five or more exons. All exon-intron boundaries follow the GT-AG rule. In particular, the sialylmotif L, a highly conserved region in all cloned sialyltransferases, was found in one exon of hST8Sia III, whereas this motif is encoded by discrete exons in the other human sialyltransferases. The exon structures of these sialyltransferase genes show structural diversity, as found in other human sialyltransferase genes reported so far. We determined the transcription start site of the hST3Gal II gene by 5'-RACE and cap site hunting experiments.

Label/Quencher-Free Detection of Exon Deletion Mutation in Epidermal Growth Factor Receptor Gene Using G-Quadruplex-Inducing DNA Probe

  • Kim, Hyo Ryoung;Lee, Il Joon;Kim, Dong-Eun
    • Journal of Microbiology and Biotechnology
    • /
    • v.27 no.1
    • /
    • pp.72-76
    • /
    • 2017
  • Detection of the exon 19 deletion mutation in the epidermal growth factor receptor (EGFR) gene, which results in increased and sustained phosphorylation of EGFR, is important for diagnosis and treatment guidelines in non-small-cell lung cancer. Here, we have developed a simple and convenient detection system that uses the interaction between a G-quadruplex and the fluorophore thioflavin T (ThT) to discriminate EGFR exon 19 deletion mutant DNA from wild-type DNA without a label or quencher. In the presence of exon 19 deletion mutant DNA, the probe DNAs annealed to the target sequences were transformed into a G-quadruplex structure. Subsequent intercalation of ThT into the G-quadruplex resulted in a light-up fluorescence signal that reflects the amount of mutant DNA. Given the stark differences in fluorescence intensity between mutant and wild-type DNA, we suggest that the induced G-quadruplex structure in the probe DNA can report the presence of cancer-causing deletion mutant DNAs with high sensitivity.

A Modeling Study of Co-transcriptional Metabolism of hnRNP Using FMR1 Gene

  • Ro-Choi, Tae Suk;Choi, Yong Chun
    • Molecules and Cells
    • /
    • v.23 no.2
    • /
    • pp.228-238
    • /
    • 2007
  • Since the molecular structure of hnRNP will not be available in the foreseeable future, it is best to construct a working model of the hnRNP structure. A geometric problem, the assembly of $700{\pm}20$ nucleotides with 48 proteins, is visualized by a framework in which all the proteins participate in primary binding, followed by secondary, tertiary, and quaternary binding with neighboring proteins without additional import. Thus, the 40S hnRNP contains a crown-like secondary structure (48 stem-loops) and has the appearance of a rose-like architecture with 6 petals (octamers). The proteins are wrapped by RNA. Co-transcriptional folding of the RNP fibril of the FMR1 gene can produce 2,571 stem-loops, with a frequency of 1 stem-loop per 15.3 nucleotides, and a beaded structure of 53 40S hnRNPs. Spliceosome-driven reactions remove 16 separate lariat RNPs, join 17 separate beaded exonic structures, and anchor an EJC on each exon junction. The skipped exon 12 has 5' GU, 3' AG, and a very compact folding pattern, with a frequency of 1 stem-loop per 12 nucleotides over its short length (63 nucleotides). The 5' end of exon 12 contains the splicing silencer (SS) element UAGGU. In exons 10, 15, and 17, where both regular and alternative splice sites exist, an SS (hnRNP A1 binding site) is observed at the regular splice site. The end products are mature FMR-1 mRNP and 4 species of pri-microRNAs derived from introns 7, 9, and 15 and the 3' UTR of exon 17, respectively. There may also be other regulatory RNAs containing Alu/LINE elements.

Deducing Isoform Abundance from Exon Junction Microarray

  • Kim Po-Ra;Oh S.-June;Lee Sang-Hyuk
    • Genomics & Informatics
    • /
    • v.4 no.1
    • /
    • pp.33-39
    • /
    • 2006
  • Alternative splicing (AS) is an important mechanism for producing transcriptome diversity, and microarray techniques are increasingly used to monitor splice variants. Three types of microarrays interrogate AS events: junction, exon, and tiling arrays. Junction probes have the advantage of monitoring the splice site directly. Johnson et al. performed a genome-wide survey of human alternative pre-mRNA splicing with exon junction microarrays (Science 302:2141-2144, 2003), which monitored splicing at every known exon-exon junction for more than 10,000 multi-exon human genes in 52 tissues and cell lines. Here, we describe an algorithm to deduce the relative concentration of isoforms from the junction array data. Non-negative matrix factorization (NMF) is applied to obtain the transcript structure inferred from the expression data. We then choose the transcript models consistent with the ECgene model of alternative splicing, which is based on mRNA and EST alignment. The probe-transcript matrix is constructed from the NMF-consistent ECgene transcripts, and the isoform abundance is deduced by non-negative least squares (NNLS) fitting of the experimental data. Our method can easily be extended to other types of microarrays with exon or junction probes.
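The final NNLS step can be illustrated with a small sketch. It assumes a binary probe-transcript matrix (rows are junction probes, columns are isoforms) and solves the non-negative least squares problem by projected gradient descent. That solver is a simple stand-in chosen for readability, not necessarily the one used in the paper, and the matrix and intensities below are hypothetical.

```python
def nnls_projected_gradient(A, y, iterations=5000):
    """Minimize ||A x - y||^2 subject to x >= 0 by projected gradient descent."""
    m, n = len(A), len(A[0])
    # The squared Frobenius norm equals trace(A^T A), which bounds the largest
    # eigenvalue of A^T A, so its reciprocal is a safe step size.
    step = 1.0 / sum(A[i][j] ** 2 for i in range(m) for j in range(n))
    x = [0.0] * n
    for _ in range(iterations):
        residual = [sum(A[i][j] * x[j] for j in range(n)) - y[i]
                    for i in range(m)]
        grad = [sum(A[i][j] * residual[i] for i in range(m))
                for j in range(n)]
        # Gradient step followed by projection onto the non-negative orthant.
        x = [max(0.0, x[j] - step * grad[j]) for j in range(n)]
    return x

def relative_abundance(x):
    """Normalize abundances so that they sum to 1."""
    total = sum(x)
    return [v / total for v in x] if total > 0 else x

if __name__ == "__main__":
    # Hypothetical design: probe 0 hits both isoforms,
    # probe 1 only isoform 0, probe 2 only isoform 1.
    probe_transcript = [[1, 1], [1, 0], [0, 1]]
    intensities = [4.0, 3.0, 1.0]
    abundance = nnls_projected_gradient(probe_transcript, intensities)
    print(relative_abundance(abundance))
```

The non-negativity constraint matters when noise would push an unconstrained fit to a negative abundance. In that case the projection pins the corresponding isoform at zero instead.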

MOLECULAR CLONING OF CHICKEN INTERFERON-GAMMA (닭 인터페론 유전자의 클로닝에 관한 연구)

  • ;Hyun Lillehoj
    • Proceedings of the Korea Society of Poultry Science Conference
    • /
    • 1999.11a
    • /
    • pp.34-50
    • /
    • 1999
  • A cDNA encoding chicken interferon-gamma (chIFN-${\gamma}$) was amplified from P34, a CD4$^{+}$ T-cell hybridoma, by reverse transcription-polymerase chain reaction (RT-PCR) and cloned into pUC18. The sequences of the cloned PCR products were determined to confirm correct cloning. Using this cDNA as a probe, a chicken genomic library from White Leghorn spleen was screened. Phage clones harboring the chIFN-${\gamma}$ gene were isolated and their genomic structure was elucidated. The chIFN-${\gamma}$ gene contains 4 exons and 3 introns spanning over 14 kb, and follows the GT/AG rule for correct splicing at the exon/intron boundaries. The four exons encode 41, 26, 57, and 40 amino acids, respectively, suggesting that the overall structure of IFN-${\gamma}$ is evolutionarily conserved in mammalian and avian species. The 5’-untranslated region and signal sequences are located in exon 1. Several AT-rich sequences located in the fourth exon may indicate a role in mRNA turnover. The 5’-flanking region contains sequences homologous to the potential binding sites for the mammalian transcription factors activator protein-1 (AP-1), activator protein-2 (AP-2), cAMP-response element binding protein (CREB), activating transcription factor (ATF), GATA-binding factor (GATA), and upstream stimulating factor (USF). This suggests that the mechanisms underlying transcriptional regulation of the chicken and mammalian IFN-${\gamma}$ genes may be similar.

Alteration of Substrate Specificity by Common Variants, E158K/E308G and V257M, in Human Hepatic Drug-metabolizing Enzyme, Flavin-containing Monooxygenase 3

  • Lee, Jung-Kyu;Kang, Ju-Hee;Cha, Young-Nam;Chung, Woon-Gye;Park, Chang-Shin
    • The Korean Journal of Physiology and Pharmacology
    • /
    • v.7 no.3
    • /
    • pp.157-162
    • /
    • 2003
  • Our earlier studies found a significant correlation between the activity of ranitidine N-oxidation catalyzed by hepatic flavin-containing monooxygenase (FMO) and the presence of mutations in exon 4 (E158K) and exon 7 (E308G) of the FMO3 gene in Korean volunteers. However, caffeine N-1 demethylation (which is also partially catalyzed by FMO) was not significantly correlated with these FMO3 mutations. In this study, we examined another common mutation (V257M) in exon 6 of the FMO3 gene. The V257M variant, which is caused by a point mutation (G769A), was commonly observed (13.21% allele frequency) in our subjects (n=159). This point mutation causes a substitution of $Val^{257}$ to $Met^{257}$, with transformation of the secondary structure. The presence of this mutant allele correlated significantly with a reduction in caffeine N-1-demethylating activity, but was not correlated with the activity of N-oxidation of ranitidine. In a family study, the low FMO activity observed in a person heterozygous for a nonsense mutation in exon 4 (G148X) and heterozygous for a missense mutation in exon 6 (V257M) of FMO3 was attributed to these mutations. Our results suggest that various point mutations in the coding regions of FMO3 may influence FMO3 activity in a substrate-dependent manner, with probe substrates of differing chemical structure correlating with different mutations in the FMO3 gene.
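The link between the nucleotide change G769A and the amino acid change V257M is plain codon arithmetic, sketched below. Two points are assumptions rather than statements from the abstract. First, the wild-type codon is taken to be GTG, which is inferred because ATG is the only methionine codon in the standard genetic code. Second, the count of 42 variant alleles is back-calculated from the reported 13.21% and n=159, assuming two alleles per subject. The function names are illustrative.

```python
def codon_position(nt_pos):
    """Map a 1-based coding-sequence position to (codon number, position in codon)."""
    codon = (nt_pos - 1) // 3 + 1
    offset = (nt_pos - 1) % 3 + 1
    return codon, offset

def apply_substitution(codon, offset, new_base):
    """Replace the base at 1-based `offset` within a three-letter codon."""
    return codon[:offset - 1] + new_base + codon[offset:]

def allele_frequency_pct(variant_alleles, subjects):
    """Allele frequency in percent, assuming a diploid autosomal locus."""
    return round(100 * variant_alleles / (2 * subjects), 2)

# Only the two codons needed for this example (standard genetic code).
CODON_TO_AA = {"GTG": "Val", "ATG": "Met"}

if __name__ == "__main__":
    codon, offset = codon_position(769)
    print(codon, offset)  # nucleotide 769 is the first base of codon 257
    mutant = apply_substitution("GTG", offset, "A")  # GTG is inferred, see above
    print(CODON_TO_AA["GTG"], "->", CODON_TO_AA[mutant])
    print(allele_frequency_pct(42, 159))  # 42 is a back-calculated assumption
```

Position 769 falls on the first base of codon 257, so a G-to-A change there turns a valine codon into the methionine codon, which agrees with the reported V257M substitution.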

Effective Exon-Intron Structure Verification of a 1-Pyrroline-5-Carboxylate-Synthetase Gene from Halophytic Leymus chinensis (Trin.) Based on PCR, DNA Sequencing, and Alignment

  • Sun, Yan-Lin;Hong, Soon-Kwan
    • Korean Journal of Plant Resources
    • /
    • v.23 no.6
    • /
    • pp.526-534
    • /
    • 2010
  • Genomes of clusters of related eukaryotes are now being sequenced at an increasing rate. In this paper, we developed an accurate, low-cost method for annotating gene predictions and exon-intron structure. The gene prediction was applied to the delta 1-pyrroline-5-carboxylate-synthetase (p5cs) gene from a wild type from China of the halophyte Leymus chinensis (Trin.), which is naturally adapted to highly alkaline soils. Because of the complex adaptive mechanisms in halophytes, increasing attention is being paid to the regulatory elements of stress adaptation in these plants. P5CS encodes delta 1-pyrroline-5-carboxylate-synthetase, a key regulatory enzyme in the biosynthesis of proline that correlates directly with proline accumulation in vivo and positively with stress tolerance. Using reverse transcription-polymerase chain reaction (RT-PCR), PCR, and direct sequencing, we obtained 1076 base pairs (bp) of cDNA and 2396 bp of genomic DNA. Through gene prediction and exon-intron structure verification, the full-length cDNA sequence was divided into eight parts by seven intron insertions. The average lengths of the determined coding and non-coding regions were 154.17 bp and 188.57 bp, respectively. Nearly all splice sites displayed GT as the donor site at the 5' end of the intron, and 71.43% displayed AG as the acceptor site at the 3' end of the intron. We conclude that this method is a cost-effective way of obtaining an experimentally verified genome annotation.
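A check of the GT-AG rule like the one summarized above can be sketched in a few lines. The intron sequences below are hypothetical toy strings, not data from the paper. They are chosen so that 5 of 7 introns end in AG, because 5/7 rounds to the reported 71.43%; reading the reported percentage as 5 of 7 is an inference, not something the abstract states.

```python
def splice_site_summary(introns):
    """Percentage of introns with a GT donor (5' end) and an AG acceptor (3' end)."""
    n = len(introns)
    donors = sum(1 for s in introns if s.upper().startswith("GT"))
    acceptors = sum(1 for s in introns if s.upper().endswith("AG"))
    return {
        "donor_GT_pct": round(100 * donors / n, 2),
        "acceptor_AG_pct": round(100 * acceptors / n, 2),
    }

# Hypothetical intron sequences for illustration only.
TOY_INTRONS = [
    "GTAAGTTTCAG",
    "GTATGCCTTAG",
    "GTGAGTCCCAG",
    "GTAAGATTTAG",
    "GTACGTTGCAG",
    "GTAAGTCTTAC",  # canonical donor, non-canonical acceptor
    "GCAAGTTTTGG",  # non-canonical donor and acceptor
]

if __name__ == "__main__":
    print(splice_site_summary(TOY_INTRONS))
```

In practice the intron sequences would come from aligning the cDNA against the genomic sequence and extracting the gaps. That alignment step is outside this sketch.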

Fluorometric Detection of Low-Abundance EGFR Exon 19 Deletion Mutation Using Tandem Gene Amplification

  • Kim, Dong-Min;Zhang, Shichen;Kim, Minhee;Kim, Dong-Eun
    • Journal of Microbiology and Biotechnology
    • /
    • v.30 no.5
    • /
    • pp.662-667
    • /
    • 2020
  • Epidermal growth factor receptor (EGFR) mutations are not only genetic markers for diagnosis but also biomarkers of clinical response to tyrosine kinase inhibitors (TKIs) in non-small cell lung cancer (NSCLC). Among the EGFR mutations, the in-frame deletion in the EGFR exon 19 kinase domain (EGFR exon 19-del) is the most frequent, accounting for about 45% of EGFR mutations in NSCLCs. A sensitive method for detecting this EGFR mutation is needed to improve screening for drug response in the treatment of NSCLC patients. Here, we developed a fluorometric tandem gene amplification assay for sensitive detection of low-abundance EGFR exon 19-del mutant genomic DNA (gDNA). The method consists of pre-amplification by PCR, thermal cycling of ligation by Taq ligase, and subsequent rolling circle amplification (RCA). PCR-amplified DNA from genomic DNA samples was used as splint DNA to join both ends of linear padlock DNA, generating a circular padlock DNA template for RCA. Long stretches of ssDNA harboring multiple copies of the G-quadruplex structure were generated by RCA and detected by the fluorescence of thioflavin T (ThT), which intercalates specifically into the G-quadruplex and emits strong fluorescence. The sensitivity of the tandem gene amplification assay for detecting EGFR exon 19-del in gDNA was as low as 3.6 pg, and mutant gDNA present in pooled normal plasma was readily detected at fractions as low as 1%. Hence, fluorometric detection of the low-abundance EGFR exon 19 deletion mutation using tandem gene amplification may be applicable to the clinical diagnosis of NSCLC patients for appropriate TKI treatment.