• Title/Summary/Keyword: Genomic DNA library

Search Result 220, Processing Time 0.023 seconds

A Universal Analysis Pipeline for Hybrid Capture-Based Targeted Sequencing Data with Unique Molecular Indexes

  • Kim, Min-Jung;Kim, Si-Cho;Kim, Young-Joon
    • Genomics & Informatics
    • /
    • v.16 no.4
    • /
    • pp.29.1-29.5
    • /
    • 2018
  • Hybrid capture-based targeted sequencing is being used increasingly for genomic variant profiling in tumor patients. Unique molecular index (UMI) technology has recently been developed and helps to increase the accuracy of variant calling by minimizing polymerase chain reaction biases and sequencing errors. However, UMI-adopted targeted sequencing data analysis is slightly different from the methods for other types of omics data, and its pipeline for variant calling is still being optimized in various study groups for their own purposes. Due to this provincial usage of tools, our group built an analysis pipeline for global application to many studies of targeted sequencing generated with different methods. First, we generated hybrid capture-based data using genomic DNA extracted from tumor tissues of colorectal cancer patients. Sequencing libraries were prepared and pooled together, and an 8-plexed capture library was processed to the enrichment step before 150-bp paired-end sequencing with Illumina HiSeq series. For the analysis, we evaluated several published tools. We focused mainly on the compatibility of the input and output of each tool. Finally, our laboratory built an analysis pipeline specialized for UMI-adopted data. Through this pipeline, we were able to estimate even on-target rates and filtered consensus reads for more accurate variant calling. These results suggest the potential of our analysis pipeline in the precise examination of the quality and efficiency of conducted experiments.

Generation of Protein Lineages with new Sequence Spaces by Functional Salvage Screen

  • Kim, Geun-Joong;Cheon, Young-Hoon;Park, Min-Soon;Park, Hee-Sung;Kim, Hak-Sung
    • Proceedings of the Korean Society for Applied Microbiology Conference
    • /
    • 2001.06a
    • /
    • pp.77-80
    • /
    • 2001
  • A variety of different methods to generate diverse proteins, including random mutagenesis and recombination, are currently available, and most of them accumulate the mutations on the target gene of a protein, whose sequence space remains unchanged. On the other hand, a pool of diverse genes, which is generated by random insertions, deletions, and exchange of the homologous domains with different lengths in the target gene, would present the protein lineages resulting in new fitness landscapes. Here we report a method to generate a pool of protein variants with different sequence spaces by employing green fluorescent protein (GFP) as a model protein. This process, designated functional salvage screen (FSS), comprises the following procedures: a defective GFP template expressing no fluorescence is firstly constructed by genetically disrupting a predetermined region(s) of the protein, and a library of GFP variants is generated from the defective template by incorporating the randomly fragmented genomic DNA from E. coli into the defined region(s) of the target gene, followed by screening of the functionally salvaged, fluorescence-emitting GFPs. Two approaches, sequence-directed and PCR-coupled methods, were attempted to generate the library of GFP variants with new sequences derived from the genomic segments of E. coli. The functionally salvaged GFPs were selected and analyzed in terms of the sequence space and functional property. The results demonstrate that the functional salvage process not only can be a simple and effective method to create protein lineages with new sequence spaces, but also can be useful in elucidating the involvement of a specific region(s) or domain(s) in the structure and function of protein.

  • PDF

Partial genomic sequence of baulovirus associated with white spot syndrome (WSBV) isolated from penaeid shrimp P. chinensis (대하새우로부터 분리한 WSBV의 게놈서열 분석)

  • Kim, Chong-Kyung;Sohn, Sang-Gyu;Heo, Moon-Soo;Lee, Tae-Ho;Jun, Hong-Ki;Jang, Kyung-Lib
    • Journal of fish pathology
    • /
    • v.10 no.2
    • /
    • pp.87-95
    • /
    • 1997
  • Baculovirus associated with white spot syndrome (WSBV) is the causative agent of a disease with high mortalities and causes severe damage to shrimp cultures. In this study, we analyzed a recombinant clone (E3) obtained from a viral genomic library to characterize the causative agent in diseased shrimp Penaeus chinensis with white spot syndrome. According to the analysis of nucleotide sequence of E3, this clone did not showed considerable sequence homology with those of other known viruses, including baculovirus Autographa californica nuclear polyhedrosis virus (AcNPV), indicating that WSBV is a novel virus causing a serious disease in P. chinensis. Based on the sequence of E3 clone, a pair of PCR primers was designed. After 30 cycles of amplification, a specific product of the expected size was detected only if the total nucleic acids extracted from the diseased shrimp was used as a template DNA, suggesting that this method can be used to diagnose the virus infection in diseased shrimp.

  • PDF

Construction of EST Database for Comparative Gene Studies of Acanthamoeba

  • Moon, Eun-Kyung;Kim, Joung-Ok;Xuan, Ying-Hua;Yun, Young-Sun;Kang, Se-Won;Lee, Yong-Seok;Ahn, Tae-In;Hong, Yeon-Chul;Chung, Dong-Il;Kong, Hyun-Hee
    • Parasites, Hosts and Diseases
    • /
    • v.47 no.2
    • /
    • pp.103-107
    • /
    • 2009
  • The genus Acanthamoeba can cause severe infections such as granulomatous amebic encephalitis and amebic keratitis in humans. However, little genomic information of Acanthamoeba has been reported. Here, we constructed Acanthamoeba expressed sequence tags (EST) database (Acanthamoeba EST DB) derived from our 4 kinds of Acanthamoeba cDNA library. The Acanthamoeba EST DB contains 3,897 EST generated from amebae under various conditions of long term in vitro culture, mouse brain passage, or encystation, and downloaded data of Acanthamoeba from National Center for Biotechnology Information (NCBI) and Taxonomically Broad EST Database (TBestDB). The almost reported eDNA/genomic sequences of Acanthamoeba provide stand alone BLAST system with nucleotide (BLAST NT) and amino acid (BLAST AA) sequence database. In BLAST results, each gene links for the significant information including sequence data, gene orthology annotations, relevant references, and a BlastX result. This is the first attempt for construction of Acanthamoeba database with genes expressed in diverse conditions. These data were integrated into a database (http://www. amoeba.or.kr).

Molecular Cloning and Expression of the $\beta$-Xylosidase Gene (xylB) of Bacillus stearothermophilus in Escherichia coli

  • Suh, Jung-Han;Eom, Soo-Jung;Cho, Ssang-Goo;Choi, Yong-Jin
    • Journal of Microbiology and Biotechnology
    • /
    • v.6 no.5
    • /
    • pp.331-335
    • /
    • 1996
  • The second $\beta$-Xylosidase gene (xylB) from Bacillus stearothermophilus was isolated from the genomic library, cloned into pBR322, and subsequently transferred into Escherichia coli HB101. Six out of 10, 000 transformants were selected from the selective LB medium supplemented with p-nitrophenyl-$\alpha$-L-arabinofuranoside (pNPAf) and ampicillin ($50\mu g$/ml) based on their ability to form a yellow ring around the colony. One of the clones was found to harbor the recombinant plasmid with 5.0 kb foreign DNA, which was identical to the $\alpha$-L-arabinofuranosidase gene (arfI) previously cloned in this lab, while the other five had 3.5 kb of the foreign DNA. Southern blotting experiments confirmed that the 3.5 kb insert DNA was from B. stearothermophilus chromosomal DNA. A zymogram with 4-methylumbelliferyl-$\alpha$-L-arabinofuranoside as the enzyme substrate revealed that the cloned gene product was one of the mutiple $\alpha$-L-arabinofuranosidases produced by B. stearothermophilus. Unlike the arfI gene product, the product of the gene on the insert DNA (xylB) showed an activity not only on pNPAf but also on oNPX suggesting that the cloned gene product could be a bifunctional enzyme having both $\alpha$-L-arabinofuranosidase and $\beta$-xylosidase activities.

  • PDF

Characterization of the Gene for the Light-Harvesting Peridinin-Chlorophyll-Protein of Alexandrium tamarense

  • LEE SOON-YOUL;KANG SUNG-HO;JIN EONSEON
    • Journal of Microbiology and Biotechnology
    • /
    • v.15 no.5
    • /
    • pp.1094-1099
    • /
    • 2005
  • Photosynthetic dinoflagellates contain a water-soluble, light-harvesting antenna called the peridinin-chlorophyll-protein (PCP) complex, which has an apoprotein with no sequence similarity to other known proteins. There are two forms of PCP apoproteins; the 15-kDa short form and the 32- to 35­kDa long form. The present study describes the PCP protein and its cDNA from Alexandrium tamarense. A cDNA library was constructed from mRNA isolated from A. tamarense. The complete PCP cDNA was generated by reverse-transcription coupled to polymerase chain reaction (RT-PCR), together with rapid-amplification of cDNA ends (RACE). The A. tamarense PCP cDNA encoded a 55-amino acid signal peptide and a 313-amino acid mature protein with a calculated mass of 32 kDa, which corresponded to that of the long form of PCP. Phylogenetic analysis indicated that the sequence of A. tamarense PCP did not cluster with the short-form PCPs, to which it was only about $55\%$ identical, but which were $79-83\%$ identical to other long-form PCPs. The deduced amino acid sequence of A. tamarense PCP contains an internal duplication, which suggests the possibility that long-form PCPs arose by gene duplication or by the fusion of genes encoding the short form. The abundance of PCP mRNA changed substantially in response to different light conditions, indicating the possible existence of a photo-acclimation response in A. tamarense.

Cloning and Regulation of Schizosaccharomyces pombe Gene Encoding Ribosomal Protein L11

  • Kim, Hong-Gyum;Lee, Jin-Joo;Park, Eun-Hee;Sa, Jae-Hoon;Ahn, Ki-Sup;Lim, Chang-Jin
    • BMB Reports
    • /
    • v.34 no.4
    • /
    • pp.379-384
    • /
    • 2001
  • The cDNA encoding ribosomal protein was identified from a cDNA library of Schizosaccharomyces pombe. The nucleotide sequence of the 548 by cDNA clone reveals an open reading frame, which encodes a putative protein of 166 amino acids with a molecular mass of 18.3 kDa. The amino acid sequence of the S. pombe L11 protein is highly homologous with those of rat and fruit, while it is clearly less similar to those of prokaryotic counterparts. The 1,044 by upstream sequence, and the region encoding N-terminal 7 amino acids of the genomic DNA were fused into the promoterless $\beta$-galactosidase gene of the shuttle vector YEp357 in order to generate the fusion plasmid pHY L11. Synthesis of $\beta$-galactosidase from the fusion plasmid varied according to the growth curve. It decreased significantly in the growth-arrested yeast cells that were treated with aluminum chloride and mercuric chloride. However, it was enhanced by treatments with cadmium chloride ($2.5\;{\mu}M$), zinc chloride ($2.5\;{\mu}M$), and hydrogen peroxide (0.5 mM). This indicates that the expression of the L,11 gene could be induced by oxidative stress.

  • PDF

Molecular Cloning of the Arginine Biosynthetic Genes from Corynebacterium glutamicum

  • Chun, Jae-Shick;Jung, Sam-Il;Ko, Soon-Young;Park, Mee-Young;Kim, Soo-Young;Lee, Heung-Shick;Cheon, Choong-Ill;Min, Kyung-Hee;Lee, Myeong-Sok
    • Journal of Microbiology
    • /
    • v.34 no.4
    • /
    • pp.355-362
    • /
    • 1996
  • Complementation cloning of the argC, E, B, D, F, and G genes in Corynebacterium glutamicum was done by transforming the genomic DNA library into the corresponding arginine auxotrophs fo Escherichia coli. Recombinant plasmids containing 6.7 kb and 4.8kb fragments complementing the E. coli argB mutant were also able to complement the E. coli argC, E, A, D, and F mutants, indicating the clustered organization of the arginine biosynthetic genes within the cloned DNA fragments. The insert DNA fragments in the recombinant plasmids, named pRB1 AND pRB2, were physically mapped with several restriction enzymes. By further subcloning the entire DNA fragment containing the functions and by complementation analysis, we located the arg genes in the order of ACEBDF on the restriction map. We also determined the DNA nucleotide sequence of the fragment and report here the sequence of the argB gene. When compared to that with the mutant strain, higher enzyme activity of N-acetylglutamate kinase was detected in the extract of the mutant carrying the plasmid containing the putative argB gene, indicating that the plasmid contains a functional argB gene. Deduced amino acid sequence of the argB gene shows 45%, 38%, and 25% identity to that from Bacillus strearothermophilus, Bacillus substilus, and E. coli respectively. Our long term goal is genetically engineering C. glutamicum which produces more arginine than a wild type strain does.

  • PDF

Molecular Characterization of Three cDNA Clones Encoding Calmodulin Isoforms of Rice

  • Lee, Sung-Ho;Kim, Cha Young;Lim, Chae Oh;Lee, Soo In;Gal, Sang Wan;Choi, Young Ju
    • Journal of Applied Biological Chemistry
    • /
    • v.43 no.1
    • /
    • pp.5-11
    • /
    • 2000
  • Three cDNA clones encoding rice calmodulin (CaM) isoforms (OsCaM-1, OsCaM-2, and OsCaM-3) were isolated from a rice cDNA library constructed from suspension-cultured rice cells treated with fungal elicitor. The coding regions of OsCaM-1 and O.sCaM-2 were 89% homologous at DNA Ievel, whereas the 5' and 3' untranslated regions were highly divergent. The polypeptides encoded by OsCaM-1 and OsCaM-2 was identical except two conservative substitution at position 8 and 75. The coding region of OsCaM-3 was consist of a typical conserved CaM domain and an additional C-terminal extension. The amino acid sequence of conserved CaM domain of OsCaM-3 shared only 86% identity with that OsCaM-1. The OsCaM-3 cDNA is belongs to a novel group of calmodulin gene due to its C-terminal extension of 38 amino acids, a large number of which are positively charged. The extension also contains a C-terminal CaaX-box prenylation site (CVlL). Genomic Southern analysis revealed at least six copies of CaM or CaM-related genes, suggesting that calmodulin may be represented by a small multigene family in the rice geneme. Expression of OsCaM gene was examined through Northern blot analysis. Transcript level of OsCaM-3 was increased by treatment with a fungal elicitor, whereas the OsCaM-1 and OsCaM-2 genes did not respond to the fungal elicitor. The expression of OsCaM-3 gene was remarkable inhibited in the rice cells treated with cyclosporine A, calcinurin inhibitor.

  • PDF

Genome Research on Peach and Pear

  • Hayashi Tateki;Yamamoto Toshiya
    • Proceedings of the Korean Society of Plant Biotechnology Conference
    • /
    • 2002.04a
    • /
    • pp.101-109
    • /
    • 2002
  • A lot of SSRs (simple sequence repeats) in peach and pear from enriched genomic libraries and in peach from a cDNA library were developed. These SSRs were applied to other related species, giving phenograms of 52 Prunus and 60 pear accessions. Apple SSRs could also be successfully used in Pyrus spp. Thirteen morphological traits were characterized on the basis of the linkage map obtained from an $F_2$ population of peach. This map was compiled with those morphological markers and 83 DNA markers, including SSR markers used as anchor loci, to compare different peach maps. Molecular markers tightly linked to new root-knot nematode resistance genes were also found. A linkage map including disease-related genes, pear scab resistance and black spot susceptibility, in the Japanese pear Kinchaku were constructed using 118 RAPD markers. Another linkage map, of the European pear Bartlett, was also constructed with 226 markers, including 49 SSRs from pear, apple, peach and chewy. Maps of other Japanese pear cultivars, i.e., Kousui and Housui, were also constructed. These maps were the first results of pear species.

  • PDF