• Title/Summary/Keyword: de novo RNA sequencing

Search Result 19, Processing Time 0.029 seconds

A Study on Transcriptome Analysis Using de novo RNA-sequencing to Compare Ginseng Roots Cultivated in Different Environments

  • Yang, Byung Wook
    • Proceedings of the Plant Resources Society of Korea Conference
    • /
    • 2018.04a
    • /
    • pp.5-5
    • /
    • 2018
  • Ginseng (Panax ginseng C.A. Meyer), one of the most widely used medicinal plants in traditional oriental medicine, is used for the treatment of various diseases. It has been classified according to its cultivation environment, such as field cultivated ginseng (FCG) and mountain cultivated ginseng (MCG). However, little is known about differences in gene expression in ginseng roots between field cultivated and mountain cultivated ginseng. In order to investigate the whole transcriptome landscape of ginseng, we employed High-Throughput sequencing technologies using the Illumina HiSeqTM2500 system, and generated a large amount of sequenced transcriptome from ginseng roots. Approximately 77 million and 87 million high-quality reads were produced in the FCG and MCG roots transcriptome analyses, respectively, and we obtained 256,032 assembled unigenes with an average length of 1,171 bp by de novo assembly methods. Functional annotations of the unigenes were performed using sequence similarity comparisons against the following databases: the non-redundant nucleotide database, the InterPro domains database, the Gene Ontology Consortium database, and the Kyoto Encyclopedia of Genes and Genomes pathway database. A total of 4,207 unigenes were assigned to specific metabolic pathways, and all of the known enzymes involved in starch and sucrose metabolism pathways were also identified in the KEGG library. This study indicated that alpha-glucan phosphorylase 1, putative pectinesterase/pectinesterase inhibitor 17, beta-amylase, and alpha-glucan phosphorylase isozyme H might be important factors involved in starch and sucrose metabolism between FCG and MCG in different environments.

  • PDF

Survey of the Applications of NGS to Whole-Genome Sequencing and Expression Profiling

  • Lim, Jong-Sung;Choi, Beom-Soon;Lee, Jeong-Soo;Shin, Chan-Seok;Yang, Tae-Jin;Rhee, Jae-Sung;Lee, Jae-Seong;Choi, Ik-Young
    • Genomics & Informatics
    • /
    • v.10 no.1
    • /
    • pp.1-8
    • /
    • 2012
  • Recently, the technologies of DNA sequence variation and gene expression profiling have been used widely as approaches in the expertise of genome biology and genetics. The application to genome study has been particularly developed with the introduction of the nextgeneration DNA sequencer (NGS) Roche/454 and Illumina/ Solexa systems, along with bioinformation analysis technologies of whole-genome $de$ $novo$ assembly, expression profiling, DNA variation discovery, and genotyping. Both massive whole-genome shotgun paired-end sequencing and mate paired-end sequencing data are important steps for constructing $de$ $novo$ assembly of novel genome sequencing data. It is necessary to have DNA sequence information from a multiplatform NGS with at least $2{\times}$ and $30{\times}$ depth sequence of genome coverage using Roche/454 and Illumina/Solexa, respectively, for effective an way of de novo assembly. Massive shortlength reading data from the Illumina/Solexa system is enough to discover DNA variation, resulting in reducing the cost of DNA sequencing. Whole-genome expression profile data are useful to approach genome system biology with quantification of expressed RNAs from a wholegenome transcriptome, depending on the tissue samples. The hybrid mRNA sequences from Rohce/454 and Illumina/Solexa are more powerful to find novel genes through $de$ $novo$ assembly in any whole-genome sequenced species. The $20{\times}$ and $50{\times}$ coverage of the estimated transcriptome sequences using Roche/454 and Illumina/Solexa, respectively, is effective to create novel expressed reference sequences. However, only an average $30{\times}$ coverage of a transcriptome with short read sequences of Illumina/Solexa is enough to check expression quantification, compared to the reference expressed sequence tag sequence.

RNA-Seq De Novo Assembly and Differential Transcriptome Analysis of Korean Medicinal Herb Cirsium japonicum var. spinossimum

  • Roy, Neha Samir;Kim, Jung-A;Choi, Ah-Young;Ban, Yong-Wook;Park, Nam-Il;Park, Kyong-Cheul;Yang, Hee-sun;Choi, Ik-Young;Kim, Soonok
    • Genomics & Informatics
    • /
    • v.16 no.4
    • /
    • pp.34.1-34.9
    • /
    • 2018
  • Cirsium japonicum belongs to the Asteraceae or Compositae family and is a medicinal plant in Asia that has a variety of effects, including tumour inhibition, improved immunity with flavones, and antidiabetic and hepatoprotective effects. Silymarin is synthesized by 4-coumaroyl-CoA via both the flavonoid and phenylpropanoid pathways to produce the immediate precursors taxifolin and coniferyl alcohol. Then, the oxidative radicalization of taxifolin and coniferyl alcohol produces silymarin. We identified the expression of genes related to the synthesis of silymarin in C. japonicum in three different tissues, namely, flowers, leaves, and roots, through RNA sequencing. We obtained 51,133 unigenes from transcriptome sequencing by de novo assembly using Trinity v2.1.1, TransDecoder v2.0.1, and CD-HIT v4.6 software. The differentially expressed gene analysis revealed that the expression of genes related to the flavonoid pathway was higher in the flowers, whereas the phenylpropanoid pathway was more highly expressed in the roots. In this study, we established a global transcriptome dataset for C. japonicum. The data shall not only be useful to focus more deeply on the genes related to product medicinal metabolite including flavolignan but also to study the functional genomics for genetic engineering of C. japonicum.

Draft Genome of Toxocara canis, a Pathogen Responsible for Visceral Larva Migrans

  • Kong, Jinhwa;Won, Jungim;Yoon, Jeehee;Lee, UnJoo;Kim, Jong-Il;Huh, Sun
    • Parasites, Hosts and Diseases
    • /
    • v.54 no.6
    • /
    • pp.751-758
    • /
    • 2016
  • This study aimed at constructing a draft genome of the adult female worm Toxocara canis using next-generation sequencing (NGS) and de novo assembly, as well as to find new genes after annotation using functional genomics tools. Using an NGS machine, we produced DNA read data of T. canis. The de novo assembly of the read data was performed using SOAPdenovo. RNA read data were assembled using Trinity. Structural annotation, homology search, functional annotation, classification of protein domains, and KEGG pathway analysis were carried out. Besides them, recently developed tools such as MAKER, PASA, Evidence Modeler, and Blast2GO were used. The scaffold DNA was obtained, the N50 was 108,950 bp, and the overall length was 341,776,187 bp. The N50 of the transcriptome was 940 bp, and its length was 53,046,952 bp. The GC content of the entire genome was 39.3%. The total number of genes was 20,178, and the total number of protein sequences was 22,358. Of the 22,358 protein sequences, 4,992 were newly observed in T. canis. Following proteins previously unknown were found: E3 ubiquitin-protein ligase cbl-b and antigen T-cell receptor, zeta chain for T-cell and B-cell regulation; endoprotease bli-4 for cuticle metabolism; mucin 12Ea and polymorphic mucin variant C6/1/40r2.1 for mucin production; tropomodulin-family protein and ryanodine receptor calcium release channels for muscle movement. We were able to find new hypothetical polypeptides sequences unique to T. canis, and the findings of this study are capable of serving as a basis for extending our biological understanding of T. canis.

Marine Metatranscriptome Profiling in the Sea Adjacent to Jeju Island, Korea, by RNA-sequencing (RNA-sequencing을 이용한 제주도 인접 바다의 메타전사체 프로파일링)

  • Hwang, Jinik;Kang, Mingyeong;Kim, Kang Eun;Jung, Seung Won;Lee, Taek-Kyun
    • Journal of Life Science
    • /
    • v.30 no.7
    • /
    • pp.625-629
    • /
    • 2020
  • The Ocean is a rich source of diverse living organisms include viruses. In this study, we examined the microbial communities in the sea adjacent to Jeju Island in two seasons by metatranscriptomics. We collected and extracted total RNA, and, using the next-generation sequencing HiSeq 2000 and de novo transcriptome assembly, we identified 652,984 and 163,759 transcripts from the March and December samples, respectively. The most abundant organisms in March were bacteria, while eukaryotes were dominant in the December sample. The bacterial communities differed between the two samples, suggesting seasonal change. To identify the viruses, we searched the transcripts against a viral reference database using MegaBLAST with the most identified being bacteriophages infecting the marine bacteria. However, we also revealed an abundance of transcripts associated with diverse herpesviruses in the two transcriptomes, indicating the presence or possible threat of infection of fish in the sea around Jeju Island. This data is valuable for the study of marine microbial communities and for identifying possible viral pathogens.

Transcriptome analysis of internal and external stress mechanisms in Aster spathulifolius Maxim.

  • Sivagami, Jean Claude;Park, SeonJoo
    • Proceedings of the Plant Resources Society of Korea Conference
    • /
    • 2019.04a
    • /
    • pp.35-35
    • /
    • 2019
  • Aster spathulifolius Maxim. is belongs to the Asteraceae family which is distributed only in Korea and Japan. It is recognize as a traditionally medicinal plants and economically valuable in ornamental field. However, among the Asteraceae family, the Aster genus, which is lacks in genomic resources and information of molecular function. Therefore, we used high throughput RNA-sequencing transcriptome data of the A. spathulifolius to know molecular level function. DeNovo assembly produced 98,660 unigene with N50 value 1126 bp. Unigenes was performed to analyses the functional annotation against NCBI database like plant database of nucleotide (Nt) and non-redundant protein (Nr), Pfam, Uniprot, KEGG and Transcriptional factor (TF). In addition, Distribution of SSR markers also analyzed for future perfectives. Further, Comparing with other two Asteraceae family species like, Karelinia caspica and Chrysanthemum morifolium to the A. spathulifolius shows the number of gene that regulated in internal and external stress respectively salt-tolerant and heat and drought stress to understand the molecular basis related to the different environments stress.

  • PDF

Neuronal function and dysfunction of CYFIP2: from actin dynamics to early infantile epileptic encephalopathy

  • Zhang, Yinhua;Lee, Yeunkum;Han, Kihoon
    • BMB Reports
    • /
    • v.52 no.5
    • /
    • pp.304-311
    • /
    • 2019
  • The cytoplasmic FMR1-interacting protein family (CYFIP1 and CYFIP2) are evolutionarily conserved proteins originally identified as binding partners of the fragile X mental retardation protein (FMRP), a messenger RNA (mRNA)-binding protein whose loss causes the fragile X syndrome. Moreover, CYFIP is a key component of the heteropentameric WAVE regulatory complex (WRC), a critical regulator of neuronal actin dynamics. Therefore, CYFIP may play key roles in regulating both mRNA translation and actin polymerization, which are critically involved in proper neuronal development and function. Nevertheless, compared to CYFIP1, neuronal function and dysfunction of CYFIP2 remain largely unknown, possibly due to the relatively less well established association between CYFIP2 and brain disorders. Despite high amino acid sequence homology between CYFIP1 and CYFIP2, several in vitro and animal model studies have suggested that CYFIP2 has some unique neuronal functions distinct from those of CYFIP1. Furthermore, recent whole-exome sequencing studies identified de novo hot spot variants of CYFIP2 in patients with early infantile epileptic encephalopathy (EIEE), clearly implicating CYFIP2 dysfunction in neurological disorders. In this review, we highlight these recent investigations into the neuronal function and dysfunction of CYFIP2, and also discuss several key questions remaining about this intriguing neuronal protein.

Application of next generation sequencing (NGS) system for whole-genome sequencing of porcine reproductive and respiratory syndrome virus (PRRSV) (돼지생식기호흡기증후군바이러스(PRRSV)의 전장 유전체 염기서열(whole-genome sequencing) 분석을 위한 차세대 염기서열 분석법의 활용)

  • Moon, Sung-Hyun;Khatun, Amina;Kim, Won-Il;Hossain, Md Mukter;Oh, Yeonsu;Cho, Ho-Seong
    • Korean Journal of Veterinary Service
    • /
    • v.39 no.1
    • /
    • pp.41-49
    • /
    • 2016
  • In the present study, fast and robust methods for the next generation sequencing (NGS) were developed for analysis of PRRSV full genome sequences, which is a positive sensed RNA virus with a high degree of genetic variability among isolates. Two strains of PRRSVs (VR2332 and VR2332-R) which have been maintained in our laboratory were used to validate our methods and to compare with the sequence registered in GenBank (GenBank accession no. EF536003). The results suggested that both of strains had 100% coverage with the reference; the VR2332 had the coverage depth from minimum 3 to maximum 23,012, for the VR2332-R from minimum 3 to maximum 41,348, and 22,712 as an average depth. Genomic data produced from the massive sequencing capacities of the NGS have enabled the study of PRRSV at an unprecedented rate and details. Unlike conventional sequence methods which require the knowledge of conserved regions, the NGS allows de novo assembly of the full viral genomes. Therefore, our results suggested that these methods using the NGS massively facilitate the generation of more full genome PRRSV sequences locally as well as nationally in regard of saving time and cost.

Mercury Resistance and Removal Mechanisms of Pseudomonas sp. Isolated Mercury-contaminated Site in Taiwan

  • Luo, Kai-Hong;Chen, Ssu-Ching;Liao, Hung-Yu
    • Journal of Soil and Groundwater Environment
    • /
    • v.21 no.5
    • /
    • pp.16-24
    • /
    • 2016
  • A new strain of Pseudomonas sp. was isolated from mercury (Hg)-contaminated sites in Taiwan. This bacterium removed more than 80% of Hg present in the culture medium at 12 h incubation and was chosen for further analysis of the molecular mechanisms of Hg tolerance/removal abilities in this Pseudomonas sp. We used RNA-seq, one of the next-generation sequencing methods, to investigate the transcriptomic responses of the Pseudomonas sp. exposed to 60 mg/L of Hg2+. We de novo assembled 4,963 contigs, of which 10,533 up-regulated genes and 5,451 down-regulated genes were found to be regulated by Hg. The 40 genes most altered in expression levels were associated with tolerance to Hg stress and metabolism. Functional analysis showed that some Hg-tolerant genes were related to the mer operon, sulfate uptake and assimilation, the enzymatic antioxidant system, the HSP gene family, chaperones, and metal transporters. The transcriptome were analyzed further with Gene Ontology (GO) and Cluster of Orthologous Groups (COGs) of proteins and showed diverse biological functions and metabolic pathways under Hg stress.

Whole-Genome Characterization of Alfalfa Mosaic Virus Obtained from Metagenomic Analysis of Vinca minor and Wisteria sinensis in Iran: with Implications for the Genetic Structure of the Virus

  • Moradi, Zohreh;Mehrvar, Mohsen
    • The Plant Pathology Journal
    • /
    • v.37 no.6
    • /
    • pp.619-631
    • /
    • 2021
  • Alfalfa mosaic virus (AMV), an economically important pathogen, is present worldwide with a very wide host range. This work reports for the first time the infection of Vinca minor and Wisteria sinensis with AMV using RNA sequencing and reverse transcription polymerase chain reaction confirmation. De novo assembly and annotating of contigs revealed that RNA1, RNA2, and RNA3 genomic fragments consist of 3,690, 2,636, and 2,057 nucleotides (nt) for IR-VM and 3,690, 2,594, and 2,057 nt for IR-WS. RNA1 and RNA3 segments of IR-VM and IR-WS closely resembled those of the Chinese isolate HZ, with 99.23-99.26% and 98.04-98.09% nt identity, respectively. Their RNA2 resembled that of Canadian isolate CaM and American isolate OH-2-2017, with 97.96-98.07% nt identity. The P2 gene revealed more nucleotide diversity compared with other genes. Genes in the AMV genome were under dominant negative selection during evolution, and the P1 and coat protein (CP) proteins were subject to the strongest and weakest purifying selection, respectively. In the population genetic analysis based on the CP gene sequences, all 107 AMV isolates fell into two main clades (A, B) and isolates of clade A were further divided into three groups with significant subpopulation differentiation. The results indicated moderate genetic variation within and no clear geographic or genetic structure between the studied populations, implying moderate gene flow can play an important role in differentiation and distribution of genetic diversity among populations. Several factors have shaped the genetic structure and diversity of AMV: selection, recombination/reassortment, gene flow, and random processes such as founder effects.