• Title/Abstract/Keywords: Transcriptome sequencing

168 search results (processing time: 0.023 s)

Analysis of Whole Transcriptome Sequencing Data: Workflow and Software

  • Yang, In Seok;Kim, Sangwoo
    • Genomics & Informatics / Vol. 13, No. 4 / pp.119-125 / 2015
  • RNA is a polymeric molecule implicated in various biological processes, such as the coding, decoding, regulation, and expression of genes. Numerous studies have examined RNA features using whole transcriptome sequencing (RNA-seq) approaches. RNA-seq is a powerful technique for characterizing and quantifying the transcriptome, and it has accelerated the development of bioinformatics software. In this review, we introduce a routine RNA-seq workflow together with related software, focusing particularly on transcriptome reconstruction and expression quantification.

SNP Discovery from Transcriptome of Cashmere Goat Skin

  • Wang, Lele;Zhang, Yanjun;Zhao, Meng;Wang, Ruijun;Su, Rui;Li, Jinquan
    • Asian-Australasian Journal of Animal Sciences / Vol. 28, No. 9 / pp.1235-1243 / 2015
  • The goat Capra hircus is one of several economically important livestock species in China. Advances in molecular genetics have led to the identification of several single nucleotide variation markers associated with genes affecting economic traits. Validating single nucleotide variations found by whole-transcriptome sequencing is critical for interpreting this molecular genetic information. In this paper, we aim to develop a large number of reliable single nucleotide polymorphisms (SNPs) for Cashmere goat through transcriptome sequencing. The transcriptomes of Cashmere goat skin at four stages were measured using RNA-sequencing, and 90% to 92% of the total mapped reads were uniquely mapped. A total of 56,231 putative SNPs distributed among 10,057 genes were identified. The average minor allele frequency of all SNPs was 18%. GO and KEGG pathway analyses were conducted on the genes containing SNPs. Our follow-up biological validation revealed that 64% of the SNPs were true SNPs. Our results show that RNA-sequencing is a fast and efficient method for identifying a large number of SNPs. This work provides significant genetic resources for further research on Cashmere goats, especially for high-density linkage map construction and genome-wide association studies.
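
The average minor allele frequency mentioned in this abstract can be illustrated with a minimal sketch. It assumes biallelic SNPs with known reference and alternate allele counts per site; the function names and the example counts are hypothetical and are not taken from the paper.

```python
def minor_allele_frequency(ref_count: int, alt_count: int) -> float:
    """Return the frequency of the less common allele at a biallelic site."""
    total = ref_count + alt_count
    if total == 0:
        raise ValueError("site has no observed alleles")
    return min(ref_count, alt_count) / total


def mean_maf(sites):
    """Average the minor allele frequency over (ref_count, alt_count) pairs."""
    sites = list(sites)
    if not sites:
        raise ValueError("no sites given")
    return sum(minor_allele_frequency(r, a) for r, a in sites) / len(sites)


# Hypothetical allele counts for three SNP sites (illustration only).
example_sites = [(18, 2), (15, 5), (12, 8)]
print(round(mean_maf(example_sites), 3))
```

Taking the minimum of the two counts means the result does not depend on which allele happens to be labelled as the reference.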

Combined transcriptome and proteome analyses reveal differences in the longissimus dorsi muscle between Kazakh cattle and Xinjiang brown cattle

  • Yan, XiangMin;Wang, Jia;Li, Hongbo;Gao, Liang;Geng, Juan;Ma, Zhen;Liu, Jianming;Zhang, Jinshan;Xie, Penggui;Chen, Lei
    • Animal Bioscience / Vol. 34, No. 9 / pp.1439-1450 / 2021
  • Objective: With the rapid development of proteomics sequencing and RNA sequencing technology, multi-omics analysis has become a current research hotspot. Our previous study indicated that Xinjiang brown cattle have better meat quality than Kazakh cattle. In this study, Xinjiang brown cattle and Kazakh cattle were used as the research subjects. Methods: Proteome sequencing and RNA sequencing technology were used to analyze the proteome and transcriptome of the longissimus dorsi muscle of adult steers of the two breeds (n = 3). Results: In this project, 22,677 transcripts and 1,874 proteins were identified through quantitative analysis of the transcriptome and proteome. Comparing the two data sets, we found that 1,737 genes were identified at both the transcriptome and proteome levels. The results of the study revealed 12 differentially expressed genes and proteins: troponin I1, crystallin alpha B, cysteine and glycine rich protein 3, phosphotriesterase-related, myosin-binding protein H, glutathione S-transferase mu 3, myosin light chain 3, nidogen 2, dihydropyrimidinase like 2, glutamate-oxaloacetic transaminase 1, receptor accessory protein 5, and aspartoacylase. We performed functional enrichment of these differentially expressed genes and proteins. The Kyoto Encyclopedia of Genes and Genomes results showed that these differentially expressed genes and proteins are enriched in the fatty acid degradation and histidine metabolism signaling pathways. We performed parallel reaction monitoring (PRM) verification of the differentially expressed proteins, and the PRM results were consistent with the sequencing results. Conclusion: Our study identified differentially expressed genes and proteins. In addition, identifying functional genes and proteins with important breeding value will provide genetic resources and technical support for the breeding and industrialization of new genetically modified beef cattle breeds.

Genome-Wide SNP Calling Using Next Generation Sequencing Data in Tomato

  • Kim, Ji-Eun;Oh, Sang-Keun;Lee, Jeong-Hee;Lee, Bo-Mi;Jo, Sung-Hwan
    • Molecules and Cells / Vol. 37, No. 1 / pp.36-42 / 2014
  • The tomato (Solanum lycopersicum L.) is a model plant for genome research in Solanaceae, as well as for studying crop breeding. Genome-wide single nucleotide polymorphisms (SNPs) are a valuable resource in genetic research and breeding. However, most methods for discovering genome-wide SNPs require expensive high-depth sequencing. Here, we describe a method for SNP calling using a modified version of SAMtools with improved sensitivity. We analyzed 90 Gb of raw sequence data from next-generation sequencing of two resequencing and seven transcriptome data sets from several tomato accessions. Our study identified 4,812,432 non-redundant SNPs. Moreover, the workflow of SNP calling was improved by aligning the reference genome with its own raw data. Using this approach, 131,785 SNPs were discovered from transcriptome data of seven accessions. In addition, 4,680,647 SNPs were identified from the genome of S. pimpinellifolium, which is 60 times more than the 71,637 identified from the PI212816 transcriptome. SNP distribution was compared between the whole genome and transcriptome of S. pimpinellifolium. Moreover, we surveyed the location of SNPs within genic and intergenic regions. Our results indicate that sufficient genome-wide SNP markers and a very sensitive SNP calling method allow for the application of marker-assisted breeding and genome-wide association studies.

A Study on Transcriptome Analysis Using de novo RNA-sequencing to Compare Ginseng Roots Cultivated in Different Environments

  • Yang, Byung Wook
    • The Plant Resources Society of Korea: Conference Proceedings / The Plant Resources Society of Korea 2018 Spring Conference / pp.5-5 / 2018
  • Ginseng (Panax ginseng C.A. Meyer), one of the most widely used medicinal plants in traditional oriental medicine, is used for the treatment of various diseases. It has been classified according to its cultivation environment, such as field cultivated ginseng (FCG) and mountain cultivated ginseng (MCG). However, little is known about differences in gene expression in ginseng roots between field cultivated and mountain cultivated ginseng. To investigate the whole transcriptome landscape of ginseng, we employed high-throughput sequencing technologies using the Illumina HiSeq™ 2500 system and generated a large amount of transcriptome sequence data from ginseng roots. Approximately 77 million and 87 million high-quality reads were produced in the FCG and MCG root transcriptome analyses, respectively, and we obtained 256,032 assembled unigenes with an average length of 1,171 bp by de novo assembly methods. Functional annotations of the unigenes were performed using sequence similarity comparisons against the following databases: the non-redundant nucleotide database, the InterPro domains database, the Gene Ontology Consortium database, and the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway database. A total of 4,207 unigenes were assigned to specific metabolic pathways, and all of the known enzymes involved in starch and sucrose metabolism pathways were also identified in the KEGG library. This study indicated that alpha-glucan phosphorylase 1, putative pectinesterase/pectinesterase inhibitor 17, beta-amylase, and alpha-glucan phosphorylase isozyme H might be important factors involved in starch and sucrose metabolism between FCG and MCG in different environments.

Sequencing and Characterization of Divergent Marbling Levels in the Beef Cattle (Longissimus dorsi Muscle) Transcriptome

  • Chen, Dong;Li, Wufeng;Du, Min;Wu, Meng;Cao, Binghai
    • Asian-Australasian Journal of Animal Sciences / Vol. 28, No. 2 / pp.158-165 / 2015
  • Marbling is an important trait regarding the quality of beef. Analysis of the beef cattle transcriptome and its expression profile data is essential to extend the genetic information resources and would support further studies on beef cattle. RNA sequencing was performed in beef cattle using the Illumina HiSeq 2000 platform. Approximately 251.58 million clean reads were generated from a high marbling (H) group and a low marbling (L) group. Approximately 80.12% of the 19,994 bovine genes (protein coding) were detected in all samples, and 749 genes exhibited differential expression between the H and L groups based on fold change (>1.5-fold, p<0.05). Multiple gene ontology terms and biological pathways were found to be significantly enriched among the differentially expressed genes. The transcriptome data will facilitate future functional studies on marbling formation in beef cattle and may be applied to improve breeding programs for cattle and closely related mammals.
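
The selection criterion quoted in this abstract (>1.5-fold, p<0.05) can be sketched as a simple filter. This is only an illustration: it assumes the fold-change threshold applies in both directions (up- or down-regulation) and that a per-gene p-value is already available. The record layout, gene names, and values are hypothetical, and the paper's actual statistical pipeline is not described in the abstract.

```python
import math


def is_differentially_expressed(mean_h, mean_l, p_value,
                                fold_threshold=1.5, alpha=0.05):
    """Flag a gene whose H/L expression ratio exceeds the fold threshold
    in either direction and whose p-value is below alpha."""
    if mean_h <= 0 or mean_l <= 0:
        return False  # ratio is undefined without a pseudocount; skip the gene
    log2_fc = math.log2(mean_h / mean_l)
    return abs(log2_fc) > math.log2(fold_threshold) and p_value < alpha


# Hypothetical records: (gene, mean expression in H, mean expression in L, p-value)
genes = [
    ("geneA", 30.0, 10.0, 0.01),   # 3-fold up, significant
    ("geneB", 12.0, 10.0, 0.001),  # only 1.2-fold
    ("geneC", 5.0, 10.0, 0.04),    # 2-fold down, significant
    ("geneD", 40.0, 10.0, 0.20),   # large change, not significant
]
selected = [name for name, h, l, p in genes
            if is_differentially_expressed(h, l, p)]
print(selected)  # prints ['geneA', 'geneC']
```

Working on the log2 scale treats a 1.5-fold increase and a 1.5-fold decrease symmetrically, which is why the comparison uses the absolute value.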

Transcriptome sequencing revealed the inhibitory mechanism of ketoconazole on clinical Microsporum canis

  • Wang, Mingyang;Zhao, Yan;Cao, Lingfang;Luo, Silong;Ni, Binyan;Zhang, Yi;Chen, Zeliang
    • Journal of Veterinary Science / Vol. 22, No. 1 / pp.4.1-4.13 / 2021
  • Background: Microsporum canis is a zoonotic pathogen that can cause dermatophytosis in animals and humans. Objectives: In clinical practice, ketoconazole (KTZ) and other imidazole drugs are commonly used to treat M. canis infection, but their molecular mechanism of action is not completely understood. The antifungal mechanism of KTZ needs to be studied in detail. Methods: In this study, one fungal strain was isolated from a canine suffering from clinical dermatosis and confirmed as M. canis by morphological observation and sequencing analysis. The clinically isolated M. canis was treated with KTZ, and transcriptome sequencing was performed to identify genes differentially expressed in M. canis exposed to KTZ compared with unexposed M. canis. Results: At half the minimum inhibitory concentration (½MIC), compared with the control group, 453 genes were significantly up-regulated and 326 genes were significantly down-regulated (p < 0.05). Quantitative reverse transcription polymerase chain reaction analysis verified the transcriptome results of RNA sequencing. Gene ontology enrichment analysis and Kyoto Encyclopedia of Genes and Genomes enrichment analysis revealed that the 3 pathways of RNA polymerase, steroid biosynthesis, and ribosome biogenesis in eukaryotes are closely related to the antifungal mechanism of KTZ. Conclusions: The results indicated that KTZ may change cell membrane permeability, destroy the cell wall, and inhibit mitosis and transcriptional regulation through CYP51, SQL, ERG6, ATM, ABCB1, SC, KER33, RPA1, and RNP genes in the 3 pathways. This study provides a new theoretical basis for the effective control of M. canis infection and for understanding the effect of KTZ on fungi.

Application of Pac-Bio Sequencing, Trinity, and rnaSPAdes Assembly for Transcriptome Analysis in Medicinal Crop Astragalus membranaceus

  • Ji-Nam Kang;Si Myung Lee
    • The Korean Society of Crop Science: Conference Proceedings / The Korean Society of Crop Science 2022 Autumn Conference / pp.254-254 / 2022
  • Astragalus membranaceus (A. membranaceus) has traditionally been used as a medicinal plant in East Asia for the treatment of various diseases. A. membranaceus belongs to the legume family and is known to be rich in substances such as flavonoids and saponins. Recent pharmacological studies of A. membranaceus have shown that the plant has immunomodulatory, anti-oxidant, anti-cancer, and anti-inflammatory effects. However, knowledge of major biosynthetic pathways in A. membranaceus is still lacking. Recently developed sequencing techniques enable high-quality transcriptome analysis in plants, which is recognized as an important part of elucidating the regulatory mechanisms of many plant secondary metabolic pathways. However, it is difficult to predict the number of transcripts because plant transcripts contain a large number of isoforms due to alternative splicing events, and the result can vary depending on the assembly platform used. In this study, we constructed three unigene sets using Pac-Bio isoform sequencing, Trinity, and rnaSPAdes assembly for detailed transcriptome analysis in A. membranaceus. Furthermore, all genes involved in the flavonoid biosynthetic pathway were searched for in the three unigene sets, and structural comparisons and expression profiles of these genes were analyzed. Isoflavone synthesis was active in most tissues. Flavonol synthesis was mainly active in leaves and flowers, and anthocyanin synthesis was specific to flowers. Gene structural analysis revealed structural differences in the flavonoid-related genes derived from the three unigene sets. This study suggests the need for the application of multiple unigene sets for the analysis of key biosynthetic pathways in plants.

RNA-Seq De Novo Assembly and Differential Transcriptome Analysis of Korean Medicinal Herb Cirsium japonicum var. spinossimum

  • Roy, Neha Samir;Kim, Jung-A;Choi, Ah-Young;Ban, Yong-Wook;Park, Nam-Il;Park, Kyong-Cheul;Yang, Hee-sun;Choi, Ik-Young;Kim, Soonok
    • Genomics & Informatics / Vol. 16, No. 4 / pp.34.1-34.9 / 2018
  • Cirsium japonicum belongs to the Asteraceae or Compositae family and is a medicinal plant in Asia that has a variety of effects, including tumour inhibition, improved immunity with flavones, and antidiabetic and hepatoprotective effects. Silymarin is synthesized from 4-coumaroyl-CoA via both the flavonoid and phenylpropanoid pathways, which produce the immediate precursors taxifolin and coniferyl alcohol. Then, the oxidative radicalization of taxifolin and coniferyl alcohol produces silymarin. We identified the expression of genes related to the synthesis of silymarin in C. japonicum in three different tissues, namely, flowers, leaves, and roots, through RNA sequencing. We obtained 51,133 unigenes from transcriptome sequencing by de novo assembly using Trinity v2.1.1, TransDecoder v2.0.1, and CD-HIT v4.6 software. The differentially expressed gene analysis revealed that the expression of genes related to the flavonoid pathway was higher in the flowers, whereas genes of the phenylpropanoid pathway were more highly expressed in the roots. In this study, we established a global transcriptome dataset for C. japonicum. The data will be useful not only for examining in more depth the genes related to the production of medicinal metabolites, including flavonolignans, but also for functional genomics studies aimed at the genetic engineering of C. japonicum.

COEX-Seq: Convert a Variety of Measurements of Gene Expression in RNA-Seq

  • Kim, Sang Cheol;Yu, Donghyeon;Cho, Seong Beom
    • Genomics & Informatics / Vol. 16, No. 4 / pp.36.1-36.3 / 2018
  • Next generation sequencing (NGS), a high-throughput DNA sequencing technology, is widely used for molecular biological studies. Within NGS, RNA-sequencing (RNA-Seq), a short-read massively parallel sequencing method, is a major quantitative tool for transcriptome studies. To utilize RNA-Seq data, various quantification and analysis methods have been developed to address specific research goals, including identification of differentially expressed genes and detection of novel transcripts. Because of the accumulation of RNA-Seq data in public databases, there is a demand for integrative analysis. However, the available RNA-Seq data are stored in different formats such as read count, transcripts per million, and fragments per kilobase million. This hinders the integrative analysis of the RNA-Seq data. To solve this problem, we have developed COEX-seq (Convert a Variety of Measurements of Gene Expression in RNA-Seq), a web-based application built with Shiny that easily converts data among the measurement formats of gene expression used in most bioinformatic tools for RNA-Seq. It provides a workflow that includes loading a data set, selecting measurement formats of gene expression, and identifying gene names. COEX-seq is freely available for academic purposes and can be run on Windows, Mac OS, and Linux operating systems. Source code, sample data sets, and supplementary documentation are available as well.
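
The kind of conversion this abstract describes (read count, TPM, FPKM) can be sketched with the standard textbook definitions of these units. This is not COEX-seq's own code, which is a Shiny application; the function names, the use of plain Python lists, and the example counts and gene lengths are all assumptions made for illustration.

```python
def counts_to_fpkm(counts, lengths_bp):
    """FPKM_i = counts_i * 1e9 / (length_i * total_counts)."""
    total = sum(counts)
    if total == 0:
        raise ValueError("library has no mapped fragments")
    return [c * 1e9 / (length * total) for c, length in zip(counts, lengths_bp)]


def counts_to_tpm(counts, lengths_bp):
    """TPM_i = (counts_i / length_i) / sum_j(counts_j / length_j) * 1e6."""
    rates = [c / length for c, length in zip(counts, lengths_bp)]
    total_rate = sum(rates)
    if total_rate == 0:
        raise ValueError("library has no mapped fragments")
    return [r / total_rate * 1e6 for r in rates]


def fpkm_to_tpm(fpkm):
    """TPM_i = FPKM_i / sum_j(FPKM_j) * 1e6; gene lengths are not needed."""
    total = sum(fpkm)
    if total == 0:
        raise ValueError("all FPKM values are zero")
    return [f / total * 1e6 for f in fpkm]


# Hypothetical fragment counts and gene lengths (bp) for three genes.
example_counts = [100, 200, 700]
example_lengths = [1000, 2000, 1000]

fpkm = counts_to_fpkm(example_counts, example_lengths)
tpm = counts_to_tpm(example_counts, example_lengths)
print(fpkm)
print(tpm)
print(fpkm_to_tpm(fpkm))  # matches tpm up to floating-point rounding
```

FPKM can be rescaled to TPM without gene lengths, but going back from TPM or FPKM to raw counts requires the library size, and the gene lengths as well, so that direction is not always recoverable from published tables.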