• Title/Summary/Keyword: Transcriptome Sequencing

Search Result 175, Processing Time 0.028 seconds

Integrative Comparison of Burrows-Wheeler Transform-Based Mapping Algorithm with de Bruijn Graph for Identification of Lung/Liver Cancer-Specific Gene

  • Ajaykumar, Atul;Yang, Jung Jin
    • Journal of Microbiology and Biotechnology
    • /
    • v.32 no.2
    • /
    • pp.149-159
    • /
    • 2022
  • Cancers of the lung and liver are the top 10 leading causes of cancer death worldwide. Thus, it is essential to identify the genes specifically expressed in these two cancer types to develop new therapeutics. Although many messenger RNA (mRNA) sequencing data related to these cancer cells are available due to the advancement of next-generation sequencing (NGS) technologies, optimized data processing methods need to be developed to identify the novel cancer-specific genes. Here, we conducted an analytical comparison between Bowtie2, a Burrows-Wheeler transform-based alignment tool, and Kallisto, which adopts pseudo alignment based on a transcriptome de Bruijn graph using mRNA sequencing data on normal cells and lung/liver cancer tissues. Before using cancer data, simulated mRNA sequencing reads were generated, and the high Transcripts Per Million (TPM) values were compared. mRNA sequencing reads data on lung/liver cancer cells were also extracted and quantified. While Kallisto could directly give the output in TPM values, Bowtie2 provided the counts. Thus, TPM values were calculated by processing the Sequence Alignment Map (SAM) file in R using package Rsubread and subsequently in python. The analysis of the simulated sequencing data revealed that Kallisto could detect more transcripts and had a higher overlap over Bowtie2. The evaluation of these two data processing methods using the known lung cancer biomarkers concludes that in standard settings without any dedicated quality control, Kallisto is more effective at producing faster and more accurate results than Bowtie2. Such conclusions were also drawn and confirmed with the known biomarkers specific to liver cancer.

RNA-sequencing Profiles of Cell Cycle-Related Genes Upregulated during the G2-Phase in Giardia lamblia

  • Kim, Juri;Shin, Mee Young;Park, Soon-Jung
    • Parasites, Hosts and Diseases
    • /
    • v.57 no.2
    • /
    • pp.185-189
    • /
    • 2019
  • To identify the component(s) involved in cell cycle control in the protozoan Giardia lamblia, cells arrested at the G1/S- or G2-phase by treatment with nocodazole and aphidicolin were prepared from the synchronized cell cultures. RNA-sequencing analysis of the 2 stages of Giardia cell cycle identified several cell cycle genes that were up-regulated at the G2-phase. Transcriptome analysis of cells in 2 distinct cell cycle stages of G. lamblia confirmed previously reported components of cell cycle (PcnA, cyclin B, and CDK) and identified additional cell cycle components (NEKs, Mad2, spindle pole protein, and CDC14A). This result indicates that the cell cycle machinery operates in this protozoan, one of the earliest diverging eukaryotic lineages.

Dimensionality Reduction of RNA-Seq Data

  • Al-Turaiki, Isra
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.3
    • /
    • pp.31-36
    • /
    • 2021
  • RNA sequencing (RNA-Seq) is a technology that facilitates transcriptome analysis using next-generation sequencing (NSG) tools. Information on the quantity and sequences of RNA is vital to relate our genomes to functional protein expression. RNA-Seq data are characterized as being high-dimensional in that the number of variables (i.e., transcripts) far exceeds the number of observations (e.g., experiments). Given the wide range of dimensionality reduction techniques, it is not clear which is best for RNA-Seq data analysis. In this paper, we study the effect of three dimensionality reduction techniques to improve the classification of the RNA-Seq dataset. In particular, we use PCA, SVD, and SOM to obtain a reduced feature space. We built nine classification models for a cancer dataset and compared their performance. Our experimental results indicate that better classification performance is obtained with PCA and SOM. Overall, the combinations PCA+KNN, SOM+RF, and SOM+KNN produce preferred results.

A guide to phylotranscriptomic analysis for phycologists

  • Cheon, Seongmin;Lee, Sung-Gwon;Hong, Hyun-Hee;Lee, Hyun-Gwan;Kim, Kwang Young;Park, Chungoo
    • ALGAE
    • /
    • v.36 no.4
    • /
    • pp.333-340
    • /
    • 2021
  • Phylotranscriptomics is the study of phylogenetic relationships among taxa based on their DNA sequences derived from transcriptomes. Because of the relatively low cost of transcriptome sequencing compared with genome sequencing and the fact that phylotranscriptomics is almost as reliable as phylogenomics, the phylotranscriptomic analysis has recently emerged as the preferred method for studying evolutionary biology. However, it is challenging to perform transcriptomic and phylogenetic analyses together without programming expertise. This study presents a protocol for phylotranscriptomic analysis to aid marine biologists unfamiliar with UNIX command-line interface and bioinformatics tools. Here, we used transcriptomes to reconstruct a molecular phylogeny of dinoflagellate protists, a diverse and globally abundant group of marine plankton organisms whose large and complex genomic sequences have impeded conventional phylogenic analysis based on genomic data. We hope that our proposed protocol may serve as practical and helpful information for the training and education of novice phycologists.

Construction of genetic linkage maps of Allium cepa using genotyping-by-sequencing

  • Lee, Daewoong;Chung, Yong Suk;Kim, Changsoo;Jun, Tae-Hwan
    • Proceedings of the Korean Society of Crop Science Conference
    • /
    • 2017.06a
    • /
    • pp.117-117
    • /
    • 2017
  • The onion (Allium cepa L.) is the most widely cultivated species of the genus Allium, especially it has been valued because of the pungent flavor and aroma. Allium species including onion has very large genome sizes ranging from approximately 10 to 20 Gbp, which have complicated genomic studies and precluded genome sequencing until recently. A population of 186 F2 individuals derived from a cross of 'Umjinara' ${\times}$ 'Sinsunhwang' and the two parental lines were used for this study. For the development of framework map, various types of markers including SSRs, RAPD, SNPs, and CAPS makers have been used for polymorphism test. Especially, a lot of SNP and CAPS loci were developed from the onion transcriptome sequence by RNASEQ of two parental lines. The GBS libraries have been constructed based on a modified protocol from Poland Lab using a two-enzyme system. We have been developing markers showing polymorphism between two parental lines, and genotyping for all F2 individuals were finished for a number of polymorphic markers. For the construction of GBS libraries, a set of 192 barcoded adapters were generated from complementary oligonucleotides with XhoI overhang sequence and unique barcodes of length 4-8 bp and they have been tested using two parental linesto determine the optimum conditions for GBS analysis.

  • PDF

Identification of Plant Viruses Infecting Pear Using RNA Sequencing

  • Kim, Nam-Yeon;Lee, Hyo-Jeong;Kim, Hong-Sup;Lee, Su-Heon;Moon, Jae-Sun;Jeong, Rae-Dong
    • The Plant Pathology Journal
    • /
    • v.37 no.3
    • /
    • pp.258-267
    • /
    • 2021
  • Asian pear (Pyrus pyrifolia) is a widely cultivated and commercially important fruit crop, which is occasionally subject to severe economic losses due to latent viral infections. Thus, the aim of the present study was to examine and provide a comprehensive overview of virus populations infecting a major pear cultivar ('Singo') in Korea. From June 2017 to October 2019, leaf samples (n = 110) of pear trees from 35 orchards in five major pear-producing regions were collected and subjected to RNA sequencing. Most virus-associated contigs matched the sequences of known viruses, including apple stem grooving virus (ASGV) and apple stem pitting virus (ASPV). However, some contigs matched the sequences of apple green crinkle-associated virus and cucumber mosaic virus. In addition, three complete or nearly complete genomes were constructed based on transcriptome data and subjected to phylogenetic analyses. Based on the number of virus-associated reads, ASGV and ASPV were identified as the dominant viruses of 'Singo.' The present study describes the virome of a major pear cultivar in Korea, and looks into the diversity of viral communities in this cultivar. This study can provide valuable information on the complexity of genetic variability of viruses infecting pear trees.

Comparative Transcriptome Analysis of Caryophyllene-Treated Helicobacter pylori

  • Woo, Hyun Jun;Yang, Ji Yeong;Kwon, Hye Jin;Kim, Hyun Woo;Kim, Sa-Hyun;Kim, Jong-Bae
    • Microbiology and Biotechnology Letters
    • /
    • v.49 no.3
    • /
    • pp.440-448
    • /
    • 2021
  • Helicobacter pylori (H. pylori) establishes long-term infections associated with severe gastric diseases such as peptic ulceration and gastric cancer. Exposure to an antibacterial agent can help regulate the expression levels of its pathogenic genes. In this study, we analyzed the transcriptional changes in H. pylori genes induced by β-caryophyllene. We used next-generation sequencing (NGS) to analyze RNA expression changes, and reverse transcription-polymerase chain reaction (RT-PCR) was performed as required to verify the results. The NGS results showed that 30 out of 1,632 genes were expressed differentially by β-caryophyllene treatment. Eleven genes associated with DNA replication, virulence factors, and T4SS components were significantly downregulated. RT-PCR confirmed that treatment reduced the expression levels of 11 genes. RT-PCR showed the reduced expression of 11 genes (dnaE, dnaN, holB, gyrA, cagA, vacA, secA, flgE, virB2, virB4, and virB8) following β-caryophyllene treatment. These results suggest that β-caryophyllene can modulate various H. pylori pathogenic determinants and be a potential therapeutic agent for H. pylori infection.

Single-Cell Toolkits Opening a New Era for Cell Engineering

  • Lee, Sean;Kim, Jireh;Park, Jong-Eun
    • Molecules and Cells
    • /
    • v.44 no.3
    • /
    • pp.127-135
    • /
    • 2021
  • Since the introduction of RNA sequencing (RNA-seq) as a high-throughput mRNA expression analysis tool, this procedure has been increasingly implemented to identify cell-level transcriptome changes in a myriad of model systems. However, early methods processed cell samples in bulk, and therefore the unique transcriptomic patterns of individual cells would be lost due to data averaging. Nonetheless, the recent and continuous development of new single-cell RNA sequencing (scRNA-seq) toolkits has enabled researchers to compare transcriptomes at a single-cell resolution, thus facilitating the analysis of individual cellular features and a deeper understanding of cellular functions. Nonetheless, the rapid evolution of high throughput single-cell "omics" tools has created the need for effective hypothesis verification strategies. Particularly, this issue could be addressed by coupling cell engineering techniques with single-cell sequencing. This approach has been successfully employed to gain further insights into disease pathogenesis and the dynamics of differentiation trajectories. Therefore, this review will discuss the current status of cell engineering toolkits and their contributions to single-cell and genome-wide data collection and analyses.

Antimicrobial Activity of the Synthetic Peptide Scolopendrasin II from the Centipede Scolopendra subspinipes mutilans

  • Kwon, Young-Nam;Lee, Joon Ha;Kim, In-Woo;Kim, Sang-Hee;Yun, Eun-Young;Nam, Sung-Hee;Ahn, Mi-Young;Jeong, MiHye;Kang, Dong-Chul;Lee, In Hee;Hwang, Jae Sam
    • Journal of Microbiology and Biotechnology
    • /
    • v.23 no.10
    • /
    • pp.1381-1385
    • /
    • 2013
  • The centipede Scolopendra subpinipes mutilans is a medicinally important arthropod species. However, its transcriptome is not currently available and transcriptome analysis would be useful in providing insight into a molecular level approach. Hence, we performed de novo RNA sequencing of S. subpinipes mutilans using next-generation sequencing. We generated a novel peptide (scolopendrasin II) based on a SVM algorithm, and biochemically evaluated the in vitro antimicrobial activity of scolopendrasin II against various microbes. Scolopendrasin II showed antibacterial activities against gram-positive and -negative bacterial strains, including the yeast Candida albicans and antibiotic-resistant gram-negative bacteria, as determined by a radial diffusion assay and colony count assay without hemolytic activity. In addition, we confirmed that scolopendrasin II bound to the surface of bacteria through a specific interaction with lipoteichoic acid and a lipopolysaccharide, which was one of the bacterial cell-wall components. In conclusion, our results suggest that scolopendrasin II may be useful for developing peptide antibiotics.

Single Nucleotide Polymorphism Marker Discovery from Transcriptome Sequencing for Marker-assisted Backcrossing in Capsicum

  • Kang, Jin-Ho;Yang, Hee-Bum;Jeong, Hyeon-Seok;Choe, Phillip;Kwon, Jin-Kyung;Kang, Byoung-Cheorl
    • Horticultural Science & Technology
    • /
    • v.32 no.4
    • /
    • pp.535-543
    • /
    • 2014
  • Backcross breeding is the method most commonly used to introgress new traits into elite lines. Conventional backcross breeding requires at least 4-5 generations to recover the genomic background of the recurrent parent. Marker-assisted backcrossing (MABC) represents a new breeding approach that can substantially reduce breeding time and cost. For successful MABC, highly polymorphic markers with known positions in each chromosome are essential. Single nucleotide polymorphism (SNP) markers have many advantages over other marker systems for MABC due to their high abundance and amenability to genotyping automation. To facilitate MABC in hot pepper (Capsicum annuum), we utilized expressed sequence tags (ESTs) to develop SNP markers in this study. For SNP identification, we used Bukang $F_1$-hybrid pepper ESTs to prepare a reference sequence through de novo assembly. We performed large-scale transcriptome sequencing of eight accessions using the Illumina Genome Analyzer (IGA) IIx platform by Solexa, which generated small sequence fragments of about 90-100 bp. By aligning each contig to the reference sequence, 58,151 SNPs were identified. After filtering for polymorphism, segregation ratio, and lack of proximity to other SNPS or exon/intron boundaries, a total of 1,910 putative SNPs were chosen and positioned to a pepper linkage map. We further selected 412 SNPs evenly distributed on each chromosome and primers were designed for high throughput SNP assays and tested using a genetic diversity panel of 27 Capsicum accessions. The SNP markers clearly distinguished each accession. These results suggest that the SNP marker set developed in this study will be valuable for MABC, genetic mapping, and comparative genome analysis.