• Title/Summary/Keyword: single-cell RNA-sequencing data

Search Result 16, Processing Time 0.025 seconds

Variational Autoencoder Based Dimension Reduction and Clustering for Single-Cell RNA-seq Gene Expression (단일세포 RNA-SEQ의 유전자 발현 군집화를 위한 변이 자동인코더 기반의 차원감소와 군집화)

  • Chi, Sang-Mun
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.11
    • /
    • pp.1512-1518
    • /
    • 2021
  • Since single cell RNA sequencing provides the expression profiles of individual cells, it provides higher cellular differential resolution than traditional bulk RNA sequencing. Using these single cell RNA sequencing data, clustering analysis is generally conducted to find cell types and understand high level biological processes. In order to effectively process the high-dimensional single cell RNA sequencing data fir the clustering analysis, this paper uses a variational autoencoder to transform a high dimensional data space into a lower dimensional latent space, expecting to produce a latent space that can give more accurate clustering results. By clustering the features in the transformed latent space, we compare the performance of various classical clustering methods for single cell RNA sequencing data. Experimental results demonstrate that the proposed framework outperforms many state-of-the-art methods under various clustering performance metrics.

Dissecting Cellular Heterogeneity Using Single-Cell RNA Sequencing

  • Choi, Yoon Ha;Kim, Jong Kyoung
    • Molecules and Cells
    • /
    • v.42 no.3
    • /
    • pp.189-199
    • /
    • 2019
  • Cell-to-cell variability in gene expression exists even in a homogeneous population of cells. Dissecting such cellular heterogeneity within a biological system is a prerequisite for understanding how a biological system is developed, homeostatically regulated, and responds to external perturbations. Single-cell RNA sequencing (scRNA-seq) allows the quantitative and unbiased characterization of cellular heterogeneity by providing genome-wide molecular profiles from tens of thousands of individual cells. A major question in analyzing scRNA-seq data is how to account for the observed cell-to-cell variability. In this review, we provide an overview of scRNA-seq protocols, computational approaches for dissecting cellular heterogeneity, and future directions of single-cell transcriptomic analysis.

One-step spectral clustering of weighted variables on single-cell RNA-sequencing data (단세포 RNA 시퀀싱 데이터를 위한 가중변수 스펙트럼 군집화 기법)

  • Park, Min Young;Park, Seyoung
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.4
    • /
    • pp.511-526
    • /
    • 2020
  • Single-cell RNA-sequencing (scRNA-seq) data consists of each cell's RNA expression extracted from large populations of cells. One main purpose of using scRNA-seq data is to identify inter-cellular heterogeneity. However, scRNA-seq data pose statistical challenges when applying traditional clustering methods because they have many missing values and high level of noise due to technical and sampling issues. In this paper, motivated by analyzing scRNA-seq data, we propose a novel spectral-based clustering method by imposing different weights on genes when computing a similarity between cells. Assigning weights on genes and clustering cells are performed simultaneously in the proposed clustering framework. We solve the proposed non-convex optimization using an iterative algorithm. Both real data application and simulation study suggest that the proposed clustering method better identifies underlying clusters compared with existing clustering methods.

Single-Cell Toolkits Opening a New Era for Cell Engineering

  • Lee, Sean;Kim, Jireh;Park, Jong-Eun
    • Molecules and Cells
    • /
    • v.44 no.3
    • /
    • pp.127-135
    • /
    • 2021
  • Since the introduction of RNA sequencing (RNA-seq) as a high-throughput mRNA expression analysis tool, this procedure has been increasingly implemented to identify cell-level transcriptome changes in a myriad of model systems. However, early methods processed cell samples in bulk, and therefore the unique transcriptomic patterns of individual cells would be lost due to data averaging. Nonetheless, the recent and continuous development of new single-cell RNA sequencing (scRNA-seq) toolkits has enabled researchers to compare transcriptomes at a single-cell resolution, thus facilitating the analysis of individual cellular features and a deeper understanding of cellular functions. Nonetheless, the rapid evolution of high throughput single-cell "omics" tools has created the need for effective hypothesis verification strategies. Particularly, this issue could be addressed by coupling cell engineering techniques with single-cell sequencing. This approach has been successfully employed to gain further insights into disease pathogenesis and the dynamics of differentiation trajectories. Therefore, this review will discuss the current status of cell engineering toolkits and their contributions to single-cell and genome-wide data collection and analyses.

A semi-automatic cell type annotation method for single-cell RNA sequencing dataset

  • Kim, Wan;Yoon, Sung Min;Kim, Sangsoo
    • Genomics & Informatics
    • /
    • v.18 no.3
    • /
    • pp.26.1-26.6
    • /
    • 2020
  • Single-cell RNA sequencing (scRNA-seq) has been widely applied to provide insights into the cell-by-cell expression difference in a given bulk sample. Accordingly, numerous analysis methods have been developed. As it involves simultaneous analyses of many cell and genes, efficiency of the methods is crucial. The conventional cell type annotation method is laborious and subjective. Here we propose a semi-automatic method that calculates a normalized score for each cell type based on user-supplied cell type-specific marker gene list. The method was applied to a publicly available scRNA-seq data of mouse cardiac non-myocyte cell pool. Annotating the 35 t-stochastic neighbor embedding clusters into 12 cell types was straightforward, and its accuracy was evaluated by constructing co-expression network for each cell type. Gene Ontology analysis was congruent with the annotated cell type and the corollary regulatory network analysis showed upstream transcription factors that have well supported literature evidences. The source code is available as an R script upon request.

Integration of Single-Cell RNA-Seq Datasets: A Review of Computational Methods

  • Yeonjae Ryu;Geun Hee Han;Eunsoo Jung;Daehee Hwang
    • Molecules and Cells
    • /
    • v.46 no.2
    • /
    • pp.106-119
    • /
    • 2023
  • With the increased number of single-cell RNA sequencing (scRNA-seq) datasets in public repositories, integrative analysis of multiple scRNA-seq datasets has become commonplace. Batch effects among different datasets are inevitable because of differences in cell isolation and handling protocols, library preparation technology, and sequencing platforms. To remove these batch effects for effective integration of multiple scRNA-seq datasets, a number of methodologies have been developed based on diverse concepts and approaches. These methods have proven useful for examining whether cellular features, such as cell subpopulations and marker genes, identified from a certain dataset, are consistently present, or whether their condition-dependent variations, such as increases in cell subpopulations in particular disease-related conditions, are consistently observed in different datasets generated under similar or distinct conditions. In this review, we summarize the concepts and approaches of the integration methods and their pros and cons as has been reported in previous literature.

Recent advances in spatially resolved transcriptomics: challenges and opportunities

  • Lee, Jongwon;Yoo, Minsu;Choi, Jungmin
    • BMB Reports
    • /
    • v.55 no.3
    • /
    • pp.113-124
    • /
    • 2022
  • Single-cell RNA sequencing (scRNA-seq) has greatly advanced our understanding of cellular heterogeneity by profiling individual cell transcriptomes. However, cell dissociation from the tissue structure causes a loss of spatial information, which hinders the identification of intercellular communication networks and global transcriptional patterns present in the tissue architecture. To overcome this limitation, novel transcriptomic platforms that preserve spatial information have been actively developed. Significant achievements in imaging technologies have enabled in situ targeted transcriptomic profiling in single cells at single-molecule resolution. In addition, technologies based on mRNA capture followed by sequencing have made possible profiling of the genome-wide transcriptome at the 55-100 ㎛ resolution. Unfortunately, neither imaging-based technology nor capture-based method elucidates a complete picture of the spatial transcriptome in a tissue. Therefore, addressing specific biological questions requires balancing experimental throughput and spatial resolution, mandating the efforts to develop computational algorithms that are pivotal to circumvent technology-specific limitations. In this review, we focus on the current state-of-the-art spatially resolved transcriptomic technologies, describe their applications in a variety of biological domains, and explore recent discoveries demonstrating their enormous potential in biomedical research. We further highlight novel integrative computational methodologies with other data modalities that provide a framework to derive biological insight into heterogeneous and complex tissue organization.

Identification of ERBB pathway-activated cells in triple-negative breast cancer

  • Cho, Soo Young
    • Genomics & Informatics
    • /
    • v.17 no.1
    • /
    • pp.3.1-3.4
    • /
    • 2019
  • Intratumor heterogeneity within a single tumor mass is one of the hallmarks of malignancy and has been reported in various tumor types. The molecular characterization of intratumor heterogeneity in breast cancer is a significant challenge for effective treatment. Using single-cell RNA sequencing (RNA-seq) data from a public resource, an ERBB pathway activated triple-negative cell population was identified. The differential expression of three subtyping marker genes (ERBB2, ESR1, and PGR) was not changed in the bulk RNA-seq data, but the single-cell transcriptomes showed intratumor heterogeneity. This result shows that ERBB signaling is activated using an indirect route and that the molecular subtype is changed on a single-cell level. Our data propose a different view on breast cancer subtypes, clarifying much confusion in this field and contributing to precision medicine.

Transcriptional Heterogeneity of Cellular Senescence in Cancer

  • Junaid, Muhammad;Lee, Aejin;Kim, Jaehyung;Park, Tae Jun;Lim, Su Bin
    • Molecules and Cells
    • /
    • v.45 no.9
    • /
    • pp.610-619
    • /
    • 2022
  • Cellular senescence plays a paradoxical role in tumorigenesis through the expression of diverse senescence-associated (SA) secretory phenotypes (SASPs). The heterogeneity of SA gene expression in cancer cells not only promotes cancer stemness but also protects these cells from chemotherapy. Despite the potential correlation between cancer and SA biomarkers, many transcriptional changes across distinct cell populations remain largely unknown. During the past decade, single-cell RNA sequencing (scRNA-seq) technologies have emerged as powerful experimental and analytical tools to dissect such diverse senescence-derived transcriptional changes. Here, we review the recent sequencing efforts that successfully characterized scRNA-seq data obtained from diverse cancer cells and elucidated the role of senescent cells in tumor malignancy. We further highlight the functional implications of SA genes expressed specifically in cancer and stromal cell populations in the tumor microenvironment. Translational research leveraging scRNA-seq profiling of SA genes will facilitate the identification of novel expression patterns underlying cancer susceptibility, providing new therapeutic opportunities in the era of precision medicine.

Functional annotation of lung cancer-associated genetic variants by cell type-specific epigenome and long-range chromatin interactome

  • Lee, Andrew J.;Jung, Inkyung
    • Genomics & Informatics
    • /
    • v.19 no.1
    • /
    • pp.3.1-3.12
    • /
    • 2021
  • Functional interpretation of noncoding genetic variants associated with complex human diseases and traits remains a challenge. In an effort to enhance our understanding of common germline variants associated with lung cancer, we categorize regulatory elements based on eight major cell types of human lung tissue. Our results show that 21.68% of lung cancer-associated risk variants are linked to noncoding regulatory elements, nearly half of which are cell type-specific. Integrative analysis of high-resolution long-range chromatin interactome maps and single-cell RNA-sequencing data of lung tumors uncovers number of putative target genes of these variants and functionally relevant cell types, which display a potential biological link to cancer susceptibility. The present study greatly expands the scope of functional annotation of lung cancer-associated genetic risk factors and dictates probable cell types involved in lung carcinogenesis.