• Title/Summary/Keyword: GSEA

Search Result 20, Processing Time 0.027 seconds

NGSEA: Network-Based Gene Set Enrichment Analysis for Interpreting Gene Expression Phenotypes with Functional Gene Sets

  • Han, Heonjong;Lee, Sangyoung;Lee, Insuk
    • Molecules and Cells
    • /
    • v.42 no.8
    • /
    • pp.579-588
    • /
    • 2019
  • Gene set enrichment analysis (GSEA) is a popular tool to identify underlying biological processes in clinical samples using their gene expression phenotypes. GSEA measures the enrichment of annotated gene sets that represent biological processes for differentially expressed genes (DEGs) in clinical samples. GSEA may be suboptimal for functional gene sets; however, because DEGs from the expression dataset may not be functional genes per se but dysregulated genes perturbed by bona fide functional genes. To overcome this shortcoming, we developed network-based GSEA (NGSEA), which measures the enrichment score of functional gene sets using the expression difference of not only individual genes but also their neighbors in the functional network. We found that NGSEA outperformed GSEA in identifying pathway gene sets for matched gene expression phenotypes. We also observed that NGSEA substantially improved the ability to retrieve known anti-cancer drugs from patient-derived gene expression data using drug-target gene sets compared with another method, Connectivity Map. We also repurposed FDA-approved drugs using NGSEA and experimentally validated budesonide as a chemical with anti-cancer effects for colorectal cancer. We, therefore, expect that NGSEA will facilitate both pathway interpretation of gene expression phenotypes and anti-cancer drug repositioning. NGSEA is freely available at www.inetbio.org/ngsea.

Discovery of Cellular RhoA Functions by the Integrated Application of Gene Set Enrichment Analysis

  • Chun, Kwang-Hoon
    • Biomolecules & Therapeutics
    • /
    • v.30 no.1
    • /
    • pp.98-116
    • /
    • 2022
  • The small GTPase RhoA has been studied extensively for its role in actin dynamics. In this study, multiple bioinformatics tools were applied cooperatively to the microarray dataset GSE64714 to explore previously unidentified functions of RhoA. Comparative gene expression analysis revealed 545 differentially expressed genes in RhoA-null cells versus controls. Gene set enrichment analysis (GSEA) was conducted with three gene set collections: (1) the hallmark, (2) the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway, and (3) the Gene Ontology Biological Process. GSEA results showed that RhoA is related strongly to diverse pathways: cell cycle/growth, DNA repair, metabolism, keratinization, response to fungus, and vesicular transport. These functions were verified by heatmap analysis, KEGG pathway diagramming, and direct acyclic graphing. The use of multiple gene set collections restricted the leakage of information extracted. However, gene sets from individual collections are heterogenous in gene element composition, number, and the contextual meaning embraced in names. Indeed, there was a limit to deriving functions with high accuracy and reliability simply from gene set names. The comparison of multiple gene set collections showed that although the gene sets had similar names, the gene elements were extremely heterogeneous. Thus, the type of collection chosen and the analytical context influence the interpretation of GSEA results. Nonetheless, the analyses of multiple collections made it possible to derive robust and consistent function identifications. This study confirmed several well-described roles of RhoA and revealed less explored functions, suggesting future research directions.

Performance Comparison of Two Gene Set Analysis Methods for Genome-wide Association Study Results: GSA-SNP vs i-GSEA4GWAS

  • Kwon, Ji-Sun;Kim, Ji-Hye;Nam, Doug-U;Kim, Sang-Soo
    • Genomics & Informatics
    • /
    • v.10 no.2
    • /
    • pp.123-127
    • /
    • 2012
  • Gene set analysis (GSA) is useful in interpreting a genome-wide association study (GWAS) result in terms of biological mechanism. We compared the performance of two different GSA implementations that accept GWAS p-values of single nucleotide polymorphisms (SNPs) or gene-by-gene summaries thereof, GSA-SNP and i-GSEA4GWAS, under the same settings of inputs and parameters. GSA runs were made with two sets of p-values from a Korean type 2 diabetes mellitus GWAS study: 259,188 and 1,152,947 SNPs of the original and imputed genotype datasets, respectively. When Gene Ontology terms were used as gene sets, i-GSEA4GWAS produced 283 and 1,070 hits for the unimputed and imputed datasets, respectively. On the other hand, GSA-SNP reported 94 and 38 hits, respectively, for both datasets. Similar, but to a lesser degree, trends were observed with Kyoto Encyclopedia of Genes and Genomes (KEGG) gene sets as well. The huge number of hits by i-GSEA4GWAS for the imputed dataset was probably an artifact due to the scaling step in the algorithm. The decrease in hits by GSA-SNP for the imputed dataset may be due to the fact that it relies on Z-statistics, which is sensitive to variations in the background level of associations. Judicious evaluation of the GSA outcomes, perhaps based on multiple programs, is recommended.

Gene Set Analysis - Absolute and Trim (절대치와 절삭을 이용한 유전자 집단 분석)

  • Lee, Kwang-Hyun;Lee, Sun-Ho
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.3
    • /
    • pp.523-535
    • /
    • 2008
  • Initial work of microarray data analysis focused on identification of differentially expressed genes, and recently, the focus has moved to discovering significant sets of functionally related genes. We describe some problems of GSEA and PAGE, and propose a modified method to identify significant gene sets. The results based on a simulated experiment and real data analysis using a set of publicly available data show the superiority of the newly proposed method, GSA-AT, in detecting significant pathways with the accurate prediction.

Identifying Statistically Significant Gene-Sets by Gene Set Enrichment Analysis Using Fisher Criterion (Fisher Criterion을 이용한 Gene Set Enrichment Analysis 기반 유의 유전자 집합의 검출 방법 연구)

  • Kim, Jae-Young;Shin, Mi-Young
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.45 no.4
    • /
    • pp.19-26
    • /
    • 2008
  • Gene set enrichment analysis (GSEA) is a computational method to identify statistically significant gene sets showing significant differences between two groups of microarray expression profiles and simultaneously uncover their biological meanings in an elegant way by employing gene annotation databases, such as Cytogenetic Band, KEGG pathways, gene ontology, and etc. For the gone set enrichment analysis, all the genes in a given dataset are first ordered by the signal-to-noise ratio between the groups and then further analyses are proceeded. Despite of its impressive results in several previous studies, however, gene ranking by the signal-to-noise ratio makes it difficult to consider highly up-regulated genes and highly down-regulated genes at the same time as the candidates of significant genes, which possibly reflect certain situations incurred in metabolic and signaling pathways. To deal with this problem, in this article, we investigate the gene set enrichment analysis method with Fisher criterion for gene ranking and also evaluate its effects in Leukemia related pathway analyses.

HPAI-resistant Ri chickens exhibit elevated antiviral immune-related gene expression

  • Thi Hao Vu;Jubi Heo;Yeojin Hong;Suyeon Kang;Ha Thi Thanh Tran;Hoang Vu Dang;Anh Duc Truong;Yeong Ho Hong
    • Journal of Veterinary Science
    • /
    • v.24 no.1
    • /
    • pp.13.1-13.11
    • /
    • 2023
  • Background: Highly pathogenic avian influenza viruses (HPAIVs) is an extremely contagious and high mortality rates in chickens resulting in substantial economic impact on the poultry sector. Therefore, it is necessary to elucidate the pathogenic mechanism of HPAIV for infection control. Objective: Gene set enrichment analysis (GSEA) can effectively avoid the limitations of subjective screening for differential gene expression. Therefore, we performed GSEA to compare HPAI-infected resistant and susceptible Ri chicken lines. Methods: The Ri chickens Mx(A)/BF2(B21) were chosen as resistant, and the chickens Mx(G)/BF2(B13) were selected as susceptible by genotyping the Mx and BF2 genes. The tracheal tissues of HPAIV H5N1 infected chickens were collected for RNA sequencing followed by GSEA analysis to define gene subsets to elucidate the sequencing results. Results: We identified four differentially expressed pathways, which were immune-related pathways with a total of 78 genes. The expression levels of cytokines (IL-1β, IL-6, IL-12), chemokines (CCL4 and CCL5), type interferons and their receptors (IFN-β, IFNAR1, IFNAR2, and IFNGR1), Jak-STAT signaling pathway genes (STAT1, STAT2, and JAK1), MHC class I and II and their co-stimulatory molecules (CD80, CD86, CD40, DMB2, BLB2, and B2M), and interferon stimulated genes (EIF2AK2 and EIF2AK1) in resistant chickens were higher than those in susceptible chickens. Conclusions: Resistant Ri chickens exhibit a stronger antiviral response to HPAIV H5N1 compared with susceptible chickens. Our findings provide insights into the immune responses of genetically disparate chickens against HPAIV.

Comparison of Invariant NKT Cells with Conventional T Cells by Using Gene Set Enrichment Analysis (GSEA)

  • Oh, Sae-Jin;Ahn, Ji-Ye;Chung, Doo-Hyun
    • IMMUNE NETWORK
    • /
    • v.11 no.6
    • /
    • pp.406-411
    • /
    • 2011
  • Background: Invariant Natural killer T (iNKT) cells, a distinct subset of CD1d-restricted T cells with invariant $V{\alpha}{\beta}$ TCR, functionally bridge innate and adaptive immunity. While iNKT cells share features with conventional T cells in some functional aspects, they simultaneously produce large amount of Th1 and Th2 cytokines upon T-cell receptor (TCR) ligation. However, gene expression pattern in two types of cells has not been well characterized. Methods: we performed comparative microarray analyses of gene expression in murine iNKT cells and conventional $CD4^+CD25^-$ ${\gamma}{\delta}TCR^-$ T cells by using Gene Set Enrichment Analysis (GSEA) method. Results: Here, we describe profound differences in gene expression pattern between iNKT cells and conventional $CD4^+CD25^-$ ${\gamma}{\delta}TCR^-$ T cells. Conclusion: Our results provide new insights into the functional competence of iNKT cells and a better understanding of their various roles during immune responses.

Analysis of gene expression during odontogenic differentiation of cultured human dental pulp cells

  • Seo, Min-Seock;Hwang, Kyung-Gyun;Kim, Hyong-Bum;Baek, Seung-Ho
    • Restorative Dentistry and Endodontics
    • /
    • v.37 no.3
    • /
    • pp.142-148
    • /
    • 2012
  • Objectives: We analyzed gene-expression profiles after 14 day odontogenic induction of human dental pulp cells (DPCs) using a DNA microarray and sought candidate genes possibly associated with mineralization. Materials and Methods: Induced human dental pulp cells were obtained by culturing DPCs in odontogenic induction medium (OM) for 14 day. Cells exposed to normal culture medium were used as controls. Total RNA was extracted from cells and analyzed by microarray analysis and the key results were confirmed selectively by reverse-transcriptase polymerase chain reaction (RT-PCR). We also performed a gene set enrichment analysis (GSEA) of the microarray data. Results: Six hundred and five genes among the 47,320 probes on the BeadChip differed by a factor of more than two-fold in the induced cells. Of these, 217 genes were upregulated, and 388 were down-regulated. GSEA revealed that in the induced cells, genes implicated in Apoptosis and Signaling by wingless MMTV integration (Wnt) were significantly upregulated. Conclusions: Genes implicated in Apoptosis and Signaling by Wnt are highly connected to the differentiation of dental pulp cells into odontoblast.

Identification of key genes and functional enrichment analysis of liver fibrosis in nonalcoholic fatty liver disease through weighted gene co-expression network analysis

  • Yue Hu;Jun Zhou
    • Genomics & Informatics
    • /
    • v.21 no.4
    • /
    • pp.45.1-45.11
    • /
    • 2023
  • Nonalcoholic fatty liver disease (NAFLD) is a common type of chronic liver disease, with severity levels ranging from nonalcoholic fatty liver to nonalcoholic steatohepatitis (NASH). The extent of liver fibrosis indicates the severity of NASH and the risk of liver cancer. However, the mechanism underlying NASH development, which is important for early screening and intervention, remains unclear. Weighted gene co-expression network analysis (WGCNA) is a useful method for identifying hub genes and screening specific targets for diseases. In this study, we utilized an mRNA dataset of the liver tissues of patients with NASH and conducted WGCNA for various stages of liver fibrosis. Subsequently, we employed two additional mRNA datasets for validation purposes. Gene set enrichment analysis (GSEA) was conducted to analyze gene function enrichment. Through WGCNA and subsequent analyses, complemented by validation using two additional datasets, we identified five genes (BICC1, C7, EFEMP1, LUM, and STMN2) as hub genes. GSEA analysis indicated that gene sets associated with liver metabolism and cholesterol homeostasis were uniformly downregulated. BICC1, C7, EFEMP1, LUM, and STMN2 were identified as hub genes of NASH, and were all related to liver metabolism, NAFLD, NASH, and related diseases. These hub genes might serve as potential targets for the early screening and treatment of NASH.

Deep Learning Approach Based on Transcriptome Profile for Data Driven Drug Discovery

  • Eun-Ji Kwon;Hyuk-Jin Cha
    • Molecules and Cells
    • /
    • v.46 no.1
    • /
    • pp.65-67
    • /
    • 2023
  • SMILES (simplified molecular-input line-entry system) information of small molecules parsed by one-hot array is passed to a convolutional neural network called black box. Outputs data representing a gene signature is then matched to the genetic signature of a disease to predict the appropriate small molecule. Efficacy of the predicted small molecules is examined by in vivo animal models. GSEA, gene set enrichment analysis.