• Title/Summary/Keyword: Gene Set Enrichment Analysis (GSEA)

Search Result 17, Processing Time 0.022 seconds

NGSEA: Network-Based Gene Set Enrichment Analysis for Interpreting Gene Expression Phenotypes with Functional Gene Sets

  • Han, Heonjong;Lee, Sangyoung;Lee, Insuk
    • Molecules and Cells
    • /
    • v.42 no.8
    • /
    • pp.579-588
    • /
    • 2019
  • Gene set enrichment analysis (GSEA) is a popular tool to identify underlying biological processes in clinical samples using their gene expression phenotypes. GSEA measures the enrichment of annotated gene sets that represent biological processes for differentially expressed genes (DEGs) in clinical samples. GSEA may be suboptimal for functional gene sets; however, because DEGs from the expression dataset may not be functional genes per se but dysregulated genes perturbed by bona fide functional genes. To overcome this shortcoming, we developed network-based GSEA (NGSEA), which measures the enrichment score of functional gene sets using the expression difference of not only individual genes but also their neighbors in the functional network. We found that NGSEA outperformed GSEA in identifying pathway gene sets for matched gene expression phenotypes. We also observed that NGSEA substantially improved the ability to retrieve known anti-cancer drugs from patient-derived gene expression data using drug-target gene sets compared with another method, Connectivity Map. We also repurposed FDA-approved drugs using NGSEA and experimentally validated budesonide as a chemical with anti-cancer effects for colorectal cancer. We, therefore, expect that NGSEA will facilitate both pathway interpretation of gene expression phenotypes and anti-cancer drug repositioning. NGSEA is freely available at www.inetbio.org/ngsea.

Identifying Statistically Significant Gene-Sets by Gene Set Enrichment Analysis Using Fisher Criterion (Fisher Criterion을 이용한 Gene Set Enrichment Analysis 기반 유의 유전자 집합의 검출 방법 연구)

  • Kim, Jae-Young;Shin, Mi-Young
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.45 no.4
    • /
    • pp.19-26
    • /
    • 2008
  • Gene set enrichment analysis (GSEA) is a computational method to identify statistically significant gene sets showing significant differences between two groups of microarray expression profiles and simultaneously uncover their biological meanings in an elegant way by employing gene annotation databases, such as Cytogenetic Band, KEGG pathways, gene ontology, and etc. For the gone set enrichment analysis, all the genes in a given dataset are first ordered by the signal-to-noise ratio between the groups and then further analyses are proceeded. Despite of its impressive results in several previous studies, however, gene ranking by the signal-to-noise ratio makes it difficult to consider highly up-regulated genes and highly down-regulated genes at the same time as the candidates of significant genes, which possibly reflect certain situations incurred in metabolic and signaling pathways. To deal with this problem, in this article, we investigate the gene set enrichment analysis method with Fisher criterion for gene ranking and also evaluate its effects in Leukemia related pathway analyses.

Discovery of Cellular RhoA Functions by the Integrated Application of Gene Set Enrichment Analysis

  • Chun, Kwang-Hoon
    • Biomolecules & Therapeutics
    • /
    • v.30 no.1
    • /
    • pp.98-116
    • /
    • 2022
  • The small GTPase RhoA has been studied extensively for its role in actin dynamics. In this study, multiple bioinformatics tools were applied cooperatively to the microarray dataset GSE64714 to explore previously unidentified functions of RhoA. Comparative gene expression analysis revealed 545 differentially expressed genes in RhoA-null cells versus controls. Gene set enrichment analysis (GSEA) was conducted with three gene set collections: (1) the hallmark, (2) the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway, and (3) the Gene Ontology Biological Process. GSEA results showed that RhoA is related strongly to diverse pathways: cell cycle/growth, DNA repair, metabolism, keratinization, response to fungus, and vesicular transport. These functions were verified by heatmap analysis, KEGG pathway diagramming, and direct acyclic graphing. The use of multiple gene set collections restricted the leakage of information extracted. However, gene sets from individual collections are heterogenous in gene element composition, number, and the contextual meaning embraced in names. Indeed, there was a limit to deriving functions with high accuracy and reliability simply from gene set names. The comparison of multiple gene set collections showed that although the gene sets had similar names, the gene elements were extremely heterogeneous. Thus, the type of collection chosen and the analytical context influence the interpretation of GSEA results. Nonetheless, the analyses of multiple collections made it possible to derive robust and consistent function identifications. This study confirmed several well-described roles of RhoA and revealed less explored functions, suggesting future research directions.

Identification of key genes and functional enrichment analysis of liver fibrosis in nonalcoholic fatty liver disease through weighted gene co-expression network analysis

  • Yue Hu;Jun Zhou
    • Genomics & Informatics
    • /
    • v.21 no.4
    • /
    • pp.45.1-45.11
    • /
    • 2023
  • Nonalcoholic fatty liver disease (NAFLD) is a common type of chronic liver disease, with severity levels ranging from nonalcoholic fatty liver to nonalcoholic steatohepatitis (NASH). The extent of liver fibrosis indicates the severity of NASH and the risk of liver cancer. However, the mechanism underlying NASH development, which is important for early screening and intervention, remains unclear. Weighted gene co-expression network analysis (WGCNA) is a useful method for identifying hub genes and screening specific targets for diseases. In this study, we utilized an mRNA dataset of the liver tissues of patients with NASH and conducted WGCNA for various stages of liver fibrosis. Subsequently, we employed two additional mRNA datasets for validation purposes. Gene set enrichment analysis (GSEA) was conducted to analyze gene function enrichment. Through WGCNA and subsequent analyses, complemented by validation using two additional datasets, we identified five genes (BICC1, C7, EFEMP1, LUM, and STMN2) as hub genes. GSEA analysis indicated that gene sets associated with liver metabolism and cholesterol homeostasis were uniformly downregulated. BICC1, C7, EFEMP1, LUM, and STMN2 were identified as hub genes of NASH, and were all related to liver metabolism, NAFLD, NASH, and related diseases. These hub genes might serve as potential targets for the early screening and treatment of NASH.

Comparison of Invariant NKT Cells with Conventional T Cells by Using Gene Set Enrichment Analysis (GSEA)

  • Oh, Sae-Jin;Ahn, Ji-Ye;Chung, Doo-Hyun
    • IMMUNE NETWORK
    • /
    • v.11 no.6
    • /
    • pp.406-411
    • /
    • 2011
  • Background: Invariant Natural killer T (iNKT) cells, a distinct subset of CD1d-restricted T cells with invariant $V{\alpha}{\beta}$ TCR, functionally bridge innate and adaptive immunity. While iNKT cells share features with conventional T cells in some functional aspects, they simultaneously produce large amount of Th1 and Th2 cytokines upon T-cell receptor (TCR) ligation. However, gene expression pattern in two types of cells has not been well characterized. Methods: we performed comparative microarray analyses of gene expression in murine iNKT cells and conventional $CD4^+CD25^-$ ${\gamma}{\delta}TCR^-$ T cells by using Gene Set Enrichment Analysis (GSEA) method. Results: Here, we describe profound differences in gene expression pattern between iNKT cells and conventional $CD4^+CD25^-$ ${\gamma}{\delta}TCR^-$ T cells. Conclusion: Our results provide new insights into the functional competence of iNKT cells and a better understanding of their various roles during immune responses.

Analysis of gene expression during odontogenic differentiation of cultured human dental pulp cells

  • Seo, Min-Seock;Hwang, Kyung-Gyun;Kim, Hyong-Bum;Baek, Seung-Ho
    • Restorative Dentistry and Endodontics
    • /
    • v.37 no.3
    • /
    • pp.142-148
    • /
    • 2012
  • Objectives: We analyzed gene-expression profiles after 14 day odontogenic induction of human dental pulp cells (DPCs) using a DNA microarray and sought candidate genes possibly associated with mineralization. Materials and Methods: Induced human dental pulp cells were obtained by culturing DPCs in odontogenic induction medium (OM) for 14 day. Cells exposed to normal culture medium were used as controls. Total RNA was extracted from cells and analyzed by microarray analysis and the key results were confirmed selectively by reverse-transcriptase polymerase chain reaction (RT-PCR). We also performed a gene set enrichment analysis (GSEA) of the microarray data. Results: Six hundred and five genes among the 47,320 probes on the BeadChip differed by a factor of more than two-fold in the induced cells. Of these, 217 genes were upregulated, and 388 were down-regulated. GSEA revealed that in the induced cells, genes implicated in Apoptosis and Signaling by wingless MMTV integration (Wnt) were significantly upregulated. Conclusions: Genes implicated in Apoptosis and Signaling by Wnt are highly connected to the differentiation of dental pulp cells into odontoblast.

Developing a Parametric Method for Testing the Significance of Gene Sets in Microarray Data Analysis (마이크로어레이 자료분석에서 모수적 방법을 이용한 유전자군의 유의성 검정)

  • Lee, Sun-Ho;Lee, Seung-Kyu;Lee, Kwang-Hyun
    • Communications for Statistical Applications and Methods
    • /
    • v.16 no.3
    • /
    • pp.397-408
    • /
    • 2009
  • The development of microarray technology makes possible to analyse many thousands of genes simultaneously. While it is important to test each gene whether it shows changes in expression associated with a phenotype, human diseases are thought to occur through the interactions of multiple genes within a same functional cafe-gory. Recent research interests aims to directly test the behavior of sets of functionally related genes, instead of focusing on single genes. Gene set enrichment analysis(GSEA), significance analysis of microarray to gene-set analysis(SAM-GS) and parametric analysis of gene set enrichment(PAGE) have been applied widely as a tool for gene-set analyses. We describe their problems and propose an alternative method using a parametric analysis by adopting normal score transformation of gene expression values. Performance of the newly derived method is compared with previous methods on three real microarray datasets.

Identification of genomic regions and genes associated with subclinical ketosis in periparturient dairy cows

  • Jihwan Lee;KwangHyeon Cho;Kent A. Weigel;Heather M. White;ChangHee Do;Inchul Choi
    • Journal of Animal Science and Technology
    • /
    • v.66 no.3
    • /
    • pp.567-576
    • /
    • 2024
  • Subclinical ketosis (SCK) is a prevalent metabolic disorder that occurs during the transition to lactation period. It is defined as a high blood concentration of ketone bodies (beta-hydroxybutyric acid f ≥ 1.2 mmol/L) within the first few weeks of lactation, and often presents without clinical signs. SCK is mainly caused by negative energy balance (NEB). The objective of this study is to identify single nucleotide polymorphisms (SNPs) associated with SCK using genome-wide association studies (GWAS), and to predict the biological functions of proximal genes using gene-set enrichment analysis (GSEA). Blood samples were collected from 112 Holstein cows between 5 and 18 days postpartum to determine the incidence of SCK. Genomic DNA extracted from both SCK and healthy cows was examined using the Illumina Bovine SNP50K BeadChip for genotyping. GWAS revealed 194 putative SNPs and 163 genes associated with those SNPs. Additionally, GSEA showed that the genes retrieved by Database for Annotation, Visualization, and Integrated Discovery (DAVID) belonged to calcium signaling, starch and sucrose, immune network, and metabolic pathways. Furthermore, the proximal genes were found to be related to germ cell and early embryo development. In summary, this study proposes several feasible SNPs and genes associated with SCK through GWAS and GSEA. These candidates can be utilized in selective breeding programs to reduce the genetic risk for SCK and subfertility in high-performance dairy cows.

Deep Learning Approach Based on Transcriptome Profile for Data Driven Drug Discovery

  • Eun-Ji Kwon;Hyuk-Jin Cha
    • Molecules and Cells
    • /
    • v.46 no.1
    • /
    • pp.65-67
    • /
    • 2023
  • SMILES (simplified molecular-input line-entry system) information of small molecules parsed by one-hot array is passed to a convolutional neural network called black box. Outputs data representing a gene signature is then matched to the genetic signature of a disease to predict the appropriate small molecule. Efficacy of the predicted small molecules is examined by in vivo animal models. GSEA, gene set enrichment analysis.

HPAI-resistant Ri chickens exhibit elevated antiviral immune-related gene expression

  • Thi Hao Vu;Jubi Heo;Yeojin Hong;Suyeon Kang;Ha Thi Thanh Tran;Hoang Vu Dang;Anh Duc Truong;Yeong Ho Hong
    • Journal of Veterinary Science
    • /
    • v.24 no.1
    • /
    • pp.13.1-13.11
    • /
    • 2023
  • Background: Highly pathogenic avian influenza viruses (HPAIVs) is an extremely contagious and high mortality rates in chickens resulting in substantial economic impact on the poultry sector. Therefore, it is necessary to elucidate the pathogenic mechanism of HPAIV for infection control. Objective: Gene set enrichment analysis (GSEA) can effectively avoid the limitations of subjective screening for differential gene expression. Therefore, we performed GSEA to compare HPAI-infected resistant and susceptible Ri chicken lines. Methods: The Ri chickens Mx(A)/BF2(B21) were chosen as resistant, and the chickens Mx(G)/BF2(B13) were selected as susceptible by genotyping the Mx and BF2 genes. The tracheal tissues of HPAIV H5N1 infected chickens were collected for RNA sequencing followed by GSEA analysis to define gene subsets to elucidate the sequencing results. Results: We identified four differentially expressed pathways, which were immune-related pathways with a total of 78 genes. The expression levels of cytokines (IL-1β, IL-6, IL-12), chemokines (CCL4 and CCL5), type interferons and their receptors (IFN-β, IFNAR1, IFNAR2, and IFNGR1), Jak-STAT signaling pathway genes (STAT1, STAT2, and JAK1), MHC class I and II and their co-stimulatory molecules (CD80, CD86, CD40, DMB2, BLB2, and B2M), and interferon stimulated genes (EIF2AK2 and EIF2AK1) in resistant chickens were higher than those in susceptible chickens. Conclusions: Resistant Ri chickens exhibit a stronger antiviral response to HPAIV H5N1 compared with susceptible chickens. Our findings provide insights into the immune responses of genetically disparate chickens against HPAIV.